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writer applied to cross-cultural research 
a technique for eliciting preference for 
immediate, smaller as opposed to delayed, 
larger reinforcements (in the form of candy 
varying in size, with the smaller obtainable 
immediately and the latter requiring a week’s 
waiting period). Trinidadian children aged 
7 to 9 were the subjects. The main finding was 
a significant relationship between father 
absence and preference for immediate rein- 
forcement. The present study is designed to 
explore some further antecedents and correlates 
of such choice preferences. Again the subjects 
were drawn from the Trinidadian culture, and 
all testing was conducted within the field 
situation. 

Numerous nonexperimental references have 
been made to the importance of tolerance or 
preference for delayed rewards, and the 
necessity for reward delay in a multitude of 
complex human situations has long been 
recognized: it is reflected in such theoretical 
formulations as the “pleasure principle” and 
the “reality principle” (Freud, 1922). ““Psycho- 
pathy,” immaturity, and criminal and neurotic 
behavior have been seen as at least partially 
explicable in terms of the inability to postpone 
immediate gratification for the sake of delayed 
rewards (e.g., Mowrer & Ullman, 1945). In 
spite of this, there exists a relative paucity of 
directly relevant experimentation. Recently, 
a number of investigators have turned to the 
Study of “delay capacity” (Singer, 1955), 
primarily as inferred from the Human Move- 
Ment response (M) on the Rorschach, and its 
behavioral correlates (e.g., Levine, Glass, & 
Meltzoff, 1957; Levine, Spivack, & Wight, 
1959; Meltzoff & Levine, 1954; Singer, 
Wilensky, & McCraven, 1956; Spivack, 


I an earlier study (Mischel, 1958) the 


1 This study was carried out under the support of a 
grant from the Laboratory of Social Relations, Harvard 
niversity. 
z ? Grateful acknowledgement is also due to the 
Fpritcation Office of the Government of Trinidad and 
Obago, and to the cooperation of its officials. 


PREFERENCE FOR DELAYED REINFORCEMENT 
AND SOCIAL RESPONSIBILITY: 


WALTER MISCHEL? 
Harvard University 


Levine, & Sprigle, 1959), such as planfulness 
and intelligence. In contrast to these studies, 
the present research approaches the problem 
of delay of gratification by using direct choice 
measures of preference for immediate, smaller 
or delayed, larger rewards in particular choice 
situations and seeks to explore the antecedents 
and correlates of such choices. 

The major aim of the present report is to 
explore the relationship between preference 
for immediate, smaller as opposed to delayed, 
larger rewards and social responsibility. It was 
hypothesized that subjects preferring imme- 
diate, smaller rewards would show less social 
responsibility than subjects preferring larger, 
delayed: rewards. Social responsibility was 
determined by responses to a quite carefully 
constructed and independently validated 
Social Responsibility Scale (Harris, 1957; 
Harris, Clark, Rose, & Valasek, 1954) as well 
as by the behavioral criterion of committed 
delinquency. The Social Responsibility Scale 
(SRS) used has been found to correlate sub- 
stantially with other measures of personal and 
social adjustment and was designed to “dis- 
criminate children who have, with their peers, 
a reputation for responsibility as contrasted 
with children who have little reputation for 
responsibility” (Harris, 1957, p. 326). Harris’ 
theory of responsibility, to which the items in 
the scale were related, conceptualized responsi- 
bility. “as a composite of attitude elements 
reflecting behavior classifiable as reliable, 
accountable, loyal, or doing an effective job” 
(Harris, 1957, p. 322). 

In the present study it was predicted that 
the SRS would discriminate subjects varying 
in preference for immediate, smaller or 
delayed, larger rewards, with those choosing 
the delayed, larger rewards showing greater 
social responsibility. The underlying reasoning 
is as follows. Within the general framework of 
Rotter’s social learning theory (Rotter, 1954) 
the subject’s choice behavior may be taken asa 
function of his expectation that the reward will 
actually occur (i.e., that the particular promise 
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will be kept) and the reward value, or prefer- 
ence value, of the particular reinforcement. 
Previous research (Mahrer, 1956) has already 
shown that the subject’s expectancy that even 
after time delay reinforcement would follow 
from the social agent making the promise is an 
important variable in this kind of choice 
behavior. The orientation of the present 
research similarly emphasizes the subject’s 
expectancy for reinforcement to follow as a 
function of making a particular choice with a 
particular social agent in a given situation, 
rather than the time interval itself. In common 
sense terms, this kind of choice behavior may 
be thought of as “trust” or the belief that the 
agent promising delayed reinforcement will 
actually supply it. 

Such trust or willingness to postpone imme- 
diate smaller gratification for the sake of a 
later but larger reward in a choice situation 
may be regarded as a basic ingredient of 
social responsibility and as such should be 
(a) positively related to social responsibility 
Scores as elicited by the SRS, and (6) negatively 
related to delinquent or criminal behavior. 
The SRS in turn, having been correlated with 
a series of measures of adjustment and ma- 
turity, may well supply a gross index of a set 
of variables that could not be individually 
manipulated within the vicissitudes of the 
present field research situation, but the overall 
relationship of which to preference for delayed 
teward requires empirical examination. A 
third, and much more tentative and minor 
prediction was made, anticipating a positive 
relationship between accuracy in the recall 
of the time of a past event with preference for 
delayed reward. The measure of time recall 
accuracy was included with a part of the 
sample as a step in an exploratory attempt to 
ascertain relations between preference for 
delayed reward and time judgments, it being 
anticipated that subjects preferring the 
larger, delayed rewards would show greater 
accuracy. The rationale for this is too elaborate 
and tentative for presentation here (Epley & 
Ricks, 1959). However, since recent research 
(Spivack, Levine, & Sprigle, 1959) has demon- 
strated relations between “delay capacity” 
(measured primarily by the Rorschach M 
response) and time estimation scores, the 
Presently available data warrant reporting. 


The two major hypotheses are 
testing the linkage between delay of 
ment and the concepts of social i 
bility, “psychopathy,” and criminal 
whose inverse relationships to the abi 
delay immediate rewards for the sake o 
larger gains have often been assumed wil 
adequate demonstration. 


METHOD 


Subjects. A total of 206 ‘subjects, all 
Negro children aged 12, 13, and 14, with a 
of 13, were tested. Of these, 136 (78 boys, 
came from a large elementary government school 
on the outskirts of the capital city, and 70 boy: 
from a “boys’ industrial school” which is a 
school for committed juvenile delinquents (JD) 
located on the outskirts of the capital, A te 
pretation requires a more detailed description 
two schools from which subjects were draw 
within which they were tested. GS is a go 
“elementary school” (primary grades with the 
of the American equivalent of the junior high £ 
grades), containing the most “representative” 
of relatively urbanized, Negro, lower class chi 
the basis of advice from the local school a 
and several inspection trips to schools in the i 
latter tended to confirm the impression given | 
authorities, Approximately one third of the 
at GS come from ethnic backgrounds oi 
Trinidadian Negro (e.g., East Indian, Ch 


proper at both schools were Trinidadian Ne 
no further restrictions, other than age (all 
ranging from 12 to 14) and availability at the ti 
„testing were imposed in the selection of subj 
Examination of the family backgrounds, e.g. 
size and constellation, living conditions, 
occupations and income, etc., of 10 randomly 
children from each of the two schools sugges: 

similarities with respect to the gross external as 
of living and economic circumstances in the 
grounds of the GS and JD groups. JD contains 
mately 250 boys, ranging in age from 10 to 
have been committed for extremely hetero 

reasons and offenses, with a minimum of two prél 
convictions for delinquency prior to commitm 
most common offense being petty larceny. The phi 
facilities at the school are relatively good a 

physical conditions (with respect to food, cloi 
living quarters) compare favorably on the who 
those of noninstitutionalized children from the | 
middle and lower socioeconomic strata in r 
(e.g., GS). Specific inquiry was made into the 
ability of candy for the children at JD, since 
relevant to the experimental procedure, This 


3 There were no differences approaching 
significance in the proportion of subjects obta 
each age level at JD as compared to GS. 


bn 
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revealed that the children are allowed to have and eat 
candy freely, that they have “allowances” sufficient 
for them to purchase a large variety of candy on their 
own three or four times weekly, and that this is done 
regularly by virtually all of the children. This situation 
is again highly comparable to that of the sample from 
GS with respect to the subjects’ familiarity with, and 
use of, candy. The school is residential, with the children 
living in dormitories, 40 to 50 per dormitory, assigned 
on an age basis. 

Measures, An actual behavior choice that elicited 
preference for either immediate, smaller reinforcement 
or delayed, larger reinforcement was used at each 
school (see below). In addition, two direct questions 
were added at JD in the attempt to obtain a preliminary 
test of the consistency with which subjects prefer 
immediate or delayed rewards, and in the attempt to 
relate the consistency of such preferences to the de- 
pendent measures. The two questions were inserted as 
Items 5 and 15 of the shortened SRS form admin- 
istered at JD and were phrased as follows: 

1. I would rather get ten dollars right now than 
have to wait a whole month and get thirty 
dollars then. 

2. I would rather wait to get a much larger gift 
much later rather than get a smaller one now. 

The behavioral choice of preference for immediate, 
smaller as opposed to delayed, larger reinforcement 
was given as the last item of the total battery. 

All items from the 50-item SRS (Harris, 1957, 
called “Social Attitudes Scale,” p. 324, Table 1), with 
the exception of Item 50, were administered at GS. 
This item was omitted on the basis of anticipated 
inappropriateness or irrelevance due to cultural dif- 
ferences, both as directly observed in the course of 
field work and on the basis of advice from local in- 
formants familiar with the culture. Minor modifications 
in phrasing were employed on six other items to make 
them culturally appropriate.‘ The size of the scale was 
drastically reduced for administration at JD since the 
administration at GS (which was conducted first) 
revealed that the procedure was too long and restless- 
ness on the part of the subjects was imminent. It was 
anticipated that subjects at JD would show even less 
attention for any prolonged time period and the number 
of items was consequently further reduced to shorten 
the procedure, An essentially random procedure was 


4The scale used was identical with that presented 
in Table 1, p. 324 of the Harris (1957) study, with 
Item 50 omitted and the following items modified to 
read: 18. When I work in a group (instead of “on a 
committee”) I usually let other people do most of the 
planning; 32. Never give a fool (instead of “sucker”’) 
an even chance; 35. It’s a good thing the Ocean (instead 
of “Atlantic ocean”) separates us from Europe because 
then we don’t have to worry about them; 40. If every- 
one helps (instead of “pitches in”) to do a job, it can 
always get done; 42. Doing things which are important 
should come before things you enjoy doing or like to do; 
49. When given a job (instead of a “task”) I stick to it 
even if things I like to do better come along. The 
italicized words above indicate the new modifications 
made in the items. 


used for dropping items, retaining, however, a similar 
proportion of items with personal vs. nonpersonal 
reference, The SRS constituted 18 items of the 25-item 
revised battery: Items 1, 2, 3, 7, 11, 17, 19, 23, 24, 27, 
28, 29, 31, 32, 40, 43, 44, and 48 of the SRS as reported 
by Harris (1957). 

Subjects at JD were also asked, as one of the items 
in the series of questions, to record the year during 
which the national election took place. This event, 
occurring in 1956, was of major importance to most 
Trinidadians since it marked the beginnings of a new 
government, the first election of its kind, the first step 
towards Federation, etc., and continued to be a matter 
of major popular concern at the time of the study 
(1959). All the children had heard of the election and 
were, in general terms, familiar with its occurrence: 
the task here was to place it correctly in time. This 
served as the measure of accuracy in the recall of a 
past event. 

The reinforcements. Following the procedure de- 
veloped in an earlier study (Mischel, 1958) pre-experi- 
mental sessions were used to select two appropriate 
reinforcements for use in the experiment proper. 
Towards this end, 20 Negro boys randomly selected 
from another group of Trinidadian delinquent subjects 
living within the capital city and of comparable ages, 
were seen in individual sessions and their preferences 
for various specific reinforcements were elicited. As a 
result, two reinforcements (both candy, but varying 
markedly in price, size, and packaging, i.e., a $.05 and 
a $.25 candy bar) were selected, These met the desired 
requirements inasmuch as the larger reinforcement was 
uniformly preferred in a straight choice situation 
(“Which one of these two would you like to take?”), 
but when the choice was “You can have this one (the 
smaller) today or this one (the larger) in one week,” 
approximately 50% of the group chose the former and 
approximately 50% the latter. The purpose of this 
procedure was to select a reinforcement pair that 
would, as closely as possible, dichotomize the choices of 
subjects in the JD group, without actually testing 
within the JD setting to avoid the anticipated con- 
tamination effects arising from pre-experimental 
discussions by the subjects in the same residential 
school. 


Procedure, The experimenter was introduced as an 
American from a college in the United States who was 
interested in gathering information on the children in 
the various schools of the island. To help with this 
the subjects were asked to answer a number of ques- 
tions. The following statement was included in the 
general introduction: 

Let me explain to you what we are going to do 
today. It is really quite simple if you will listen very 
carefully. As a part of our studies we are collecting 
information of all sorts from children and students 
all over the world. None of the things you will be 
doing are tests or examinations ,..none of them 
have right or wrong answers...none of them will 
be marked or graded...and none of them will in 


5 The same experimenter was used in all testing and 
every effort was made to create similar testing atmos- 
pheres in all sessions. 
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any way affect your own work here at school. As a 
matter of fact, your teachers here will never see your 
answers to any of the questions that we will be asking 
you. Nothing will be marked or graded. Instead, we 
want to get some information about you... like 
how old you are and how you feel about different 
things. Although there are no right or wrong answers, 
it is most important that you try to be honest and 
truthful throughout: we are interested in how and 
what you are feeling and thinking. ... Remember, 
this is nof an examination, there are no right or 
wrong answers, your teachers will not see what you 
write...and you will help us most by being com- 
pletely truthful and honest throughout. 

All questions were deliberately posed orally, with 
the subjects following the experimenter question by 
question on the printed questionnaire, since previous 
work in the same culture strongly indicated the su- 
periority of this method with respect to insuring greater 
attention, interest, and clarity, Subjects were carefully 
instructed about how to record their answers next to 
the appropriate numbers on their questionnaires. 
The experimenter then said: 

Now I will read each of the sentences, one at a 
time, and repeat each sentence once to make sure 
you understand .... After I have read the sentence 
two times I want you to put a Y for Yes, I agree with 
it down if you really do agree with the sentence, that 
is, if it is true for you. But put down an N for No 
if you do not agree with it or if it is not true for you. 
For example, if I read: I am a boy who lives in 
Trinidad, all the boys would put down a Y and all 
the girls would put down an N. Do you understand? 
Make sure that you keep up with me on your paper 
and that you put your first ¥ or N down next to the 
Number 1, your second next to the Number 2, and 
so forth. Now, we will read one sentence at a time, I 
will repeat each sentence once, and you will put 
down a Y for Yes if you agree with it, it is true for 
you, and an N for No if you disagree with it or it is 
not true for you. Some of the sentences will be true 
for you and some will not. Now let us read the 
first one, listen carefully. 

Upon completion of the questions, the experimenter 
expressed his wish to thank the group for their co- 
operation. He displayed the two kinds of reinforcement 
and said: 

T would like to give each of you a piece of candy 
but I don’t have enough of these [indicating the 
larger, more preferred reinforcement] with me today. 
So you can either get this one [indicating the smaller, 
less preferred reinforcement] right now, today, or, if 
you want to, you can wait for this one [indicating] 
which I will bring back next Friday [one week delay 
interval]. 

These instructions were repeated in rephrased form to 
insure clarity and both reinforcements were carefully 
displayed. It was stressed that getting the (smaller) 
candy today precluded getting the (larger) one next 
week and vice versa. Subjects were asked to indicate 
their choice by writing T (today) or F (Friday, next 
week) on their questionnaires. The response made 
here was the measure of the behavioral choice of a 
larger (or more preferred) delayed reinforcement or a 


l 
| 
smaller (less preferred) immediate reinforcement 
Subjects were seated sufficiently far apart from each 
other to insure reasonably that their choices were 
independently. All testing was done in a group setting, 
and conducted on the same weekday (but in different 
weeks) and at similar times of day, within each of the 
schools, in large classrooms. | 
| 
RESULTS AND DISCUSSION | 


Examination of the relationship between 
sex and preference for delayed or immediate 
reinforcement revealed no differences ap 
proaching statistical significance and conse 
quently the male and female data were 
treated collectively in subsequent analyses, 
Comparison of the delinquent (JD) and 
nondelinquent (GS) groups on the experi 
mental choice was based on the following 
data. At JD, 31 subjects chose the immediate 
reinforcement and 39 chose the delayed) 
reinforcement; at GS, 35 subjects chose the 
immediate and 101 subjects chose the delayed. 
Application of the chi square test to these 
data yielded a chi square of 6.48 (p < .02), 
a significantly larger proportion of delinquent 
subjects choosing immediate, smaller zeintan 
ment, as was predicted.’ | 

Turning to the within-group data, thej 
following results were obtained. At GS, com- 
parison of SRS scores of subjects choosing 
immediate, smaller reinforcement with thost 
choosing delayed, larger reinforcement (Table 
1) resulted in a ż of 2.24 (p < .025), with 
differences in the predicted direction, if 
subjects preferring the delayed, larger rein 
forcement have significantly higher social 
responsibility scores. | 

At JD, where three measures of reinforce- 
ment preference were used, the consistency o 
such preferences (Table 2) can be examined. 
The responses to the two questionnaire items 
eliciting preference for immediate, smaller oF 
delayed, larger reinforcement were group 
into the following categories: consistent 
immediate reward (both questions answe 
in the direction of preference for immediate, 
smaller reinforcement); consistent delay 
reward (both questions answered in thé 
direction of preference for delayed, larget 
reward); and inconsistent responses (oné 


* All chi square tests on 2 X 2 contingency tables 
are corrected with Yates correction for continuity 
throughout this study. 
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TABLE 1 
RELATIONSHIP BETWEEN REINFORCEMENT PREFERENCE AND SOCIAL Responsipitity Scare (SRS) 
SRS 
Group Reinforcement Preference 
N “ Sam Seon * 
JD (delinquents) Immediate (candy choice) 31 8.1 2.51 45 
Delayed (candy choice) 39 9.0 2.26 -50 1.55 <.10 
Consistently immediate? 12 7.17 2.61 40 
Consistently delayed’ 19 9.11 2.32 51 2.04 025 
GS (nondelinquents) Immediate (candy choice) 35 28.77 3.29 59 
Delayed (candy choice) 101 30.43 3.92 62 2.24 <.025 


a SRS mean divided by the number of items in the scale: 18 at JD, 49 at GS. 


b Over the three measures. 
e One-tailed test of significance. 


TABLE 2 
CONSISTENCY OF PREFERENCE FOR IMMEDIATE OR 
DELAYED REINFORCEMENT: COMPARISON OF ACTUAL 
CHOICE WITH THE TwO QUESTIONNAIRE 
Responses AT JD 


Actual Choice* aee TRA N Corlay N Inconsistent 


Immediate 12 4 * 15 
Delayed 4 19 16¢ 


® Candy situation. 

b Both questionnaire items answered in the direction of pref- 
erence for immediate reward. 

© Both questionnaire items answered in the direction of pref- 
erence for delayed reward. 

d ya = 13.07; p < .01. 


question answered in the direction of immedi- 
ate reward and the other in the direction of 
delayed reward). The responses falling within 
each of these categories were in turn compared 
with the subject’s actual choice in the be- 
havioral preference situation. Applying the 
chi square test to this table yielded a chi 
square of 13.07 (.01 > p > .001); there is, 
thus, a more than random association between 
responses made to each of the three measures 
used. Since the three measures have much in 
common in their phrasing and were all ob- 
tained within the same testing situation, these 
data provide only a limited, first step in the 
appraisal of the intra-individual consistency 
of such reinforcement preferences. 

At JD, a ż test comparing the SRS scores of 
subjects choosing immediate, smaller reinforce- 
ment in the actual behavior choice (candy 
situation) with those choosing delayed, larger 
reinforcement (Table 1) indicates that the 
difference in the mean scores of the two groups, 
although in the predicted direction, does not 


reach statistical significance. For this group, 
however, it is possible to examine the relation- 
ship between relatively consistent reinforce- 
ment preferences and SRS scores. Table 1 com- 
pares the mean of the SRS scores of subjects 
showing a consistent preference for immediate, 
smaller reinforcement with the mean of those 
consistently preferring larger, delayed rein- 
forcement and this comparison yields a £ 
(one-tailed test) of 2.04, with the differences 
between means being significant (p < .025) in 
the anticipated direction. Thus, although the 
candy choice behavior in itself does not differ- 
entiate subjects within this group with respect 
to SRS, the classification of subjects into rela- 
tively more consistent reinforcement prefer- 
ence patterns through the use of multiple 
preference measures does yield the predicted 
differences. This finding strongly suggests that 
future research exploring the antecedents and 
correlates of preference for immediate or de- 
layed reward might more fruitfully employ 
multiple rather than single measures of rein- 
forcement preference. 

Table 3 shows the relationship between 
accuracy in the recall of the time of a past 
event and reinforcement preferences at JD. 
Responses to the question eliciting the year of 
the last national election were categorized into 
“accurate” (1956) and “inaccurate” (all other 
responses). Applying the chi square test to the 
data in Table 3 results in a significant relation- 
ship between the choice behavior and accuracy 
of time recall. Subjects preferring the im- 
mediate smaller reward in the candy choice 
situation show less accuracy in time recall 
62 = 6.35, p < .02). 
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TABLE 3 


REINFORCEMENT PREFERENCE AND 
Twe RecALL ar JD 


ACCURACY OF 


Accuracy of Time Recall 
Reinforcement Preference N N 
Accu- Inac- x ? 
rate curate 
Immediate choice* 4 2 
Delayed choice* 17) 6226.35. <.02 
Consistently im- 4 8 


mediate? 


Consistently delayed® 15 4 4.86 <.05 


® Choice in candy situation only. 
b Consistent preference over the three measures. 


This finding is relevant to the work of 
Spivack, Levine, and Sprigle (1959) who re- 
lated the length of time estimates made by 
subjects asked to state when 15”, 30”, and 60” 
time intervals had elapsed to other measures of 
“ego delay function” and to intelligence. The 
possible theoretical significance of such rela- 
tionships between particular patterns of prefer- 
ence for immediate or delayed rewards and 
time judgments, time span (Epley & Ricks, 
1959), and other measures of the subjects’ 
conceptions and uses of time, cannot be elabo- 
rated in the present report but is the object of 
ongoing research. At present, this finding pro- 
vides only another provocative empirical 
correlate of the choice behavior of interest. It 
should be noted, however, that a recent study 
(Melikian, 1959) has indicated relations be- 
tween a preference for delayed reinforcement 
measure similar to that used in the present 
study and intelligence, and that scores on our 
time recall measure may be more dependent 
upon the subject’s intelligence than upon his 
accuracy specifically in making time state- 
ments. In Table 3, comparison of the accuracy 
of time recall scores of the subjects consist- 
ently preferring immediate or delayed reward 
with the total figures based on the candy choice 
only suggests that the major difference is that 
those making consistent reward choices over 
the three measures, whether for delayed or 
immediate reward, give the preponderance of 
“Tight” responses to the time recall question 
for the total sample. As such, the time recall 
Measure may indeed reflect the subject’s 
alertness, memory, consistency, and other such 
intellectual functions which in turn may be 
related to response consistency over the three 
measures. 


Finally, the data permit some comparison „ 
of SRS scores of Trinidadian subjects with the» 
scores obtained in the United States standardi- 
zation procedures on overlapping age groups, 
The differences in mean SRS scores obtained | 
in the GS (nondelinquent) sample, as com: 
pared to those reported by Harris in 1957 for | 
the United States groups (for boys, M of SRS 
at age 12 = 36.94, SD = 5.14; M of SRS for 
girls at age 12 = 39.42, SD = 3.99) are sa 
great as to make tests for the significance of 
the difference superfluous. Clearly (and recog 
nizing that the present data are based on a 49- 
item scale whereas Harris’ are based on 50) 
items) the Trinidadian subjects showed sig- 
nificantly lower social responsibility scores on 
the modified United States scale. This finding) 
is fully consistent with anthropological ob- 
servations made about the culture on the basis 
of three field trips and well supported by 
numerous other workers in the Caribbean (e.g, 
Herskovits & Herskovits, 1947). Of course, 
the dangers of cross-cultural comparisons on & 
variable such as social responsibility, using & 
measure developed and standardized within 
one culture only, are obvious and great. Never 
theless, the sharpness of the differences ob- 
tained supports the possible fruitfulness of 
using relatively simple quantitative measures 
when gross differences between cultural groups, 
are sought to test anthropological hypotheses 

The overall findings lend further support to 
the fruitfulness of using relatively simple 
preference measures for eliciting preferences 
for immediate, smaller as opposed to delayed) 
larger reinforcement in order to explore the 
dimension of delay of reinforcement and itg 
antecedents and correlates. Before the rea 

| 


TS I 


tionships obtained can be generalized it will be 
necessary to test other cultural groups Mi 
similar experimental situations. The maja 
significance of the present results is that they. 
constitute an early step in testing empirically” 
the popular theoretical conception, which 
has been relatively neglected experimentallyy 
that delay of reinforcement is related post 

tively to such concepts as “adjustment” and 
“maturity,” and negatively to criminal, 
psychopathic, or socially irresponsible be 
havior. It remains for future research to isolaté x 
further the major component variable relate® 
to particular patterns of preference for de 
layed or immediate reinforcement. 
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A SUMMARY 

This experiment, using Trinidadian subjects 
aged 12 to 14, explored the relationship be- 
tween preference for immediate, smaller re- 
inforcement as opposed to delayed, larger re- 
inforcement and (a) social responsibility as 
measured by an independently standardized 
scale, (6) delinquent behavior, and (c) accuracy 
in time statements. The primary results were 
(a) differences, as predicted, between de- 
linquent and nondelinquent subjects, the 
former showing greater preference for immedi- 
ate, smaller reinforcement, and (6) a tendency 
for subjects who preferred immediate, smaller 
reinforcement to show lower social responsi- 
bility scores, although in one of the two groups 
used this difference was significant only when 
restricted to subjects who showed consistent 
reinforcement preference patterns on three 
measures. Data relating delayed reinforce- 
ment preference patterns to accuracy in time 
Statements were also presented. The overall 
findings suggest that quite simple and direct 
measures of reinforcement preference, in which 
smaller but immediately available rewards are 
pitted against larger but delayed rewards, 
may be effectively used to test often-discussed 
but rarely tested relationships between “ma- 
turity,” “adjustment,” “psychopathy,” etc. 
and particular patterns of reinforcement 
preference with respect to delay in choice situa- 
tions. é 
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STABILITY AND FLUCTUATION IN THE POWER RELATIONSHIPS 
OF YOUNG CHILDREN! 
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Is study explores one aspect of indi- 

vidual constancy in social interaction. 

In it, stability has been investigated in 
terms of day-by-day fluctuations in the domi- 
nant, acquiescent, and resistant behavior of 
preschool age children. For each subject, 
power related acts were recorded during three 
successive play sessions. Social and physical 
environments were held maximally constant 
by providing the same setting and an identical 
playmate for each child in all three play 
periods. 

The stability of individual interaction pat- 
terns may be explored (a) by comparing the 
social behavior of the same subject (or subjects) 
in a variety of physiologic states, (b) by com- 
paring the social behavior of the same subject 
in a variety of social situations, (c) by compar- 
ing the social behavior of the same subject in a 
variety of physical settings, and (d) by com- 
paring the behavior of the same subject over 
time—that is, in terms of sequences or succes- 
sive behavior samples collected for a more or 
less extended period. The present study is con- 
cerned with the last mentioned aspect of con- 
sistency. 

A number of investigators have reported 
that persons tend to maintain consistent pat- 
terns of interaction in social situations (Bell & 
French, 1950; Hanfmann, 1935; Matarazzo, 
Saslow, & Guze, 1956; Matarazzo, Saslow, & 
Matarazzo, 1956; Merrill, 1954). However, as 
Chein et al. (1945) have pointed out, “... 
allowing the assumption of an overall stable 
pattern (of social behavior), the question of the 

1The data cited here are part of a larger study 
entitled, “Patterns of Dominance, Submission, and Re- 
sistance in the Interaction of Young Children” (Gellert, 
1956). This investigation was submitted in partial 
fulfillment of the EdD requirements at Harvard 
Graduate School of Education. 

2The author is indebted to J. W. M. Whiting, 
John Carroll, and George Mandler for their guidance. 
Grateful acknowledgement is made, also, to the staff 
and children of the Harvard Preschool for their coopera- 
tion. Lelia Washburn assisted in the experiment. Sup- 


port was also provided by NIMH Grant 2M 6418(C4) 
College 65, Interdisciplinary Program. 


average extent of day to day variability 
within this pattern of response remains rela- 
tively unexplored.” If observations of social 
behavior are to be used in assessing personality, 
the degree of stability to be expected with 
regard to the behavior dimensions sampled 
becomes important. Particularly in the assess- 
ment of individual and group differences” 
among the very young, measures are frequently 
based upon direct observation (Gellert, 1955). 
However, as Arrington (1943) has indicated, 
“the evidence contributed by time sampling 
and related studies (of children) concerning 
the stability of patterns of social . . . behavior 
observed to be characteristic of given indi- 
viduals in particular situations is extremely 
meager.” The present study represents an 
attempt to supply such evidence with respect 
to the power related behavior of preschool 
children. 


METHOD 
Subjects 


The subjects were 16 boys and 16 girls enrolled at 
Harvard Preschool, ranging in age from 46 months to 62 
months. All were Caucasian. Harvard Preschool is & 
nursery school attended, for the most part, by the chil- 
dren of college graduates. Most of the subjects had been 
acquainted with one another previous to taking 
in the study. 


Procedure 


Stability in social behavior should be maximally 
apparent in a constant setting—that is, under condi- 
tions where there is little change from one time t0 
another with respect to the physical and social environ 
ment wherein a person is observed. Such condition’ 
were approximated in the experiment to be described: 
All experimental sessions took place in the same 
playroom. An identical array and arrangement of 
play materials was maintained. The toys provid 
were selected according to the following criteria: 
they should facilitate social interaction, they shoi 
lend themselves to a variety of uses so that the same 
subjects may play with them for repeated sessions 
without getting bored, they should appeal to preschool 
age boys and girls, they should not require adult 
direction for their use. 

A trough filled with water and materials to be used 
for water play were provided; in another part of the 
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room, child-size furniture and housekeeping equipment 
were set out. An assortment of “dress up” clothes 
(donated adult wearing apparel) was included. In a 
third area, blocks, toy vehicles, and a train track set 
were placed. The playroom was equipped with one-way 
vision mirrors. Sound was transmitted to the adjoining 
observation booths, 

The children were observed in pairs.? Each subject 
interacted with the same partner for three, 20-minute 
play sessions. Intervals between sessions, per pair, 
ranged from one day to a week. All pairs were homo- 
geneous with respect to sex. Upon being taken to the 
playroom, the subjects were told that they could play 
with any of the things in the room. After demonstrating 
how she could be reached, if needed, the experimenter 
left the room, promising to return when playtime was 
over. For the most part, ongoing interaction was not 
interrupted by the experimenters. The subjects were 
not told that they were being observed. Individual be- 
havior records of both children per dyad were charted 
concurrently. The experimenter and her assistant spot- 
checked their degree of agreement in recording their 
observations. Their average interobserver reliability 
was .81.4 

Behavior was recorded in terms of unit acts defined 
as follows: Any scorable action or verbalization that is 
terminated by interruption as a result of other person’s 
acts, by change of intent, change of direction, or by 
addition of new ideas on the part of the subject (that is, 
“Let’s put on the shoes and then let’s play house” is 
scored as two units because two ideas are expressed). 
Whenever a series of interacts turns into a ritualistic, 
Tepetitive game, only two units per subject are scored 
in a given category. 


Selection and Development of Behavior Categories 


Descriptions of power related behavior given by 
Murray (1938), Allport (1937), Cattell (1954), Chapple 
(Matarazzo, Saslow, & Matarazzo, 1956), Jack (1934), 
Anderson (1937, 1939), and Child (1954) were culled.® 


*Tt was found that attending to more than two chil- 
dren at a time greatly reduced the reliability of observa- 
tions. 

4In addition to preliminary observations of indi- 
vidual children during play in their nursery school 
group, 12 pre-experimental paired play sessions were 
required to attain interobserver reliabilities between 
-75 and .85. The formula used to compute reliability 
coefficients was derived from Jersild and Markey 
(1935). It was: 


2 (number of agreements) 


2 (number agreements) 
+ A’s omissions + B’s omissions 


(Omissions were units recorded by one observer, but 
not the other; agreements were the number of units, per 
category, that were recorded by both observers.) 

5 There is no assurance that the aggregate of be- 
haviors and inclinations referred to in the literature 
under the denominations of dominance, ascendancy, 
leadership, and initiative spring from a common 
motivation, In fact, Clark and Birch (1945) have 


Actions or dispositions they considered to typify domi- 
nance or submissiveness were phrased in terms of ap- 
parently corresponding behavior observed in young chil- 
dren, and grouped into a set of discrete categories. Dur- 
ing trial observations, these categories were modified 
and refined; several were deleted because corresponding 
acts occurred very rarely. (A complete list of the initial 
set of behavior categories appears elsewhere—Lambert, 
1960.) 


Definition of Dominance 


In subsuming several behavior categories under one 
heading and by assigning the same value to acts in all 
categories, the categories are treated as though they 
were equivalent with respect to dominance. The be- 
haviors represented by each are presumed to stem from 
a common inclination: the urge to dominate. While a 
significant correlation among dimensions of behavior 
assures neither equivalence nor common motivation, 
it does suggest that their association probably is not 
due to chance. With this consideration in mind, only 
those dimensions were retained in the final set of domi- 
nance categories that correlated significantly (p < .05) 
with composite dominance scores for each sex taken 
separately. 

Composite scores were obtained by combining 
ranked performance in all 10 categories which initially 
were thought to reflect the intention to dominate. For 
each of these categories, the subjects were ranked in 
terms of frequency of acts. The sum of each child’s rank 
positions on all 10 categories was taken as his composite 
score. Although positive Spearman rank correlations 
with the composite score were obtained for all the 
original categories, only 6 of them met the criterion 
for being retained. The final group of categories used 
to define dominance is described below.® 

1. Dominate play; instruct.” P attempts to gain prece- 

dence in play; P pushes O out of the way; P 
changes what O has just built; P grabs toys O 
is using in order to use them himself; P initiates 
rules or refers to real or contrived “adult es- 
tablished rules” in order to direct play. (Only 
behavior occurring in the course of active, 
joint play was included in this category.) 
Examples: 
“Tf you want to play with me you'll have 


cited evidence that in the case of animals “priority of 
response in one type of competitive situation does not 
necessarily correlate highly with dominance in other 
competitive activities...” and that relative position 
in the pecking order may bear “little or no predictive 
relation to the initiation of new activities.” Clearly, 
there is a continuing need to identify and circumscribe 
qualitative entities associated with power in social 
intercourse, and to study the interrelationships among 
such entities. 

ê The results of a more fundamental test of homo- 
geneity, with respect to the categories used, are cited 
elsewhere (Lambert, 1960). This test was based upon 
data collected in a subsequent experiment. 

7“P” refers to the subject whose behavior is being 
recorded. “O” refers to the other child comprising the 
dyad. 
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to...”; verbal threats to gain control, such as, 
“the teacher will be mad if you...”; also in- 
structions such as, “Pll show you . . .,” “‘you’re 
supposed to...,” “it goes this way...,” 


“we're going to...” “you better. ..”; giving 
permission and assigning roles in play, such 
as, “you can be...,” “it’s my turn and then 
it’s your turn...,” “this can be yours,” “you 
can be mother... “nobody can come here 


but us”; refusing permission when asked, such 
as, “no, you can’t haveit.” 

2. Positive mandate, Spontaneous order or suggestion 
that does not emerge in the course of play. (Ex- 
ception: orders that seem intended for atten- 
tion seeking, such as, “look at me” or “look 
what I made.”) 

Examples: 
“Come here!” “Let me take it!’ “Put it 
down!’—when said aside from joint play, 

3. Negative mandate. “Don’t” “Stop!” etc. when 
used as an order rather than in the context of 
an altercation or as a defense against manifest 
domination on O’s part. Negation has to be 
verbalized, 

Examples: 
Children are in different part of the play area 
and P attempts to control O’s play by saying 
“Don’t do...” “No, that’s mine,” etc. 

4. Suggest orientation. This is a more gentle, or gen- 
teel form of domination than the above. In- 
cluded are suggestions qualified by terms, such 
as, “please” “O.K.?” “would you like to...” 
or “let’s....” Conditional suggestions, such 
as, “would you do it...” “could you....” 
(Exception: “Let me...” is scored as Positive 
mandate rather than in this category.) 

5. Boast. Favorable comparison of self with other 
child, with regard to prowess, property, crea- 
tion, appearance, etc.; boasting. 

Examples: 
“Mine is bigger” “I can jump way to the 
sky!” “My father can do it better than yours!” 

6, Tease. P initiates a threatening gesture made “in 
fun,” that is, without carrying out the implied 
threat; name calling; P derogates O, O’s prop- 
erty, or O’s efforts, 

Examples: 
“You didn’t make it good.” “Ha, ha, 
you can’t catch me!” “Stinker!” “I don’t like 
your....” 

Only behavior initiated by an individual, rather than 
occurring in response to domination on the part of a 
playmate, was defined as dominance. Behavior was 
scored in terms of apparent intention irrespective of its 
outcome, For example, trying to appropriate another 
child's toy during joint play, was scored as “Dominates 
ey 1), whether the attempt was successful 
or not. 


Definition of Submission 
The subcategories comprising submission were 


selected by applying the same methods which have been 
described in relation to dominance. However, responses 


as well as initiated behavior reflecting deference were 
included in the definition of submission. The following 
categories were retained in the final set. 

7. Comply. P complies in response to O’s directions, 
domination, or bid for attention; P obeys or — 
accepts O’s impositions passively, desists, gives 
in; P concedes actively in response to O’s domi- 
nation; P does not defend self, property, or 
tights. (Act may be accompanied by verbal 
agreement, which is then not scored separately.) 

8. Agree verbally. Verbal concession not accom- 
panied by active compliance of any sort, in re- 
sponse to O’s domination. (Exceptions: “yes” | 
in response to O’s asking permission; “yes” ~ 
in response to O’s asking a question.) 

Examples: 
“Yes” “O,K.” “I will” “All right.” 

9, Ask for permission, directions, orientation. Almost 
any question is included here. (Exceptions: 
“What?” in response to not having understood 
something O said; “You know what I did? ...” 
—which is really a comment designed to give 
information rather than obtain it.) 

Examples: 
“Can I put mine here?” “Where does 
this go?” “I don’t know how to....” 


Definition of Resistance 


With respect to the present subjects, it was found 
that opposition to impositions was not correlated sig- 
nificantly with dominance. Hence, such behavior was 
treated as a separate dimension, called resistance. The 
following categories constituted the subclassifications 
of resistance: 

10. Countermandate. Domination in response to 
domination, noncompliance, or frustration 
occasioned by O. (Any scorable form of domina- 
tion or noncompliance on O’s part may pro- 
vide a stimulus to a countermandate.) Condi- 
tional surrender, such as, “I will, if you...”5 
“Tl give it to you later”; counteraggression, 
counterteasing, counterboasting; sentences 
starting with “yes, but...” “no, but...” 
“but...”; counterarguments and disagree- 
ments in response to O’s domination; a con- 
trary order or a revocation of O’s command. 
Included here is the mechanism of distracting 
O’s attention as a defense—that is, O splashes 
water on P; P distracts him by saying, “Look 
what’s out there!” (Exception: a simple, factual 
correction, when not said in the context of 
heated argument is scored in Category 1 rather 
than here.) 

Example: 
O: “Let’s play with boats!” P: “No, let’s 
build a castle.” 

11. Noncompliance; self-defense. Resistance to O'S 
domination, per se—that is, not in conjunction 
with attempts to counterdominate. Persistence 
in the direction of own activity in the face of 
O’s attempt to change it; defense of self, prop- 
erty, or rights. (Exception: when a refusal is 
followed by a counterattack or countersugges” 
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tion the whole episode is scored under Category 
10.) 

Examples: 
“No!” “Don’t!? “I don’t want to...” 
“I won’t”—in response to O’s domination or 
in self-defense. 


RESULTS AND Discussion 


Table 1 presents a summary of the subjects’ 
scores in the experimental sessions. On the 
average, girls tallied more unit acts than did 
boys. Since this difference did not approach 
statistical significance (Median test: p > .2 
—Edwards, 1954), the data for both sexes were 
combined in some procedures used for their 
analysis. 

When performance in the three sessions was 
examined per individual, it was found that 
mean intersession differences in dominance 
tanged from 0 to 18 unit acts. While most 
children varied by less than seven dominance 
units when their records in three play periods 
were compared, nine subjects averaged session- 
to-session fluctuations of 12 to 18 unit acts. 
(Mean dominance per session, for boys and 
girls combined, was 19.7 units.) In terms of 
absolute frequency, then, some individuals 
varied considerably in dominance from period 
to period, whereas others demonstrated re- 
markable intersession stability. A comparable 


range of constancy was found with respect to 
submission and resistance. 

In addition to the inspection of raw scores, 
the two following methods were used to deter- 
mine the degree and manner of the children’s 
consistency in power related behavior. First, 
the 30 subjects were compared with one 
another by ranking them according to the 
number of dominant acts recorded for each, in 
Sessions I, II, and II, respectively. In this 
way, three sets of ranks, each with an W of 30, 
were obtained. By using Kendall’s coefficient 
of concordance (Siegel, 1956), the degree of 
association among these three ranks was as- 
sessed. The same procedure was applied to the 
records of submission and resistance. The 
results are presented in Table 2. They show 
that for the sample taken as a whole, significant 
Coefficients of concordance were found with 
regard to dominance in all three sessions as well 
as in submission for all three sessions, How- 
ever, resistance did not prove to be as markedly 
constant when ranks for the three sessions 
were compared. 

A further measure of consistency is the ex- 
tent to which an individual maintains the posi- 
tion of greater (or lesser) dominance, submis- 
sion, or resistance relative to his partner. In 


TABLE 1 


AVERAGES AND VARIABILITIES OF SCORES 
(N = 14 Boys + 16 Girls) 


Session I Session II Session III All Sessions 

Màn M SD Mia M SD Min m è sD MEAD Per Range Biter 
Dominance 

me WS Bd ME BS RY BE BE Ma Bt Bt tS 8 
Submission 

Ge WS heeg addaa i a U aoe ae we a 
Resistance 

Gis, EEE ee E E E 


* For each subject, the absolute differences in scores between Sessions I and II, Sessions II and II, and nee ree nad ha 
“immed and divided by 3 in order to obtain his Mean Difference score. The numbers in this column represent means 2 
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TABLE 2 
COEFFICIENTS OF CONCORDANCE FOR THE THREE 
Pray SESSIONS 
WN = 30) 
w # >» 
Dominance 
Category 1 
Dominate Play 65 <.01 48 
Categories 2 and 3 
Mandate (positive and 75 <.001 63 
negative, 
Category 4 
Suggest Orientation .59 <.05 38 
Combined Score .18  <.001 67 
Submission 
Categories 7 and 8 
Comply and Agree .67 <.001 51 
verbally 
Category 9 
Ask for permission, di- .73 <.001 56 
rections, orientation 
Combined Score .13 <.001 56 
Resistance 
Category 10 
Countermandate 5 < .02 .33 
Category 11 
Noncompliance, self- 43 >.05ns .14 
defense 
Combined Score AD  <.10ns = .23 


Note.—Ws for Categories 5 and 6 were not computed separately 
because of the generally low frequency, per subject, of behavior 
assigned to these classifications. The Combined Score is the sum of 
entries in all categories per behavior system. 

* One-tailed test. 

b Average value of the rank correlation coefficients (Edwards, 
1954), 


order to determine this, the records of the 
subjects comprising each dyad were compared 
with one another. Figure 1 indicates which 
subject had the higher dominance, submission, 
and resistance score, respectively, in each of 
the sessions. 

It is apparent that there was a marked ten- 
dency for the same individual to tally the 
higher dominance score during all three ses- 
sions. This also held true with respect to sub- 
missive acts. For two-thirds of the pairs, there 
was no shift in dominance balance when per- 
formance in three sessions was compared. 
Thus, the same individual maintained the 
higher dominance score throughout in a sig- 


nificant proportion of cases (p < .001). In 8 
out of 14 pairs, the submission balance re- 
mained the same for all three play periods 


proved to have no greater than chance stability 
when it was viewed in terms of preponderan 
of tallies within each dyad, during the three 
play sessions. The apparent difference in 
stability between resistance on the one hand, 
and dominance and submission on the other, 
may have resulted from any of the following 
factors. 
Resistant behavior may have been recorded: 
with less accuracy than dominance or submis 
sion. It will be remembered that this dimension 
initially was not viewed or treated as a separate 
behavior system. Therefore, the observation 
categories comprising the system, resistance, 
may have been comparatively less exhaustive 
than those classified under dominance and 
submission. Thus, instances of expressing 
resistance in ways not included in the present 
category system may have remained unre" 
corded. Such omissions may account for the 
apparent inconsistency of resistant behavior 
Perhaps more than three 20-minute play 
sessions are required for the establishment of & 
stable resistance level per individual. Thus 
far, it has been assumed that the experience 
of interacting with the same partner in the 
same room for a series of play periods repre 
sented a repetition of rather equivalent situa 
tions for most subjects. This is not necessarily 
the case. For example, individual variability 
in resistance, when performance in three ses 
sions was compared, might have been a func 
tion of systematic differences in conditions 
—perhaps due to phase effects—among thé 
three periods. In order to examine they 
probability that the behavior samples com 
prising the three sessions were drawn from ani 
identical population of sessions, the Friedman 
two-way analysis of variance by ranks (Siege 
1956) was computed for each of the powe 
dimensions studied. No significant intersessio® 
differences were found for dominance or sub= 
mission. However, it was dicovered that the 
series of play sessions probably did not repre= 
sent equivalent situations with respect to 
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Explanation: The letters, Aand B refer to the two children in any dyad, The letter in 
| each box indicates which subject tallied the higher score per session, for each 


dimension. “=" refers to tied scores. 


Fic, 1, Maintenance of higher dominance, submission, and resistance score for members of each dyad. 


expression of resistant behavior (p < .02). 
For most subjects, resistance was highest in 
the second play period, perhaps reflecting an 
ongoing power struggle. Such an interpretation 
is suggested by the findings of Heinecke and 
Bales (1953) who reported that, among groups 
of adults, a status struggle occurred in the 
second session of any series of group meetings. 
In exploring the vicissitudes of power in small 
groups, it is desirable to use subjects who were 
unacquainted with one another previous to 
being observed in interaction—as was done by 
Bell and French (1950). Since the present 
subjects attended the same nursery school, 
they had opportunities to establish power 
patterns with respect to one another prior to 
the experiment. This factor may have inter- 
fered with the development of more pro- 
nounced indications of a status struggle during 
Session IT. A more extended series of sessions 
would be required to explore the possibility 
that periodic power struggles may take place 
in the course of repeated association in any 
Stable, small collection of individuals. Such an 


investigation would have obvious relevance to 
the study of interaction patterns in the nuclear 
family and in other constant primary groups. 

Some dimensions of social interaction prob- 
ably are highly innerdetermined, whereas 
others are primarily situation-determined. If 
social behavior were arranged along a con- 
tinuum in such terms, dominance and sub- 
mission might well fall closer to the “inner- 
determined” end than would resistance. Such 
a view of resistant behavior is consistent with 
the findings reported in Caille’s (1933) study 
of resistant behavior in preschool children. 

It is tempting to speculate about the possible 
effect of developmental level upon the stability 
of interaction patterns. The assumption is 
often made that “the personality achieves a 
type of inner organization which becomes more 
and more powerful with the passage of time, 
and which frees the personality to a greater 
degree from the dominance of the situation” 
(Anderson, 1948). In other words, as he grows 
older, the individual is thought to become in- 
creasingly consolidated, or fixed, with respect 
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to his characteristic patterns of behavior. Even 
if this were so, all aspects of social behavior 
probably would not mature at the same rate. 
Hence, it may be that for most individuals, 
characteristic dispositions with respect to 
dominance and submission are well established 
by age 5, while their inclination to exhibit 
resistance has not yet achieved a commen- 
surate degree of consolidation. 

Arrington (1943) has suggested that con- 
sistency may be characteristic of some indi- 
viduals, and inconsistency of others. The data 
were examined in search of such pervasive dif- 
ferences in intrapersonal stability. This was 
done by comparing each subject’s average dif- 
ference scores (see Table 1 for derivation of 
these scores) for dominance, submission, and 
resistance. No coincidence was found in degree 
of individual variability with respect to these 
three dimensions. Hence, the data do not sup- 
port Arrington’s proposition. However, the 
evidence with regard to this point must be 
viewed with considerable reservation, since 
the present study provided a very limited 
measure of individual patterns of consistency. 
A better picture could have been obtained if 
the subjects had been observed for a longer 
series of sessions. 

The method of recording behavior used in 
this investigation was confined to measuring 
the frequency of power related acts. Intensity, 
duration, and latency of relevant behaviors 
were not assessed. Neither were group differ- 
ences in interaction rate taken into account. 
The difficulties of devising and putting into 
operation a procedure that includes these 
aspects of social behavior led to the adoption of 
the present scheme of assessment. Despite 
the obvious limitations of frequency counts 
in comparing samples of behavior, the data 
suggest that such counts may have some merit 
as an index of individual consistency with 
respect to some of the dimensions involved in 
the present study. 


SUMMARY 


This study was designed to explore day-by- 
day fluctuations in the power related behavior 
of young children. Sixteen pairs of children 
were observed in a constant setting during 
three, 20-minute play periods. Frequency of 
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dominant, submissive, and resistant acts was 
recorded for each individual. 

For many subjects, a fairly high degree of 
intersession variability was found when per- 
formance during the three play periods was 
compared. However, for the group as a whole, 
interindividual differences exceeded individual 
inconsistency with respect to dominance and 
submission. The majority maintained fairly 
stable rank positions within the group in terms 
of these dimensions during the three sessions, 
In a significant proportion of dyads, the same 
child had the higher dominance and submission 
scores, respectively, during all three play 
periods, when the two playmates’ records were 
compared. Such stable individual behavior 


patterns were not found with respect to the y 


expression of resistance. 
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another. But their ratings have a certain 

defect. They tend to correlate unduly. 
Rated voice quality is likely to correlate very 
highly with rated intelligence, even if they are 
in fact quite independent traits. Thorndike 
(1920) christened this tendency “halo.” Since 
then it has been one of psychology’s bread- 
and-butter phenomena. It is a strong, impres- 
sive phenomenon, little known to the public, 
yet easy to describe, and controllable in some 
degree with devices like forced-choice ratings. 
But Thorndike’s act seems to have taken the 
surprise out of it, so that it has remained for 
40 years mainly a bother for the practical 
psychologist, rather than a poser for scientific 
psychology. So little regard has been given to 
explaining it, in fact, that the best explanatory 
discussion Guilford (1954) finds to cite is that 
published by Symonds (1925). And this is not 
because Symonds’ reasons for halo are defini- 
tive—indeed, they seem to suffer mutual 
inconsistency, 

The classic view of halo is that it shows a 
reliance on general impressions arising from 
stupidity or ignorance or carelessness. In 
support of this view is Symonds’ assertion 
that halo is especially strong for ratings on 
traits that are not commonly observed or 
thought about. People are especially stupid and 
careless in handling these unfamiliar, unim- 
portant traits. 

But the argument that halo is merely a 
puerile failing runs into complications with 
other assertions by Symonds. He says that 
halo is also especially Strong for traits of high 
moral importance and for traits involving reac- 
tions with other people. Why should people be 
especially stupid and careless in dealing with 
such traits? Still more difficulty for the argu- 
ment is provided by Sherif, White, and Harvey 
(1955), who found that even on a task of 
Supreme importance to them, handball throw- 
ing, boys based their predictions and judg- 
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ments of another’s performance mainly on his 
status in the group. It has in fact been s 
gested by some that halo is stronger for trai 
that are important to people than for thi 
that are unimportant (Asch, 1952, p. 230; 
Riecken & Homans, 1954). 

At first glance, it might seem that these 
complications make halo a more subtle and 
important phenomenon than the general- 
impression hypothesis would have it, but an 
augmented general-impression hypothesis 
triumph over them. It is argued that in th p 
cases the affective reaction toward a person 
governs judgments of him, preventing differ- 
entiated judgments. It might seem surprising 
that emotion should consistently override 
thought. It might even be supposed that the 
affective reaction itself could be differentiated, 
permitting differentiated judgments. But no 
serious doubts have been expressed about this 
view of halo, and it remains the commonly ac- 
cepted one, with little elaboration and less 
competition. 

There is, however, another kind of difficulty 
which is harder to incorporate under the gen- 
eral-impression hypothesis: people have a 
Positive predilection for the real-world coun- 
terpart of halo, for a high correlation between 
the variables on which people are ranked. A 
number of studies, mostly by sociologists, 
demonstrate this fact. Lenski (1954) has shown 
that if a person has discrepant scores on the 
status variables of society at large (income, 
occupation, education, ethnic background), 
he tends to be dissatisfied. Adams (1953) found 
that the morale of bomber crews was positively 
correlated with the average intercorrelation of 
their status variables, rank, flying time, educa- 
tion, reputed ability, popularity, length of 
service, combat time, and position importance. 
Homans (1953) showed that discrepancies 
between pay and prestige of office jobs pro- 
duced dissatisfaction. Exline and Ziller (1959) _ 
found that congeniality in small groups de- 
pended on the correlation of the members’ 
ability and voting power, which were controlled 
experimentally. And Whyte (1943), in his 
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famous study of a street-corner gang, the 
Nortons, describes how the members of this 
gang actively made bowling scores correlate 
with status in the group, by appropriate 
encouragement and heckling. Indeed, Benoit- 
Smullyan (1944) argues that what produces 
discontent and rebelliousness in people, 
whether in a small group or great society, is 
not so much oppression or low status itself, 
as it is the possession of disparate statuses. 
The rebel isn’t the man who is low on all 
ladders; he is the man who is low on some and 
high on others. 

It is hard for the general-impression hy- 
pothesis to cope with these cousins of the halo 
effect. The general-impression hypothesis can 
directly explain only why people misperceive, 
not why they might be dissatisfied with what 
they perceive. It is possible that in some cases 
misperception plays a mediating role, leading 
to inappropriate behavior which results in 
punishment and dissatisfaction with the social 
structure. But in many cases the discrepant 
orderings are clear enough and misperception 
unlikely. 

It seems indeed that people have a predilec- 
tion for single orderings, an antipathy toward 
multiple orderings. This is not to say that the 
predilection itself is not rooted in the relative 
ease with which people can handle single 
orderings as compared with multiple ones. In 
recent years psychologists have realized that 
people find many ways and go to surprising 
lengths to avoid cognitive strain (Festinger, 
1957; Heider, 1958). But there is a big differ- 
ence between the trivial statement that multi- 
ple orderings are naturally harder than single 
orderings and the statement that they are so 
much harder that people are discontent with 
them and actively seek to transform them to 
single orderings. 

Since social scientists are people first of all, 
it might be predicted that they bear the same 
predilection. Do they? It appears that they do 
and that the predilection is a prime source of 
difficulty for social theory. Benoit-Smullyan 
(1944), Lenski (1954), and Rose (1958) all 
have chided their fellow social scientists for 
their tendency to treat society as having a 
single hierarchy or ordering rather than dis- 
tinguishing the ordering variables of wealth, 
power, education, prestige, and so on. Lenski 
says that from Aristotle on, most social philoso- 


phers and social scientists have made this 
particular oversimplification. Despite recogni- 
tion early in this century of the desirability of 
considering these ordering variables separately, 
there is a stubborn, recurring urge to reduce 
them to a single ordering. The modern, sophisti- 
cated way to do it is to average each subject’s 
ranks on the several orderings, an involved 
procedure which loses much and gains nothing, 
according to Rose and Lenski. Another way, 
one discussed by Benoit-Smullyan, is to insist 
that only one of the ordering variables is im- 
portant, the others being only direct or indirect 
expressions or consequences of it. Thus a 
Marxist might say that what counts in a 
Western society is wealth, and power and 
prestige are merely derivatives of wealth. 
Pareto might say instead that wealth is only 
the material expression of social power. 

In treating societies and cultures and social 
systems themselves, social scientists have also 
suffered from the predilection for single order- 
ings. The early sociologists and cultural anthro- 
pologists taught, for example, that there is a 
single ordering of cultures from primitive to 
advanced—the so-called doctrine of unilinear 
evolution (Steward, 1953). This teaching is 
rejected by most anthropologists today. What 
is wrong with it? Not that there are no ways 
in which cultures are ordered from primitive to 
modern, but that there are many. A given 
culture might occupy a high rank in the order- 
ing by metallurgy but a low rank in the order- 
ing by mathematics, The early anthropologists 
blinded themselves to this multiplicity of 
discrepant orderings in their predilection for a 
single ordering. Even today some anthro- 
pologists cling to the conception of cultures as 
having a single ordering by reducing the 
number of steps in the ordering to a paltry 
three or four—“savagery,” “barbarianism,” 
“civilization.” This degradation of the doctrine 
keeps it alive for the sake of the predilection, 
but in a form that is as useless as it is un- 
assailable (Steward, 1953). 

Still another topic on which thinking may be 
blurred by the predilection for single orderings 
is leadership. Scientists and laymen both call 
a number of different attributes and functions 
leadership—popularity, power, skill, promi- 
nence, for example. All of these are variables 
which to some extent order the members of a 
group, and in principle it is quite possible for 
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these orderings to be discrepant, with different 
members at the top. But of course people will 
expect all the orderings to be the same, will 
expect the most popular member to be the 
most powerful and most skillful and most 
prominent, to be, in a word, the leader. It 
seems so fitting to say, “First in war, first in 
peace, and first in the hearts of his country- 
men.” This expectation lands the social 
scientist in a curious impasse. To begin with, 
he has the predilection, like anyone else, and 
his discussion of leadership is likely to show it. 
But at the same time as a wiser colleague 
criticizes him for confusing popularity with 
eminence and skill with influence, there may 
be justification for him in the fact that his 
subjects, having the predilection too, tend 
actually to bring about the single ordering, 
investing a single leader with all these things. 
In fact, if the leader renounces some of his 
leadership functions and tries to distribute 
them among his followers, as many enlightened 
social scientists would have him do, what is 
his reward? He has violated his followers’ 
expectation of a single ordering and they are 
likely to be displeased rather than grateful 
(Berkowitz, 1953). 

Even in colloquial language one can detect 
people’s tendency to act as though there is 
only one ordering of a set of people. They seem 
to delight in gross expressions like “Joe’s tops,” 
or “Joe’s the greatest,” avoiding any hint that 
there may be orderings on which Joe does not 
stand uppermost. 

Probably in any case where people are made 
to think of an ordering, even an ordering of 
elements other than people, they will resist 
thinking of any disparate ordering. If so, a 
kind of halo effect may be a more pervasive 
problem for judgments and ratings than is 
ordinarily realized. The study of social de- 
sirability of traits and behaviors seems to have 
run afoul of this problem. At first, it would 
Seem a straightforward matter to have the 
subjects judge items drawn from personality 
inventories as to their social desirability 
(Edwards, 1957). But it does not turn out to 
be straightforward (De Soto, Kuethe, & 
Bosley, 1959; Johnson, 1955). These items were 
originally selected so as to vary widely on 
desirability for personal well-being or health, 
and this variation seems to provide a salient 
ordering of the items for the subjects. As a 


consequence, they have a troublesome tend- 
ency to judge their desirability for personal 
well-being when they are supposed to judg 
their social desirability. 

Even in defining the key mathema 
property of an ordering relation called tra 


transitivity of a relation is usually stated 
something like this: If A bears the relation to 
B, and B bears the relation to C, then A must 
bear the relation to C. It is sometimes con 
venient, however, to state it thus (De Soto & 
Kuethe, 1959): If A bears the relation to C, 
and C bears the relation to B, then A mi t 
bear the relation to B. These two formulations 
are equivalent, mathematically speaking. B 
people find the former much easier to appre- 
hend than the latter, which seems to confuse 
if not mislead them. It asks them to think 

the ordering A — C — B, a task that would be 
easy but for the obtrusion of the overlearned 
disparate ordering A —> B — C. In the other 
formulation, of course, A —> B —> C is the only 
ordering that presents itself, and understandi 
is quick. 


EXPLANATION OF THE PREDILECTION: 
EXPERIMENT 


Evidently the predilection for single order- 
ings materializes in many forms. Knowing this. 
should eliminate the surprise and bewilderment 
of people like Rose (1958) who encounter it 
unawares. But what more can be said about it? 
Can one go beyond it to its source or explana- © 
tion? Certainly the data cited so far help little 
in the quest; they simply show its generality. 
But how can one do more? Can one even pose 
some possible sources or explanations? a 

It has been suggested that multiple orderings 
are odious because they are difficult. Is there 
any way to measure this difficulty? Is there 
any way to demonstrate that multiple order- 
ings are more or less difficult than somethi 
else? 

The difficulty of single social structures, _ 
including single orderings, has been success- | 
fully measured by the time people take to — 
learn the structure under controlled conditions _ 
(De Soto, 1960). It should be Possible to apply — 
this measure to multiple orderings. It should 
take a long time to learn two orderings of a — 
set of people. But a long time in comparison i 


i 
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with what? Certainly it should take longer to 
learn two orderings of a set of people than one 
ordering of them, if only because of the differ- 
ence in size of the tasks. It would be more 
enlightening to compare the learning of the 
two orderings with other tasks of comparable 
size. Comparison of this sort might decide 
whether or not there is something peculiarly 
difficult about multiple orderings, and if there 
is, what it is. 

The following experiment provides such 
comparisons. In this experiment all subjects 
were given tasks of about the same magni- 
tude: all were required to learn two social 
structures or organizations at once. In one 
condition subjects were required to learn two 
discrepant orderings of a set of names: an 
ordering by height and an ordering by age, 
the two orderings being uncorrelated. Sub- 
jects in a second condition had to learn 
two discrepant groupings of a set of names: a 
gtouping by majors and a grouping by houses— 
a division of the names into groups according 
to majors and a disparate division of them into 
groups according to houses. Subjects in a third 
condition were required to learn one ordering 
and one (uncorrelated) grouping of a set of 
names, for example, an ordering by height and 
a grouping by majors. In a fourth condition 
subjects were required to learn an ordering 
by age of one set of names and an ordering by 
height of another set of names—two orderings, 
as in the first condition, but of distinct sets 
Tather than the same set. In a fifth condition 
Subjects learned a grouping by majors of one 
Set of names and a grouping by houses of 
another set. In a final condition subjects 
learned an ordering of one set of names and a 
grouping of another set. In the conditions with 
one set of names, only men’s names were used; 
In the conditions with two sets of names, one 
Set was composed of men’s names and one of 
Women’s names, to make them distinct. In all 
Cases a grouping of a set was a division of them 
Into two groups. 

he experimental conditions, as well as one 
Set of experimental results, are summarized in 
Table 1. The conditions were chosen with a 
Hh to testing several possible explanations of 

3 Predilection for single orderings. 
sha tf every condition in the left-hand column 
ould turn out to be harder than its counter- 
Part in the right-hand column, one would 


have evidence that it is peculiarly difficult to 
handle two organizations of the same set, and 
the difficulty of two orderings of the same set 
would emerge as a special case of this general 
statement. 

2. If the upper conditions should turn out 
to be harder than the bottom condition of 
each column, indicating that people had 
difficulty learning two orderings at once or two 
groupings at once, whether they were dealing 
with one or two sets of names, one would be 
led to another general statement: it is difficult 
to handle two organizations of the same kind 
at the same time. Again, the difficulty of two 
orderings of the same set would emerge as a 

jal case. 

3. It would be quite conceivable, of course, 
for both of the preceding possibilities to be 
true, in which case the upper conditions of the 
left-hand column should be hardest of all and 
the lowest condition of the right-hand column 
easiest, with the other conditions intermediate 
in difficulty. In this case, the halo effect and 
its cousins would be the resultant of the two 
factors acting in concert. People have difficulty 
in handling two organizations of the same set 
of objects; people have difficulty in handling 
two organizations of the same kind; the 
difficulty of two orderings of the same set 
consists of these difficulties combined. 

4. On the other hand, if the two upper con- 
ditions of the left-hand column should emerge 
as the hardest of all but the bottom condition 
of the right-hand column should not turn out 
to be the easiest of all, the preceding general 
statements would be ruled out. Organizations 
of one set would be harder to learn than or- 
ganizations of two sets only if they were 
organizations of the same kind. Organizations 
of the same kind would be harder to learn than 
organizations of different kinds only if they 
were organizations of the same set. The pre- 
sumptive general statement would be: it is 
hard to handle two organizations of the same 

e same kind. 
ety the top condition of the left-hand 
column alone turned out to be abnormally 
difficult, any general statement would be 
barred. The conclusion Poa ae cone 

ing uniquely difficult about two order- 
ser ae a difficulty not paralleled 
elsewhere and therefore not derivable from a 
general principle as a special case. 
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6. Finally, if the conditions emerged as 
equally difficult, one would be led to conclude 
that there is after all nothing peculiarly diffi- 
cult about discrepant orderings of a set, that 
other arrangements of equal proportions or 
complexity are equally difficult. 


Method 


To check the generality of whatever pattern of 
difficulties emerged, the experiment was performed 
with two different arrangements of the subject’s task. 
In the first version of the experiment, results for which 
are reported in Table 1, the subjects were taught the 
social structures in an expedient way: the experimenter 
recited the structures to subject on each trial, then 
questioned him about the relation between pairs of 
people. A typical question would be, “Is Bob older 
than Dave?” After obtaining the subject’s answer, the 
experimenter gave him the correct answer. In the second 
version (Table 2) the experimenter was niggardly with 
information about the structures: he questioned the 
subject about the relation between pairs of people and 
(after obtaining the subject’s answer) gave the correct 
answer to each question, but gave no other information. 
This technique is the one used earlier for the study of 
learning a single social structure (De Soto, 1960). 

In the first version of the experiment the social struc- 
tures had six members each; in the second version, only 
three, because learning of larger structures would have 
been too slow. 

In the first version, a trial consisted of the experi- 
menter’s recitation of the social structure followed by 
6 questions, 3 about each structure. In the second ver- 
sion, a trial consisted of 12 questions, 6 about each 
structure. For both versions, the criterion of mastery 
was two successive trials on which the subject answered 
all questions correctly. 

The subjects for the experiment were male under- 
graduates of Johns Hopkins University. Each subject 
was run individually and served in a single experimental 


TABLE 1 
MEAN TRIALS To CRITERION 
(First Version of Experiment) 


condition. Five subjects were used in each condition for 
the first version; 16 in each condition for the second 
version, which produced greater individual differences, 


Results 


Examination of Table 1 shows that the sub- 
jects chose Outcome 4, Table 2 is a little more 
complicated, but seems to show the same thing, 
The interaction pattern is the same, as can be 
seen more easily if the middle row of figures is 
increased by an additive or multiplicative con- 
stant. There may be justification for doing this 
in the possibility that a grouping is easier than 
an ordering for three-member sets although not 
for six-member sets. Certainly the results in 
Table 2 are in far better correspondence with 
Outcome 4 than with any of the other proposed 
outcomes. An analysis of variance performed 
on the Table 1 data yielded an interaction sig- 
nificant at the .05 level; another performed on 
the Table 2 data yielded an interaction sig- 
nificant at the .10 level; and one performed on 
the pooled data yielded an interaction sig- 
nificant at the .01 level. 

The experiment then yields as a strong candi- 
date for the explanation of halo the statement: 
it is hard to handle two organizations of the 
same set of the same kind. But, as always, it 
Seems there must be an explanation lurking 
behind the explanation, an explanation of the 
explanation. Why is it hard to handle two or- 
ganizations of the same set of the same kind? 


OPERATION OF SCHEMATA 


So far, it has been taken for granted that 
people are facile in treating a set of objects as 
having a single ordering or a single grouping. 
Not that there seems reason to doubt the 


One Set Two Sets 
Two Orderings 16.2 6.6 
Two Groupings 10.0 6.6 
Ordering and Grouping 7.2 7.0 
TABLE 2 
MEAN TRIALS To CRITERION 
(Second Version of Experiment) 
One Set Two Sets 
Two Orderings 15.7 11.5 
Two Groupings 9.7 6.6 
Ordering and Grouping 8.4 10.5 
es 
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facility—it is manifest in many ways. De Soto 
and Kuethe (1958), for example, found that 
children as young as 9 can usually answer 
questions about the mathematical properties of 
ordering and grouping relations. They know 
that if A is older than B, it follows that B is 
not older than A (asymmetry). They know 
that if A is older than B, and B is older than Cc, 
it follows that A is older than C (transitivity). 
They know that if A is in the same grade as B, 
it follows that B is in the same grade as A 
(symmetry). They know that if A is in the 
same grade as B, and B is in the same grade 
as C, it follows that A is in the same grade 
as C (transitivity). Another kind of evidence 
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for the facility is provided by Scott’s (1956) 
study of the errors made by enlisted submarine 
personnel in naming people in their organiza- 
tional chart. Scott found that omissions (fail- 
ing to recall names) were the most common 
kind of errors. Echelon errors (placing people 
too high or too low in the chain of command) 
and unit errors (placing them in the wrong 
group) were relatively rare. Recalling names 
gave the subjects the most difficulty, not 
placing them in the ordering or grouping once 
recalled. 

Clearly, there is no mistake in saying people 
are facile in handling single orderings or 
single groupings. But there may be a mistake 
in leaving it at that. For people are not so good 
at everything. Valuable perspective on the 
ease with which people answer the questions 
about mathematical properties of ordering 
and grouping relations can be gained by con- 
trasting it with the difficulty they have with 
other formally similar questions. They have 
extraordinary difficulty with simple questions 
about logic, for example. If you tell someone 
that A implies B, and B is true, he is likely 
to infer mistakenly that A must be true. Even 
a slight variation of the relation “is older 
than,” changing it from a simple ordering to a 
weak ordering, produces dramatic difficulties. 
If you tell people that A is at least as old as B, 
and ask if B is at least as old as A, many will 
say “yes,” others will say “no,” and only a 
minority will correctly say he may or may not 
be (De Soto & Kuethe, 1958). 

Another contrast with people’s skill with 
orderings is their ineptitude with dominance 
structures (De Soto, 1960). Dominance struc- 
tures are poor relatives of orderings, having two 
of the three properties of an ordering—asym- 
metry and completeness, but not transitivity. 
They are called dominance structures because 
social dominance relations among animals 
(especially peck rights among birds) produce 
such structures. They are primitive and simple 
social structures, the inevitable outcome of 
the pair-wise encounters among animals by 
which they are formed. Yet most people, 
scientists included, consider them puzzling and 
complex as compared with orderings. Learning 
such a structure takes much longer than 
learning an ordering; people fail to profit from 
the properties it does have—in fact, they may 
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even be hindered by them, especially if they 
expect an ordering. 

In short, people are very good at handling 
orderings and groupings but very bad at some 
other similar tasks. It can be inferred that 
their facility with orderings and groupings 
does not reveal any general power of thought, 
but rather quite specific thinking mechanisms 
or processes which might aptly be called 
schemata. Thus one can say that people pretty: 
universally have well-functioning ordering 
schemataand grouping schemata. This proposal 
may smack of the instinct-naming or faculty- 
naming approaches which are now anathemas 
in psychology, but it need not mark the end of 
investigation, as naming an instinct or faculty 
so often did. 

For present purposes, it has value in suggest- 
ing a different tack in the explanation of the 
predilection for single orderings. Like most 
human traits and functions, a schema surely 
has adaptive or survival value. Skirting tele- 
ology, as biologists are prone to do, one can 
ask, what is a schema for? How does it serve 
the organism? What properties should a good 
schema have? 

Although it has rarely been made explicit 
in discussions of schemata, it appears that 
buried in the traditional meaning of schema 
are two key requirements. A schema con- 
tributes to cognition, but it only contributes. 
An ordering schema, for example, should 
enable a person to treat objects as ordered, 
but should not thereby blind him to other 
urgent aspects of the objects. In everyone’s 
life there must be countless situations where 
the ability to apply two schemata to a set of 
objects is well rewarded. This ability would 
develop through evolution or learning or 
both—and is evidenced in the lower left-hand 
mean of Tables 1 and 2. The ability to apply 
the same schema twice to a set of objects, on 
the other hand, might be a special case, less 
often needed, less often rewarded, hence less 
developed. The other key requirement of a 
schema is that it is multiply applicable. Treat- 
ing one set of objects as ordered must not bind 
the ordering schema so as to prevent treating 
another set as ordered. Again, the development 
of this ability might not be accompanied by 
equal ability in the special case of multiple 
applications of the same schema to the same 
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set of objects. Thus the upper means of the 
left-hand columns are greater than the upper 
means of the right-hand columns. 

This approach to the problem keeps in the 
forefront the fact that it is an achievement to 
treat objects as ordered at all, that the suc- 
cesses have to be accounted for as well as the 
failures. At the same time it exposes our 
crippling ignorance of the dynamics of schema 
arousal and application. 

In the enormous literature on serial learning, 
for example, one finds almost no discussion of 


the instructions with which the subject is ` 


initially induced to treat the items as ordered. 
Why this nonconcern with the instructions, 
almost a denial that they have been used? The 
explanation probably lies in the faith that all 
that goes on in such learning is conditioning, a 
primitive process that shouldn’t require in- 
structions. But the fact is that the instructions 
are critical. If the subject is told to remember 
the items in order he does so much better than 
if he is simply told to remember the items 
(Horowitz, 1960; Kay & Poulton, 1951; 
Postman, 1956). Appropriate instructions evi- 
dently arouse a helpful ordering schema in the 
subject—but this arousal is accomplished so 
easily and quickly that it is easy to overlook 
that it has occurred at all. 

Likewise in the literature on halo there are 
only suggestive glimpses of the ordering 
schema being engaged and disengaged. It has 
been noticed that the halo effect is less when 
judges rate one trait at a time than when they 
Tate one person at a time (Guilford, 1954). 
Why might this be? A plausible explanation is 
that two applications of the ordering schema 
to a set of objects conflict only if they occur 
simultaneously, that is, if the subject is think- 
ing of both orderings at the same time. Whena 
person judges on one trait at a time, he has to 
think of only one ordering of the ratees at a 
time, whereas when he judges one person at a 
time, he has to think all at once of the many 
orderings on which the ratee occupies positions. 
This explanation would also serve for another 
Phenomenon of ratings—the tendency for 
nearby traits on a rating form to intercorrelate 
more highly than remote ones, This tendency, 
only recently discovered, has been named the 
proximity error (Guilford, 1954), but it might 
be regarded as showing simply the gradual 
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dying away or disengaging of earlier applica- 
tions of the ordering schema. The proposition 
that the halo effect depends on the subject’s 
thinking of the orderings involved simul- 
taneously, disappearing to the extent he is 
able to think of them one at a time, may even 
help explain the rather puzzling observation 
noted earlier that halo may be stronger for 
orderings that are important to people than 
for the unimportant ones. The argument is 
this: the more important orderings are to a 
person, the more time he will spend thinking 
about them, and as a statistical consequence, 
the more likely he is to think about them 
simultaneously, 

But it is easy for discussion of schemata and 
their operation to get far out of empirical 
bounds. There seems to be good evidence that 
people have a far-ranging predilection for 
single orderings. There seems to be evidence 
that the predilection and other phenomena 
can be elucidated by crediting people with an 
ordering schema that is subject to certain 
principles of operation, Rather than more 
speculation, what is needed now are experi- 
mental studies frankly concerned with these 
things. 


SUMMARY 


It was proposed, as an inductive generaliz- 
tion, that people have a predilection for a single 
ordering or ranking of a set of people or other 
objects, an aversion to discrepant orderings of 
the set. In ratings, this predilection shows up 
as the halo effect—the tendency to reduce 
discrepant orderings on different traits to 4 
single ordering by merit. In social structures, 
including small groups and large societies, it 
shows up in general dissatisfaction with dis- 
crepant orderings of the members and in 
efforts to eliminate the discrepancies. In the 
theorizings of social scientists about society 
and culture it shows up as a stubborn urge 
somehow to reduce discrepant orderings of 
people, or classes, or cultures, to single order- 
ings. And in other judgmental and intellectual 
tasks people show difficulty in handling dis- 
ctepant orderings which contrasts strongly 
with their facility in handling single orderings. 

Once the generalization had been established 
by appeal to such phenomena, an effort was 
made to give it context and explanation 
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through experimentation. The difficulty of 
learning two discrepant orderings of a set of 
people was compared with that for five other 
tasks in which subjects also learned two social 
structures at once. It was found that two 
orderings of a set of people were relatively 
difficult to learn, providing one more instance 
of the generalization. From the pattern of 
difficulties over the six conditions, however, it 
was inferred that the aversion to a set having 
discrepant orderings is a special case of an 
aversion to a set having more than one or- 
ganization of a given kind. Certain other pos- 
sible explanations appeared to be ruled out 
by the data. 

Finally, a discussion was given of how 
schemata might operate to produce the above 
and related phenomena. 
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OTIVATIONAL factors affecting the 
performance of schizophrenics, es- 
pecially social rewards and punish- 

ments, have become the focus of considerable 
research interest. This interest partially stems 
from the speculation of such writers as 
Cameron (1947) and Fromm-Reichmann 
(1950), who have attempted to account for 
schizophrenic withdrawal (and the concomi- 
tant psychological deficit) as a defensive 
response to criticism, failure, and threat. 
These speculations have received some empiri- 
cal support from the studies of Rodnick and 
Garmezy (1957) and their students, who have 
reported that schizophrenics show behavioral 
deficits following censure in experimental tasks 
involving stimulus generalization (and dis- 
crimination) and concept formation. Olson 
(1958), however, using a simple digit symbol 
task, reported that schizophrenics responded 
with enhanced performance following censure 
(failure instructions), although they appeared 
to show more enhancement following praise 
(success instructions) than they did following 
censure. There was no evidence in Olson’s 
results to suggest that censure had any detri- 
mental effect upon schizophrenic performance. 

Rosenbaum, Mackavey, and Grisell (1957) 
studied the effect of “ordinary social motiva- 
tion” and “biological motivation” upon the 
simple reaction time of schizophrenics. The 
biological motivation condition involved the 
presentation of a moderate faradic current 
delivered directly to the reacting finger. This 
stimulation began with the auditory reaction 
signal and terminated with the performance of 
the required response. The authors reported 
that the schizophrenic deficit in reaction time 
(as compared to normal controls) under the 
social motivation condition was substantially 
eliminated under the biological motivation 
condition. 


1 Now at the University of Florida. 
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It should be noted that the Rosenbaum, 
Mackavey, and Grisell study used only a single 
type of socially motivating instruction, one 
apparently involving a mild urging to do well, 
The work of Rodnick and Garmezy (1957), 
Olson (1958), and others strongly suggests 
that any adequate test of such motivational 
instructions on performance must include at 
least three types or levels of instructions; 
positive, negative, and neutral or nonevalua- 
tive. This consideration seems especially im- 
portant as the instruction used by Rosenbaum 
and his collaborators appears most similar to 
the nonevaluative condition typically found 
to be most ineffective in modifying the level 
of performance. 

The purpose of the present study is to in- 
vestigate further the effects of social motiva- 
tion upon schizophrenic reaction time, par- 
ticularly the differential effects of success, 
failure, and nonevaluative instructions. Follow- 
ing Olson (1958), it was hypothesized that 
both the success and failure instructions 
should enhance the reaction time performance 
of schizophrenics, with success instructions 
producing more enhancement than failure 
instructions. Again following Olson, it was 
hypothesized that the schizophrenics perform- 
ance would not show any detrimental effects 
following failure (social punishment). 


METHOD 


Subjects. The experimental group consisted of 42 
male patients bearing the uncontested diagnosis of 
schizophrenia. All were hospitalized in “open buildings” 
at the Veterans Administration Hospital, Knoxville, 
Towa. None had received lobotomies, and the over- 
whelming majority were receiving a minimal dosage of 
tranquilizing drugs. 

The control group data? previously reported by 
Shankweiler (1959), were gathered from a group of 42 
male general medical patients at the University Hospi- 
tals and Veterans Administration Hospital, Iowa City: 
Any patient who was acutely ill, whose behavior raised 
the question of psychosis, who had a history of treat- 
ment for psychiatric disorders, or whose history sug- 


* The authors are indebted to Donald P. Shankweiler 
for making these data available to them. 
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gested mental deficiency, epilepsy, severe head injury, 
or chronic alcoholism was excluded from the control 
group. 

The mean age of the experimental group was 39 
years, and the mean educational level was 10 years. The 
mean age of the control group was 38 years, and the 
mean educational level was 9 years. Neither the differ- 
ences in age nor those in mean educational level were 
statistically significant (¢ < 1 in both cases). 

AJl subjects were between the ages of 16 and 62 years, 
had normal use of both upper extremities, had sufficient 
vision to see the apparatus with ease, and were capable 
of sitting up for an hour of testing and of understanding 
and cooperating in the experiment. 

Apparatus. The reaction time apparatus, which was 
identical with that used in previous studies (Benton & 
Blackburn, 1957; Benton, Jentsch, & Wahler, 1959; 
Blackburn, 1958; Shankweiler, 1959), provided for the 
presentation of one of two light stimuli to the subject, 
each presentation being preceded by a warning buzz of 
2seconds duration. 

Procedure. All subjects were first given 30 trials of 
simple reaction time to familiarize them with the gen- 
eral nature of the task. Following the simple reaction 
time, all subjects were given two sets of 30 trials, each 
involving choice reaction to one of the two stimulus 
lights, 

The initial set of 30 choice trials always was pre- 
ceded by the same standard orienting instructions, 
which emphasized that only the key with the il- 
luminated light above it should be depressed. As a 
check on whether or not the subject understood the 
instructions, 5 practice trials, which were not counted in 
the 30 trials of the initial set, were made part of these 
orienting instructions. Any response completed in .100 
second or less, or any depression of the key on the in- 
correct side, regardless of whether the correct key was 
depressed before or after, was considered an anticipa- 
tory error and an additional trial was added at the end 
of the set. Only the times of the correct responses were 
recorded, 

The final set of choice trials was always preceded 
by one of three sets of motivating instructions: non- 
evaluative, success, or failure. The schizophrenic and 
control groups were each randomly divided into three 
subgroups based upon which set of motivating instruc- 
tions they were given. A rest period of approximately 
2 minutes was permitted each subject, after which one 
j the three following motivating instructions was 

ven: 

Nonevaluative Instructions: We’ll record another 
Set of reaction times. We always make at least two 
determinations of this choice reaction time so that 
we have more reliable results. We get more accurate 
readings that way. That’s why we have so many Tepe- 
titions, Now let’s try this choice test again. The same 
instructions apply as before. Let’s have a few warm- 
ups first. 

Success Instructions: You've done very well. Your 
Teaction time is quite a bit faster than average. It’s 
teally a very good performance. Your average time 
yas about a quarter of a second. For most people 
E about a half of a second. That’s very good. Now 
let’s try this choice test again. Sometime during the 
test PII tell you how you're doing. [After Trial 10 


and Trial 20, the experimenter assured the subject 
that his performance was still “very good.”] Let's 
have a few warmups first. 

Failure Instructions: You haven’t done very 
well. Your reaction time is slower than average. It’s 
slower than it should be. Your average time was 
about half a second, For most people it’s about a 
quarter of a second, So you see, yours is pretty slow. 
Now let’s try this choice test again. Sometime during 
the test PII tell you how you’re doing. [After Trial 
10 and Trial 20, the experimenter commented that 
the subject’s performance remained slow.] Let’s have 
a few warmups first. 

These motivating instructions were followed by 
three warmup trials and then by the 30 trials of the 
final set. 


RESULTS 


The means and SDs of the 30 initial trials 
and the 30 final choice reaction times for the 
six subgroups are presented in Table 1. Also 
presented in Table 1 are the means and SDs of 
the gain scores, which were obtained by sub- 
tracting the mean final reaction from the mean 
initial reaction time for each subject. These 
gain scores were the criteria used in the major 
analysis. 

As might be expected, the schizophrenic 
subjects were consistently slower than the 
corresponding controls. The three subgroups 
of schizophrenics and the three subgroups of 
controls both showed small and nonsignificant 
(F < 1 in both cases) differences in initial 
choice reaction time. i 

An inspection of the gain scores in Table 1 
indicates that, under the nonevaluative instruc- 
tions, both the schizophrenics and controls 
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showed no marked change in reaction time. 
Success instructions and failure instructions 
both resulted in reductions in reaction time for 
both groups, with the failure instructions 
producing the larger reduction. 

The gain scores were analyzed by a simple 
two-by-three factorial analysis of variance 
(Lindquist, 1953, p. 207f.)* with the following 
results: 

1, The diagnosis-by-instruction interaction 
was nonsignificant (F < 1 for 1, 2 df), indicat- 
ing that the instructions did not differentially 
affect the gain scores of the schizophrenics and 
the controls. 

2. The main effect of diagnosis for the in- 
struction subgroups combined was nonsig- 
nificant (F < 1 for 1, 78 df), indicating that 
there were no differential gains by the schizo- 
phrenic and the control groups as a function 
of the instructions. 

3. The main effect of instructions over the 
combined diagnostic groups was highly sig- 
nificant (F = 6.89 for 2, 78 df, p < .01), 
indicating that the instructions were sig- 
nificantly different in their effect upon the 
gain scores. 

The simple effects of instructions were then 
analyzed by means of ¢ tests, using the pooled 
error term from the overall analysis, with the 
following results: 

1. The difference in mean gain between the 
failure and the nonevaluative instruction 
groups was highly significant (¢ = 3.70, 
p < 01). 

2. The difference in mean gain between the 
success and nonevaluative instruction groups 
was not quite significant (¢ = 1.75, p < .10). 

3. The difference in mean gain between the 
success and failure instruction groups was 
also not quite significant (f = 1.93, p < .10). 

Since the initial reaction times of the schizo- 
phrenic subjects were significantly slower than 
those of the control subjects, the dependence 
of the gain scores on the initial reaction time 
was tested. Rank-difference correlations be- 
tween initial and gain scores were computed 
for each of the six diagnostic-instruction 
groups. Three of the coefficients were positive 
and three were negative. None was significant. 

4 An application of Hartley’s test for homogeneity 


of variance (Walker & Ley, 1953, p. 192) indicated no 
significant differences in homogeneity of variance. 


Discussion 


The results of the present investigation are 
noteworthy in that they offer striking evidence 
that the performance of schizophrenics can be 
improved by verbally presented social re- 
inforcers, although the absolute level of the 
improved performance is still defective. The 
often reported failure of schizophrenics to 
respond to social motivation is not confirmed 
in this investigation. As hypothesized, both 
the success and failure instructions resulted in 
an enhancement of the choice reaction time 
of the schizophrenic subjects as well as the 


- control subjects, although only the result with 


failure was at an acceptable level of signifi- 
cance. Contrary to expectation, the failure or 
social punishment instructions resulted in 
greater enhancement of the performance of 
both groups than did the success condition, 
although this finding is mot statistically re 
liable. As hypothesized, there was no evidence 
to support the contention that schizophrenics 
show marked detrimental effects following 
socially punishing instructions. On the con- 
trary, the schizophrenic failure group showed 
the largest mean gain score of any of the six 
groups of subjects. 

The present results are in striking agreement 
with those of Shankweiler (1959), who studied 
the effect of these same motivating instructions 
on the choice reaction time of brain damaged 
subjects. The mean gain scores of the three 
subgroups of schizophrenic subjects used in the 
present investigation are within five milli- 
seconds of the means of Shankweiler’s brain 
damaged subjects; the means for the initial 
and final reaction times are also remarkably 
similar. Benton, Jentsch, and Wahler (1959); 
using different subjects on the same apparatus, 
previously demonstrated a similarity between 
brain damaged and schizophrenic subjects in 
both simple and choice reaction time undet 
nonevaluative instructions. It therefore ap- 
Pears that schizophrenic and brain damage 
patients may show not only a similarity in 
general reaction time performance, but also 4 
similarity of response to these motivating 
instructions. 

Studies of success and failure in normals 
(eg., Steisel & Cohen, 1951; Williams, 1955) 
tesemble the foregoing in that they have fairly 
consistently reported that mild failure in 
telatively simple tasks facilitates performance 
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to a larger extent than success, and the present 
findings with the control subjects are rather 
consistent with this. The results of the studies 
of success and failure in schizophrenia have 
been far from consistent. While Rodnick and 
Garmezy (1957) reported facilitation with 
success and decrements with failure, Olson 
(1958) reported facilitation with both success 
and failure, with the former more facilitating 
than the latter. The present results with 
schizophrenics are rather like those expected 
with normal subjects in that the failure re- 
sulted more in facilitation than success, 
although the difference is not statistically 
reliable. Leventhal (1959), using a verbal con- 
ditioning task, also reported social punishment 
(“not so good”) as a more effective reinforce- 
ment for schizophrenics than social reward. 
Since these quite discrepant results have been 
gleaned from studies involving, at the very 
least, different tasks, different instructions, 
different subjects, and different experimenters, 
the discrepancies may simply be a function of 
any or all of these differences. What appears 
necessary is a systematic program of research 
in schizophrenia directly attacking these ap- 
parent contradictions. 


SUMMARY 


The main purpose of the present study was 
to determine the effectiveness of three types 
of motivating instruction, success, failure, and 
nonevaluative, on the choice reaction times of 
42 schizophrenic patients and to compare their 
performances with those of a similar group of 
42 hospitalized, nonpsychiatric patients. All 
subjects received an initial set of 30 choice 
Teaction times under standard instructions; 
this was followed by the differential motivating 
instructions and by a final set of 30 choice 
reaction times. The criterion score for each 
subject was the difference in mean time scores 
between his 30 initial and 30 final trials. 

The schizophrenic subjects were consistently 
slower in reaction time than the-control sub- 
jects. The analysis of the gain scores indicated 
pat the instructions were differentially effec- 
i ve with the rank order of effectiveness being 

ailure, success, and nonevaluative for both 
schizophrenics and controls. Only the difference 


between the failure and nonevaluative instruc- 
tions was statistically reliable. There was no 
difference in gain scores between the schizo- 
phrenic and control subjects. 

The similarities and the differences between 
the present findings and previously reported 
research were discussed in terms of differences 
in task, instructions, experimenters, and 
subjects, Additional research was suggested as 
necessary to resolve the several contradictions 
noted. ‘ 
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is rather commonly defined in terms of the 

words a person uses to describe himself. 
One person describes himself as honest, en- 
ergetic, impulsive, and self-assertive; his self- 
concept differs correspondingly from that of a 
person who describes himself as sensible, shy, 
earnest, and inarticulate. Such an interpreta- 
tion implies that the words have common 
meaning for the different persons concerned. 

On the other hand, Kelly (1955) in his work 
on “personal constructs” has emphasized the 
differences in the way individuals use concepts 
about personality. Differences in self-descrip- 
tions, from this point of view, may represent 
differences in the meanings the persons are 
giving to the words used, rather than differ- 
ences in the way they view themselves. 

In interpreting self-descriptive material, 
then, it becomes an important question how 
much disagreement exists in the meanings 
subjects give to the words they use to describe 
themselves. That some such disagreement 
exists, no one seriously questions. But many 
psychologists seem willing to take self-descrip- 
tions in terms of traits (eg., adjective 
checklists, trait rating scales, the typical free 
self-description) as a first approximation to 
self-concepts: How serious are the resulting 
errors likely to be? 

In evaluating the importance of differences 
in the meanings given to words, it seems ap- 
propriate to consider the fineness of the dis- 
tinctions that must be made with them. If 
self-descriptions differ only slightly from per- 
son to person, even a fairly small variation in 
the meanings assigned to words by subjects 
will create serious difficulties in making dis- 
criminations, whereas the same absolute 
amount of variation may be of little conse- 
quence if the self-descriptions differ widely. 
The practical question is: How large are differ- 
ences in word meanings relative to differences 
in the use of the words by subjects to describe 
themselves? The present investigation repre- 
sents an attempt to provide an answer to this 
question for a situation which is important 


E psychological research, the self-concept 


because of its frequent use in psychological 
research, namely, college student subjects in 
a classroom setting. 

Specifically, the question may be considered 
in the following terms: Following Osgood, 
Suci, and Tannenbaum (1957), Cronbach 
(1958), Hays (1958), and others, a set of con- 
cepts may be conceived of as occupying a 
multidimensional “semantic space,” which is 
defined by the relationships among the con- 
cepts in such a way that the similarity in 
meaning of the concepts is represented by 
their closeness in the space. In such a space, 
concepts identical in meaning occupy the same 
point in space, concepts similar in meaning are 
close together, concepts opposite in meaning 
are maximally distant, etc. The self-concept 
may also be located in such a space, with its 
position defined in relation to the various 
other concepts; thus if the concepts defining 
the space include careful, honest, and emo- 
tional, and a person conceives of himself as 
very honest, moderately careful, and not at all 
emotional, his self-concept will be located 
close to the point occupied by honest, at a 
moderate distance from careful, and far from 
emotional, 

In these terms, the question becomes: What 
happens when we compare the “semantic 
spaces” of different persons? How great 1S 
disagreement on concept meanings (repre 
sented by variation from person to person 1m 
the set of interconcept distances) compared to 
differences in the attribution of concepts to 
the self (represented by variation from person 
to person in the set of concept-self distances)? 
Given a suitable method for measuring the 
distances in a person’s “semantic space,” this 
question can be answered empirically for any 
particular population of persons. 


METHOD 
Subjects 
The subjects were from a class in the introductory 
study of personality at the University of Nebraska. 
Most were in their second semester of psychology. All 
but a few were nonpsychology majors—mostly liberal 
arts students taking the course to complete a university 
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requirement. The data were gathered during a regular 
meeting of the course, as a class exercise. 


Concepts 

In order to insure that the words used in the experi- 
ment be representative of a variety of aspects of per- 
sonality, random samples were drawn from a pool of 82 
adjectives, which had previously been selected by the 
writer to represent Cattell’s (1946) “personality sphere” 
(in turn based on a condensation of Allport and Odbert’s 
[1936] list of all the trait names in an unabridged Eng- 
lish dictionary). The 82 adjectives included two for 
each of 41 of the 42 bipolar dimensions in Cattell’s 
most recent version of the personality sphere (1957, 
Appendix 2). The dimension omitted was “Marked 
Interest in Opposite Sex vs. Slight Interest in Opposite 
Sex,” for which no satisfactory single-word equivalents 
were found. 

Three samples of six concepts each were independ- 
ently drawn from the 82-adjective pool by means of a 
table of random numbers; each sample was used with a 
different third of the subjects, to provide three replica- 
tions of the experiment. The three samples of concepts 
are listed in Table 1. 


| Measurement of Distances in the Semantic Space 


R Since no one technique of measuring distances in a 
semantic space” is entirely free from objections, three 
different methods were used. 

1. Direct rating. As one measure of the concept- 
concept distances, direct ratings of the similarity in 
meaning of pairs of concepts were used. For each con- 
Cept in turn of the group of six, the subject rated its 
similarity in meaning to each of the other five, on a 
Seven-point graphic scale with the end points labeled 

Practically the same” and “just the opposite.” The 
Subject was permitted to use whatever criteria of 
similarity of meaning he wished. The 30 resulting rat- 
Ings represented two of each distance, one in each direc- 
a the two ratings were summed to yield estimates of 

le 15 concept-concept distances for each subject. 
at a corresponding direct meastre of concept-self 
of fae the subjects simply rated themselves on each 
Sile e six concepts, using seven-point graphic scales 
ie the end points labeled “very much so” and ‘just 
igh Opposite.” The rating was done twice, and the two 
i ngs summed to yield estimates of the concept-self 

ee for each subject. 

a emantic Differential. Os} ’s “Semantic Differ- 
ential” (Osgood et al., 1957) EEA a means of meas- 


TABLE 1 
Lists or Concepts UsED 

Form A Form B Form C 
donplaining submissive inflexible 
aaa inant solemn dependent 
aoa expressive responsible 
inconsid, considerate tough 
een inflexible persevering 

essive obstructive irresponsible 


uring distances in a semantic space based on similarities 
in connotative meaning. Each of the six concepts, and 
the self, was rated on six seven-point bipolar graphic 
scales—two for each of Osgood’s dimensions of value, 
activity, and potency. The scales used were active- 
passive, foolish-wise, weak-strong, good-bad, hard- 
soft, and calm-excitable; they were chosen on the basis 
of data cited by Jenkins, Russell, and Suci (1958), and 
the instructions used were adapted from theirs. 

The projection of each concept and the self on each 
of the three main dimensions was obtained by com- 
bining appropriate pairs of ratings (reversing direction 
where necessary), and the 15 concept-concept and 6 
concept-self distances estimated for each subject by 
computing the corresponding 21 Ds (where D* equals 
the sum of the squares of the differences on the three 
dimensions). 

3. Rating of role persons. This technique emphasized 
denotative, rather than connotative, meanings, by 
using a set of persons to provide a frame of reference 
within which distances were measured. Each subject 
selected a set of 12 persons who had played specified 
roles in his life, by means of an adaptation of Kelly’s 
“Role Title List” (1955, p: 227). Each of these persons 
was rated on each of the six concepts, on seven-point 
graphic scales with the end points labeled “very much 
so” and “just the opposite.” The subject also rated the 
similarity of himself to each of the 12 persons on scales 
running from “very similar” to “just the opposite.” 

‘The persons were then considered as dimensions of a 
semantic space, and the ratings as projections on these 
dimensions, and Ds computed to yield estimates of the 
15 concept-concept and 6 concept-self distances for 


each subject.! 


Procedure 

The various rating scales were assembled in a three- 
part questionnaire, with the parts corresponding ap- 
proximately to the three methods fee ing Hes : 

iven , nonself ratings preceded rat im 

os valk This questionnaire was prepared in three forms, 
A, B, and C, each of which used a different set of con- 
cepts (see lists in Table 1) but was otherwise identical 


ith the others. ý S 
i ‘An effort was made to standardize as far as possible 
the format of the rating scales and the wording of the 
instructions for the three parts of the questionnaire. 
These instructions included “work fairly rapidly, 
making each judgment on the basis of your first im- 
pression,” and “do not hesitate to use the extreme ends 
of the scales, whenever these are appropriate, as well 
as cautions against omitting items, talking during the 
experiment, etc. 

ing, D can be interpreted as a dis- 
tance i Euclidean space only if the persons are con- 
ae to one another. However, with 
of persons, D should pro- 


vide a good indication of relative distances between con- 


and disliked, same and opposite sex, authority figures 


and peers, etc. 
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Subjects were randomly assigned one of the three 
forms of the questionnaire, and completed its three 
parts in the order described. The data were collected 
within a 50-minute class period, with most subjects 
requiring between 30 and 40 minutes for the task. 


Analysis of the Data 


Seventy-one subjects participated in the experiment. 
For ease in handling the data, only complete question- 
naires were used for analysis—in a few cases where only 
single items were omitted they were arbitrarily assigned 
a middle rating of 4. The arrangement of the question- 
naire made a skipped last page a .fairly common 
hazard—seven questionnaires were eliminated on this 
ground. Eight others were excluded for other devia- 
tions from instructions—mostly for failure to rate one 
or more persons in Part 3. Two subjects failed to finish 
in the allotted time. Of the 52 remaining questionnaires, 
which form the basis of the subsequent analysis, 19 
were Form A, 17 Form B, and 16 Form C. 

The 15 concept-concept distances and the 6 concept- 
self distances were computed for each subject for each 
method of measurement as described above. As com- 
puted, the distances were in units that differed for each 
of the three measurement methods. Furthermore, sub- 
ject-to-subject differences in mean and dispersion of 
ratings, due to differences in interpreting the rating 
scale or differences in response style, were to be ex- 
pected, and in fact were apparent in the raw data. To 
make comparisons across methods and subjects 
meaningful, therefore, the set of distances for each sub- 
ject for each method was converted to rank-order 
form, The 15 concept-concept and the 6 concept-self 
distances were ranked independently, because it was 
desired to examine the stability of the two separately. 
This procedure also has the advantage of minimizing 
any lack of comparability of concept-concept and con- 
cept-self distances that might result from the somewhat 
different forms of rating used for the two in two of the 
methods, or from possible differences in response set for 
judgments involving or not involving the self. 

For each form of the questionnaire, mean Spearman 
rank-order correlation coefficients (9) were computed 
among the rankings of concept-concept and concept- 
self distances as follows: 

i 1. Consistency over all rankings, To obtain an indica- 
tion of overall consistency of ranking, despite variation 
in both subjects and methods, Kendall’s coefficient of 
concordance (W) was computed over the 3 W sets of 
rankings in each group (where W is the number of sub- 
jects in the group, i.e., 19, 17, or 16).? This was done 
separately for the concept-concept and concept-self 
distances, and then each of the six resulting Ws was 


4 No correction was made for tied ranks. The effect 
of such a correction would be to raise W very slightly, 
and to about the same extent for concept-concept and 
concept-self distances. In making comparisons among 
the methods, however, allowance may be made for the 
fact that ties were somewhat more frequent in the 
“direct rating” method than in the other two, and 


hence its consistency is probably slightly und 
timated. P y slightly leres- 


converted to 5 by the relation 6 = (mW —1)/(m—1), 
where m is the number of arrays over which W is com- 
puted (Kendall, 1948). 

2. Consistency within methods, If the three measure- 
ment methods are measuring different things, but 
measuring them dependably, the overall consistency 
of rankings may be low, but consistency within methods 
should be considerably higher. Therefore, for each of the 
three methods, Ws were computed over the N subjects 
in each group, separately for the concept-concept and 
concept-self distances, and each W converted to / as 


above. 
3. Consistency within subjects. Subjects may display 
individual differences, consistent over measurement 
methods, in the assignment of meaning to concepts 
(“personal constructs”), and it is ordinarily assumed 
that they display consistent individual differences in 
the “meanings” they assign to themselves (i.e., self- 
concepts). These should result in within-subject con- 
sistency among rankings that is substantially higher 
than consistency measured across subjects, Therefore, 
Ws were computed for each subject over the three 
measurement methods, separately for the concept-con- 
cept and concept-self distances, and these converted 
to A’s as above. i 

4. Consistency across subjecis and methods. To pro- 
vide an unconfounded baseline against which to 
evaluate intrasubject and intramethod consistency, the — 
f for all possible comparisons made across both sub- 
jects and methods was computed. This was done by 
multiplying the overall 5 by the number of comparisons 
involved, subtracting from this the within-person and 
within-method f’s, each multiplied by the number of 
comparisons on which it was based, and dividing the 
remainder by the number of across-method-and-subject 
comparisons. Thus, for example, in Group A there are 
1,596 possible paired comparisons among the 57 sets of 
rankings, with 19 X 3 = 57 of these within subjects, 
3 X 171 = 513 of these within methods, and the re- 
maining 1,026 across both subjects and methods. The 
computation is (.390 X 1,596 — .443 X 57 — .449 x 
513)/1,026 = .358 for the f for comparisons across both 
subjects and methods. 


RESULTS 


Table 2 contains the mean rank-order corre- 
lation coefficients among rankings within the 
categories specified. 

Let us begin by examining the self-descrip- 
tions. First, do individual subjects describe 
themselves with any consistency whatsoever? 
The affirmative answer is in the left-hand part 
of the first row of Table 2. When p’s are com- ; 
puted for individual subjects over the three’ 
measurement methods, they tend to be sub=. 
stantially positive: the means of these p’s i 
the three subgroups are .48, .40, and .50; 
respectively. : 

The second, and more crucial question for 
our purposes is: To what extent do the in- 


TABLE 2 


CONSISTENCY AMONG RANKINGS OF CoNCEPT-CONCEPT 
AND CONCEPT-SELF DISTANCES 


Mean p Among Rankings 
h Of6 Concept- | _Of 15 Concept- 
Comparisons Self Distances | Concept Distances 


Form | Form | Form | Form | Form | Form 
A B c A B c 


Among all subjects® 
Within subjects, across] .48 | .40 | .50 | .44 | .43 | .60 
3 methods 
Across subjects and | .38 | .21 | .40 | .36 | .28 | .34 
methods 
Among consistent sub- 
jects only > 
Within subjects, across} .70 | .70 | .74 | .58 | .61 
3 meth 
Across subjects and | .64 | .58 | .62 | .50 | .46 | .33 
methods 
Within methods, across 
N subjects” 
Direct rating +29) .34 | .55 | .47 | .42 | .38 
Semantic Dif. +30 | .13 | .45 | .30 | .31 | .32 
Role persons .54 | .30 | .25 | .57 | .38 | .36 
3 methods combined +37 | .26 | .42 | .45 | .37 | .35 
Across subjects and +38 | .21 | .40 | .36 | .28 | .34 
methods 
Over all 3 N rankings +38 23 | .41 | .39 | .31 | .35 


® N = 19, 17, and 16 for Forms A, B, and C, respectively. 
b N = 7, 3, and 8 for Forms A, B, and C, respectively. Con- 
sistency as defined in text. 


dividual subjects describe themselves dis- 
tinctively? The usefulness of the self-concept to 
the personality psychologist usually pre- 
supposes that differences from individual to 
| individual can be dependably ascertained. The 
relevant information in Table 2 lies in the 
relationship between Rows:1 and 2 on the 
left-hand side. The difference between con- 
Sistency across methods within persons and 
consistency across methods across persons 
Tepresents the effect of consistent individual 
differences. This effect is not impressively 
large, but it is present in all three groups.’ 
Thus these subjects do appear to differ from 
Person to person in how they describe them- 
Selves, although there is also a substantial 
Consensus in their self-descriptions, which 
appears upon inspection of the data to be 
based on a tendency noted by Edwards (1957) 


* Individual subjects having f’s above the ñ across 

ns in their group number 12 out of 19, 11 out of 17, 
and 10 out of 16, in the three groups. The two-tailed 
Probability (based on the binomial) of obtaining such a 
Tesult by chance is .014. 
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and others for subjects to apply socially de- 
sirable words to themselves, and avoid socially 
undesirable words. 

Person-to-person differences in self-descrip- 
tion might arise, however, simply from person- 
to-person differences in the meanings attributed 
to the concepts used in self-description. 
Are such differences present in the data, 
and if so, how large are they? The answer may 
be found by comparing Rows 1 and 2 on the 
right-hand side of Table 2. Here again Row 1 
reflects consistency across methods within 
subjects, and Row 2, consistency across 
methods across subjects. The difference be- 
tween the two reflects consistent individual 
differences in the meanings assigned to con- 
cepts. And this difference is roughly of the 
same magnitude as that on the left-hand side 
of the table:* in other words, the concepts 
appear to differ in meaning from person to 
person fully as much as their use in self- 
description does. 

Are these differences related? In order to 
shed some tentative light on this question, 
one concept in each group was selected (the 
one that appeared to show the greatest vari- 
ability in concept-self distance from person 
to person) and the relation between its distance 
from the self and its distance from the other 
concepts in its set was evaluated by computing 
p’s between them over the subjects in the 


up. 

ik at the relationships observed sug- 
gest that a concept was more likely to be used 
inself-description when it was given a more 
favorable meaning. Take, for example, the 
concept solemn in Group B. Individuals dif- 
fered considerably as to whether they located 
jt near or far from their self-concept—for 
some it was one of the more self-descriptive of 
the concepts in its set; for others it was the 
least. And the variation in the use of the con- 
cept in self-description appeared to be accom- 
panied by shifts in its meaning: those persons 
who judged solemn to be self-descriptive 
located it relatively close in the semantic space 
to expressive and considerate; those who did 
not, placed it distant from these concepts, and 
closer to submissive or to obstructive. In the 


4Its significance is greater—well beyond the 001 
Jevel—due to less variability among individuals in the 
consistency of these rankings. 


A 
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same way, in Group A, when relaxed was seen 
as relatively close in meaning to expressive 
and imaginative, it was more likely to be con- 
sidered self-descriptive than when it was far 
from these concepts. Or in Group C: whether 
dependent was considered self-descriptive or 
not went with its nearness to persevering and 
responsible, on the one hand, or to irresponsi- 
ble, on the other.® It seems plausible, therefore, 
to attribute the differences in self-description 
at least in part to the differences in meaning 
that subjects attributed to the concepts. 

One possible complication in the inter- 
pretation of the results was suggested by the 
fact that for several subjects consistency across 
methods was around zero, while for others it 
was quite high. It seemed desirable to check 
the general conclusions of the study among 
those subjects for whom the measurement 
appeared to be best: that is, those subjects who 
showed high consistency across methods, both 
in the measurement of concept-self distances 
and in the measurement of concept-concept 
distances. A z of .50 was chosen as the arbitrary 
cut-off point for including subjects in the 
“consistent” group. This was approximately 
at the median of each distribution. About one- 
third of the total sample exceeded this level 
of consistency on both distributions, and were 
included in the group of consistent subjects. 
The third and fourth rows of Table 2 contain 
p’s based on this group. It will be seen that 
despite the now considerably higher p’s (par- 
ticularly for the concept-self distances), the 
differences between within- and across-person 
p’s remain of roughly the same order of magni- 
tude on the two sides of the table. Again, it 
appears that the shift in concept meanings 
from person to person is about the same as the 
change in self-descriptive use of the concepts. 

Finally, it is of some interest to examine the 
consistencies within the three measurement 
methods. The relevant p’s are in the lower part 
of Table 2. There appears to be some tendency 
for the “direct rating” and “role persons” 
techniques to yield higher within-method than 
across-method j’s, but there is an exception 
among the concept-self comparisons in each 
case. The Semantic Differential may produce 
slightly less agreement than the other tech- 


5 The relationships mentioned are based on p’s rang- 
ing from .45 to .79. 


niques, but again, this is not uniformly true 
in all the groups. In general it appears that — 
the three methods agree reasonably well on 
what they are measuring, in that within- 
method ’s are not, on the average, greatly in 
excess of across-method j’s. (This is particu- 
larly true for the concept-self distances.) 


Discussion 


If it is true, as the present results appear to 
indicate, that consistent individual differences 
in self-description are on the whole no larger 
than consistent individual differences in the 
meanings of the words used in self-description, 
then obviously inferences about the self- 
concept from such self-descriptions are very ‘ 
hazardous. j 

Some restrictions on the generality of these 
results should be noted at the outset. 

First, the subjects were college students in | 
a psychology class. However, one might expect 
that in a more heterogeneous population con- 
sensus on word meanings and on self-concepts 
would both decrease somewhat, leaving the 
nature of the problem of interpreting self- 
descriptions much the same. 

Second, the results apply to self-descriptions — 
in terms of adjectival trait names. Presumably 
if the words are defined for the subject, as 1$ 
done in some rating scales, the variability in 
meaning from person to person should decrease 
(to an extent limited by the fact that the words 
used in definition are likely to be themselves 
ambiguous). Reduction in variability of mean- 
ing may also result from the incorporation of 
the adjectives in sentences, as is done in many 
Q sorts and inventories, but this need not 
always be true. In “I am more sensitive than 
most other people,” the word sensitive 18 
probably just as variable in meaning as when 
it is used alone. However, the findings of the 
present study apply most directly to self- 
description by such means as adjective check- 
lists or simple trait rating scales.* | 

®Tt should be made clear that these results do not 
imply that adjective checklists and similar self-rating 
devices are without value. For some uses of such in- 
struments, the fact that individuals may differ in the 
meanings they assign to words does not present serious 
difficulties. For example, in strictly empirical predic- 
tion problems it may be immaterial whether the in” 
strument works because it measures self-concepts 
personal constructs, or some mixture of the two. In 


— 
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Third, the results apply to self-description 
ona single occasion—the typical research situa- 
tion; implications are necessarily indirect for, 
say, a clinical setting in which self-concepts or 
personal constructs emerge gradually over a 
series of interviews. 

Finally, the results depend on the validity of 
measuring meanings in terms of distances in a 
semantic space (the fairly good agreement 
across the three measurement methods sug- 
gests that the results should be independent of 
any particular method of measuring such dis- 
tances). The merits of a model of this kind 
have been discussed by Osgood et al. (1957) 
and others. A particular advantage for the pres- 
ent study is that it permits evaluation of con- 
sistency of meaning of self and other concepts 
in the same terms, i.e., as the stability across 
Persons or methods of the rank order of a set 
of distances in a given semantic space. 

This last is important. “Everyone knows” 
that People’s agreement on the meaning of 
words is greater than their agreement on their 
self-descriptions (contrary to the present find- 
ings), but this is agreement in terms of dic- 
tionary definitions, that is, in terms of distances 
to that set of concepts most stably related to 
the Concept in question, not in terms of posi- 
tion in the semantic space as a whole. A 
relevant example can be drawn from the data 
of this study. It so happened that the two 
adjectives responsible and irresponsible were 
pe ded in the sample of six concepts in 
op C. The data suggest that the subjects 
pce in high agreement as to the relationship 

these concepts fo each other. However, the 
Subjects Still differed substantially on how the 
| eae to other concepts in the set. No 
= dependable and untrustworthy had 
ae pended along with irresponsible and 
n nsible, one would have found highly con- 

ent relations among the four concepts 


_ and perhaps have concluded that the subjects 


4, 
cea on what they mean”—but do they? 
So era would insist that it is the general 
tio ae ips in the semantic space—the rela- 

nship between, say, independent and re- 


0 

ee the assumption of consistency within the 

meaning ey be sufficient: one may be able to make 

tices a comparisons of overall discrepancies be- 

mensions H self-concept and self-ideal, even if the di- 

vary fro along which such discrepancies are measured 
m person to person. 
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sponsible, rather than that between irresponsi- 
ble and responsible—which constitute the 
“meaning” with which the psychologist is 
usually concerned. And it is his kind of mean- 
ing which, in the present experiment, differs 
from person to person as much as the use of 
the concepts in self-description does. 

What is the process which produces these 
related differences in  self-description and 
concept meaning? If (to recur to an earlier 
example) solemn means something like ex- 
pressive to the person who uses it to describe 
himself, and something like obstructive to the 
person who does not, how does this come 
about? One line of interpretation is in terms 
of the theory of self-consistency proposed by 
Lecky (1945), and adopted in modified form 
by Murphy (1947), Rogers (1959), and others. 
These writers emphasize the fact that evi- 
dence about the self received from external 
sources tends to be modified to fit the person’s 
existing concept of himself. In the present 
case, such an explanation might take either of 
two forms. It might be that a person first 
interprets solemn in a favorable light, as 
meaning something like expressive, and there- 
after is more willing to apply it to himself than 
is someone who interprets it as more like ob- 
structive. Or it might be that he first accepts 
solemn as applying to himself (because people 
have called him that) and then tends to give 
the word a particular interpretation in line 
with his general view of himself. The present 
data do not offer a basis for decision between 
these alternatives. However, in either case. the 
general conclusion is the same: we know little 
about what it means when a person calls 
himself solemn unless we know what he means 
by the word—and one basic implication of 
the present study is that it isn’t at all safe to 
assume that he means the same thing by it 


that we do. 


SUMMARY 


College students made a series of ratings 
designed to determine the similarities of 
meaning among a set of six adjectival concepts, 
as well as the relation of the subject’s self- 
concept to each of the concepts in the set. 
Three different techniques of measuring mean- 
Inter- and intrasubject con- 


ing were used. t c 
meanings and self-descriptions 


sistency in word 
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were examined. For three groups of subjects 
using different sets of adjectives, consistent 
individual differences in the meanings at- 
tributed to words proved to be of the same 
order of magnitude as consistent individual 
differences in the use of these words in self- 
description. It was suggested that this implies 
considerable hazards in using self-descriptions 
of this nature to define the self-concept. 
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THE EFFECTIVENESS OF UNANTICIPATED 
PERSUASIVE COMMUNICATIONS 
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attempt to persuade a person to change 
his opinion on some issue would be more 
effective if the persuasive communication were 
unexpected than if the person anticipated the 
influence attempt. At least many people find 
this to be plausible. Lazarsfeld, Berelson, and 
Gaudet, for example, state “If we read or tune 
ina speech, we usually do so purposefully, and 
in doing so we have a definite mental set that 
tinges our receptiveness .... This mental set 
is armor against influence. The extent to which 
People, and particularly those with strong 
Partisan views, listen to speakers and read 
articles with which they agree in advance is 
evidence on this point” (1948, p. 152), and 
Festinger says: “It seems clear that the avoid- 
ance and evasion of material which might 
Produce or increase dissonance depends on 
anticipations (probably unverbalized ones) 
ut the material or on preliminary assess- 
Ments of the material” (1957, p. 158). 

There are, however, virtually no data either 
to challenge or to support such statements. A 
Study by Ewing (1942) comes closest to being 
Pertinent. Ewing exposed two groups of sub- 
jects to a persuasive communication that 
Supported a very extreme point of view. The 
only difference in treatment between the two 
groups was in the introduction to the persua- 
sive communication. This introduction led one 
Soup to expect that the communication would 
Support the extreme view which it actually 
a For the other group, the introduction 
ed the subjects to expect that the communica- 

n would agree with their existing views. The 
Tesults indicated that those subjects who 
at the communication to disagree with 
it nin were actually less influenced by 

+ dn other words, the persuasive communica- 

1 


[ seems plausible to suppose that an 


han Maccoby for his help in writing the persuasive 
communication and in the conduct of the experimental 
ony We are also indebted to John G. Caffrey and 
ah aes for their help and cooperation in ob- 

n jects. 
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tion was more effective if the audience falsely 
anticipated that it would support their existing 
views. This study does not deal with the 
effectiveness of a communication to an entirely 
unprepared audience, but it does point to the 
operation of some anticipatory resistance by an 
audience which expects to hear a disagreeing 
speech. J 

One may, of course, raise the further 
question as to the exact nature of the process 
by means of which the prepared audience is 
able to resist the effects of the attempted 
persuasion. 

Some understanding of this process may be 
obtained from Festinger’s theory of cognitive 
dissonance (1957). We may say that a persua- 
sive communication which argues for an 
opinion different from that held by the 
audience creates dissonance in the listeners. 
Specifically, dissonance is created between the 
opinion the listener holds and his knowledge 
of the arguments favoring a contrary view, 
Since a person who reads a persuasive com- 
munication, or is a listener in an audience, 
cannot attempt to influence the source of the 
communication, there are only two immediate 
ways in which he can reduce this dissonance. 
He can change his opinion to a position closer 
to that advocated by the communication or 
he can reject and derogate the communication 
and the communicator. 

What, then, are the effects of being prepared 
or unprepared to hear a persuasive communica- 
tion? If the conditions of exposure to a com- 
munication are such as to guarantee attentive- 
ness, it seems plausible to assume that, hearing 
the same speech, the same amount of 
dissonance is introduced whether or not the 
person is prepared for it. In such a case, the 
effect of preparedness, if any, would be on the 
particular mechanism used to reduce the 
dissonance, namely, whether the person more 
readily changes his opinion or rather rejects 

source of communication. 
vine is no rigorous derivation to be made 
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here but one may argue that, if a person 
anticipates hearing a communication that will 
disagree with an opinion he holds strongly, 
he will approach the situation with hesitancy, 
suspicion, and perhaps some hostility. If he 
does approach the situation in this way, then 
it seems natural to expect that his first and 
easiest reaction will be to reject the 
communicator. 

Thus, being prepared for the communication 
would not, in a sense, make the communication 
less effective. It would simply alter the way in 
which the dissonance is reduced. Those pre- 
pared for the communication would tend less 
to change their opinion and tend more to 
reject the communicator. The study by Ewing 
(1942) gives some support to this analysis of 
the situation. He found that the discrepancy 
between the subjects’ initial opinion and the 
position they expected the communication to 
espouse was the major determinant of how 
biased they felt the communication to be. In 
other words, irrespective of what the com- 
munication actually said, if they expected 
disagreement, they later felt the communica- 
tion to be more biased. Anticipation of 
disagreement seems to have led to more rejec- 
tion of the communication. 

The present experiment was designed to 
throw more light on this process by comparing 
two groups exposed to the same persuasive 
communication, one group expecting a disa- 
greeing persuasive communication and the 
other group not expecting a persuasive com- 
munication at all. 


PROCEDURE 


A questionnaire was given to 128 students of the 
Palo Alto High School in order to discover a topic that 
was important to these students and on which their 
opinions were relatively extreme and homogeneous. 
This questionnaire was administered in the classrooms 
by the teachers who described it as a “Youth Survey.” 
The subjects were assured that their teplies would be 
confidential. The questionnaire included questions 
about the control of teenage drivers, teenage curfew 
regulations, and the treatment of communists in this 
county: All questions had four alternative answers: 

‘agree strongly, agree slightly, disagree slightly, dis- 
agree strongly.” The subjects were also asked to rank 
the three topics according to how important they were. 
On the basis of these rankings of importance and the 
clustering of their opinions toward one extreme, the 
issue of controlling teenage automobile driving was 
chosen for use in the actual experiment. The four ques- 
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tions concerning this topic on this questionnaire also 
served as our first measure of opinion. 

Only 91 of these subjects were present at the experi- 
mental session 2 weeks later. The data from four of 
these were not analyzed; two arrived too late, and two 
were discarded because their responses were mainly 
attempts at humor. Of the remaining 87 subjects, 53 
were females and 34 were males. The mean age was 
15.7 years, with a range from 14-18. 

The experimental session was conducted in the high 
school auditorium. As each student entered, he was 
given a questionnaire booklet. The students were asked 
to sit apart from each other and not to open the book- 
lets until instructed to do so. The experimenter then 
explained to them that they were participating in an 
experiment conducted by the Stanford Psychology 
Department and assured them that their answers would 
be confidential and would have no effect on the grades 
they would receive in their classes, The subjects were 
then asked to open their booklets to the first page and 
read the printed instructions. There were two different 
sets of instructions and the booklets had been ran- 
domized in this regard when they were passed out, In 
one condition, “Opinion Orientation,” subjects read the 
following: 

We are very much interested in studying the 
opinions and attitudes of high school students on the 
teenage driving problem. We particularly want to 
get your reactions to a talk by an expert on the sub- 
ject. We have asked Mr. Nathan Maccoby, who is 
very well known for his reports and articles on this 
problem, to come here today and talk to you. In his 
articles, Mr. Maccoby has stated his very strong 
opinion that teenagers are a menace on the roads and 
should be strictly controlled by effective new laws. 
It is important that you pay close attention to him 
and to what he has to say, as you will be asked to 
give your opinions on the problem after he has 
finished his talk. 

In the other condition, “Personality Orientation,” sub- 
jects read the following: 

We are very much interested in studying how 
members of an audience form impressions of the 
personality of a speaker. We particularly want to see 
how high school students will “size up” the per- 
sonality of an adult who is speaking to them on & 
serious subject. We have asked Mr. Nathan Mac- 
coby, who is considered to be an expert in his field, to 
come here today and talk to you. We have asked 
him to speak on the topic that he knows the best, 8° 
as to put him more at his ease. It is important that 
you pay close attention to him and to what he has te 
say, as you will be asked to give your impressions of 
his personality after he has finished his talk. 

When they had finished reading the instructions, the 
speaker was introduced. He delivered a speech that 
stressed the teenager’s lack of a mature sense of re- 
sponsibility and advocated stricter legal measures for 
the prevention and control of teenage driving. The 
communication was more extreme in the direction of 
control than the opinions of any of the students. After 
the speech, the experimenter thanked the speaker and 
instructed the students to answer the questions in theif 
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booklets. The booklets contained the same four ques- 
tions on control of teenage driving which had been used 
previously together with other questions designed to 
measure rejection of the speaker, importance of the 
topic, and other items. These will be discussed in more 
detail under “Results.” At the conclusion of the session 
the experiment was explained in simplified terms and 
any questions the students had were answered. The 
session took approximately 45 minutes. 


RESULTS 
Effectiveness of the Experimental Manipulation 


The purpose of one of the sets of instructions 
was to create an orientation toward the person- 
ality of the speaker so that the actual persua- 
sive communication would be unanticipated. 
The other set of instructions was intended to 
Create an orientation toward the opinion of the 
Speaker together with the expectation that the 
Speaker would disagree with their own opinion. 
To check on whether or not the two different 
orientations were successfully created, the 
following question was asked: 

The psychologists who are doing this experiment are 
mainly interested in 
~ (A) what kinds of opinions people have 

(B) how people judge other people’s personalities. 
Tf our manipulation was successful, we would 
expect the majority of Opinion Orientation 
subjects to check the first alternative and the 
Majority of Personality Orientation subjects 
to check the second. Out of a total of 41 sub- 
_ received the Opinion Orientation 
i ctions, 80% said the experiment was 

neetned with opinions. Out of the total of 

i Subjects who received the Personality 
n = instructions, 63% said the experi- 
ality eB concerned with judgment of person- 
aks he difference between the two 
z Eel. is, of course, highly significant 
Tiental y. Thus it appears that our experi- 

i Manipulation was effective in creating 
erent orientations toward the speaker in 

€ two groups,? 
ae be noted in the instructions to the 
ee . at in each of the two conditions an 
oan as made to introduce the speaker as 
ie expert. This was done because we 
oe to keep constant the magnitude of 

i nce occasioned by the persuasive 
a a no significant differences in the re- 
it y boys and girls on any of the measures 


study. Theref 
Soup as a whole. fore, the data were analyzed for the 


communication. The following question was 
asked to check on this point: 


j When we introduced Mr. Maccoby, we said that he 
is considered to be an expert in his field. How much do 
you think he really knows about the problem he dis- 
cussed? 
(A) Heisa real expert, who is very familiar with the 
problem. 
(B) He knows quite a lot about the problem. 
(C) He knows about as much as most people do 
about the problem. 
(D) He knows less than most people about it. 
(E) He knows very little, and a lot of his information 
is wrong. 


The results show that we were successful 
in keeping the expertness of the speaker con- 
stant between the two conditions. In the 
Opinion Orientation condition 78% said the 
speaker was a real expert or knew quite a lot 
about the problem. In the Personality 
Orientation condition the comparable figure 
is 80%. Clearly, there are no appreciable 
differences between the two conditions. 

Since the purpose of the experimental manip- 


` ulation was to affect the orientation of the 


subjects but to keep constant the magnitude 
of dissonance, it is also important to know 
whether or not the subjects in the two condi- 
tions paid equal attention to what the speaker 
said. For this purpose a number of factual 
recall questions were asked. The results show 
no difference between the conditions in amount 
of material recalled. Since the two conditions 
were the same with respect to both attention 
paid to what the speaker said and degree of 
expertness attributed to him, we may regard 
the experimental manipulation as having 
created a difference in orientation or prepared- 
ness on the part of the subjects while holding 
constant the magnitude of dissonance intro- 
duced by the persuasive communication. 


Comparison of the Two Experimental Conditions 


We expected subjects in the Opinion 
Orientation condition to reduce dissonance 
mainly by rejecting the communicator and 
expected those in the Personality Orienta- 
tion condition to reduce dissonance mainly 
by changing their opinion in the direction 
advocated by the communication. Change 
scores were calculated as follows: If a subject 
d his answer to an adjacent category, 


hange 
chang ” to “agree slightly,” 


e.g., from “agree strongly 
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he received a score of 1; if he changed to an 
answer two steps away, e.g., from “disagree 
strongly” to “agree slightly,” he received a 
score of 2; since we used a four-point scale, the 
maximum possible change was 3. Changes in 
the direction of the position advocated by the 
communication were given a plus sign, and the 
changes were summed algebraically for the 
four questions to obtain a single score. Thus, 
if a subject changed from “disagree strongly” 
to “agree slightly” on one question in the 
direction advocated by the communication 
(a change of +2); changed from “agree 
strongly” to “agree slightly’ on another 
question in the direction opposite to that 
advocated by the communication (a change 
of —1); and did not change at all on the other 
two questions, his opinion change score would 
Dett 

Rejection of the communicator was meas- 
sured by the following question: 


Do you think Mr. Maccoby covered all the facts 
pretty well, or do you think he was unfair or biased in 
some way? 

(A) He was very unfair and biased, 

(B) He was somewhat unfair; he didn’t give all the 

facts, 

(C) He was very fair; he covered all the facts pretty 

well. 


Table 1 presents the data for opinion change 
and rejection. Two indices of opinion change 
are presented in the table: the average change 
and the percentage of subjects changing 
appreciably (two or more points) in the direc- 
tion advocated by the communication. Both 
of these measures show differences between 
the two experimental conditions in the 
Predicted direction. Subjects in the Person- 
ality Orientation condition change their 
opinion more in the direction advocated by the 
communication than do subjects in the 
Opinion Orientation condition. While the 
difference in average change of opinion is not 
at all significant, the Percentage who change 
appreciably is significant Statistically (chi 
square = 5.71, p = .02). The difference 
between the two experimental conditions on 
Tejection of the communicator is also in the 
expected direction. Subjects in the Opinion 
Orientation condition are more likely to 
teject the communicator. The difference is 


* All p values reported are two-tailed. 
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TABLE 1 
OPINION CHANGE AND REJECTION OF COMMUNICATOR 
FOR THE Two EXPERIMENTAL CONDITIONS 
Experimental Condition 
Opinion Personality 
Orientation Orientation 
(N = 41) W = 46) 
Average change of opinion +.49 +.63 
Percentage changing ap- 20% 43% 
preciably* 
Percentage saying com- 80% 61% 


munication was very or 
somewhat biased 


a An appreciable change is defined as a change of two or more 
points in the direction of the communication. 


l 
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statistically significant (chi square = 4.06, + 


p = .05). 

In other words, our expectations were borne 
out. If the subjects are led to expect that a 
persuasive communication will disagree with 
them, they reject the communicator more and 
are less influenced than subjects who are, so 
to speak, caught unprepared. The obtained 
differences, however, while significant statisti- 
cally, are rather small. Let us, therefore, 
examine the data further to see what additional 
light can be shed on the process with which 
we are dealing. 


Initial Opinion and Importance of Issue 


The topic of teenage driving was chosen for 
the experiment because the initial opinions of 


the high school students were more extreme * 


on this issue than on either of the other issues 
that we pretested. Theoretically, of course, we 
would expect that the differences between our 
two experimental conditions would be greater 
for people who held opinions very divergent 
from the position advocated by the com- 
munication since more dissonance is introduced 
for them. Although opinions on the issue of 
teenage driving were relatively extreme, still, 
only a minority of the subjects held entirely 
one-sided opinions on the issue. 

The four questions used as the measure of 
opinions about teenage driving on both ques 
tionnaires were as follows: 


1. Since most accidents involving teenagers happe? ~ 


at night, people between the ages of 16 and 20 should 
only be allowed to drive during the daytime. 
2. Tests for obtaining a license to drive should be 
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2 made much harder for teenagers and should also test for tance of the issue, the greater will be the 


judgment and knowledge of the traffic laws, 

3. I think that the minimum age for obtaining a 
driver’s license in California should be raised to 18, 
[Note: present age is 16.] 

4, If a person between 16 and 20 years of age receives 
a ticket for speeding or reckless driving, his license 
should be revoked until he is 21. 


Tf we define an extreme opinion as disagree- 
ment (either strong or slight) on all four ques- 
tions, we find that only 16 of the 41 subjects in 
the Opinion Orientation condition and 16 of 
the 46 subjects in the Personality Orienta- 
tion condition initially held such extreme 
opinions. 

Another variable that should affect the 
tesults of the experiment is the importance 
of the issue to the subject. The more important 
the issue of teenage driving is to a particular 
Person, the greater should be the magnitude 
of dissonance created in him by the persuasive 
communication. In order to obtain a measure 
of the importance of the issue to each subject, 
We asked the following question: 


| How important is it to you to be able to have a 
license to drive a car? 

(A) very important 

(B) quite important 

C) not too important 

(D) not at all important 


Ideally we would have wanted to analyze 
the data from the experiment separately for 
the effect of extremeness of opinion and the 
effect of importance of the issue. Unfortu- 
nately, it turns out that there is a very appreci- 
able correlation between these two variables. 

both experimental conditions equally, those 
who hold extreme opinions also overwhelm- 
ingly state that the issue is important to them. 

ence, we cannot look at the effect of each 
of these variables separately. Consequently 
the data were analyzed for those with extreme 
Opinions versus those with moderate opin- 
tons on the issue, ignoring the specific 
Tesponses on importance. It should be re- 
membered, however, that these two go 
hand-in-hand in this sample. Since, theoreti- 
cally, both of these variables should affect the 
data in the same manner, the interpretation 
will be, of necessity, slightly ambiguous. That 
is, the greater the difference between aa 
Opinion and the position advocated by the 
Communication, and the greater the impor- 


magnitude of dissonance introduced by the 
communication. Hence, we cannot separate 
the effects of these two variables in our data, 

Let us, then, examine the data to see the 
effect of the combined action of both variables, 
Table 2 presents the data on opinion change 
separately for those who initially held extreme 
opinions and those who initially held moderate 
opinions for each experimental condition, It 
should be remembered that a subject was 
labeled as holding a moderate opinion if he 
initially agreed with the position advocated by 
the communication on one or more of the 
questions. 

It is clear from Table 2 that the difference 
between initial opinion and position advocated 
by the communication is a major variable. For 
those who held extreme opinions initially 
(and for whom the issue was more important) 
there is an average change of +2.31 in the 
Personality Orientation condition as com- 
pared to an average change of only +.81 in the 
Opinion Orientation condition. The difference 
on the measure of percentage who change ap- 
preciably is also similarly large. Both differ- 
ences are significant at better than the 2% 
level. For those whose initial opinion was 
moderate (and the issue was less important) 
there is practically no difference between 
conditions and, indeed, practically no impact 
of the communication at all. The average 
change for the Opinion Orientation group is 
only +.28 and the average for the Personality 
Orientation group is even slightly in the 
opposite direction from that advocated by 
the communication. In other words, the 
entire difference that we found between 


TABLE 2 

OPINION CHANGE IN RELATION TO INITIAL OPINION 
Experimental Condition 
oeaan | Onestation 

Initial Opinion 

Parsee Gea yl yee itor 

Pe | = 16) | CW = 10) 
te Initial Opinion WW 
Moderate Change +.28 27 
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the experimental conditions is attributable 
to those subjects for whom the persuasive 
communication introduced considerable dis- 
sonance. 

We would also expect the difference between 
the two experimental conditions on the 
measure of rejection to be greater for those 
with initially extreme opinions. Surprisingly, 
however, this does not turn out to be the case. 
There are simply no differences between those 
holding extreme opinions and those holding 
moderate opinions in the percentage who feel 
the communicator was biased. In the Person- 
ality Orientation condition these figures are 
62.5% and 60%, respectively, and the com- 
parable figures for the Opinion Orientation 
condition are 81% and 80%. There is, of 
course, no reason to expect a difference in 
the Personality Orientation condition where 
dissonance was reduced mainly by changing 
opinion, There is reason, however, to have 
expected a difference in the Opinion Orienta- 
tion condition. It is conceivable, however, 
that the overall level of rejection was already 
so high in this condition (80%) that a virtual 
ceiling had been reached as far as our relatively 
crude measure is concerned. This cannot, of 
course, be settled with the present data. The 
only thing that can be said is that, if there is 
not a ceiling phenomenon here, there is a 
theoretical difficulty involved in interpreting 
the rejection data. 
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SUMMARY 


Eighty-seven high school students who were 


in favor of allowing teenagers to drive with 


few restrictions were presented with a com- _ 


munication advocating strict control of young 
drivers. One group was given an orientation 
to attend to the speaker’s opinions and was 
informed of his topic and his point of view in 
advance; the other group was given an orienta- 
tion to evaluate the speaker’s personality 
and was not told the topic of the speech or the 
speaker’s point of view. It was found that 
subjects who were forewarned of the nature of 
the communication changed their opinions 
less and rejected the communicator as biased 
to a greater degree than unprepared subjects. 
Differences in amount of opinion change 
between prepared and unprepared subjects 
were greater among those holding extreme 
opinions initially. 
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PERSONALITY AND VERBAL CONDITIONING EFFECTS! 


GEORGIA BABLADELIS? 
University of Colorado 


s a body of evidence about verbal 
LE has accumulated, inves- 
tigators have begun to discuss its 
implications for understanding the psycho- 
therapeutic process. An important question 
that arises in connection with psychotherapy 
is that of what happens in the psychothera- 
peutic process to produce change in the be- 
havior of the client. 

Increased attention has been given to the 
verbal behavior of the client in attempts to 
solve both practical and theoretical problems 
here (Mowrer, 1953; Raimy, 1948; Rogers & 
Dymond, 1954; Shaffer, 1947; Shoben, 1949; 
Wolpe, 1958). Conceivably, verbal behavior 
may serve as either cause or effect of both affec- 
tive and instrumental responses. If this is true, 
it is logical to expect that the manipulation of 
verbal patterns as mediators is one source of 
change in the range of both affective and 
instrumental response patterns of the client, 
and thus of the larger patternings of his 
interpersonal relationships. 

In verbal conditioning studies, the experi- 
Menter attempts to manipulate verbal 
mediators. Krasner (1958) has presented a 
thorough review of the studies in this area, 
Ponting out that this group of studies follows 
Skinner’s paradigm in that “the dependent 
yous are the subject’s verbal behavior, and 
the independent variables are generalized 
Conditioned reinforcers, intended to bring 
verbal behavior under the control of the 
examiner,” 

i Certain deficiencies exist in the general 
aen of verbal conditioning studies reported, 
aimed when they are considered as 
ae gues to psychotherapy. Usually, there is 
a a strained or distant relationship between 

experimental setting (noninterview) and 
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the therapeutic setting (interview), so the 
generalization of results is limited. Similarly, 
little attention is ordinarily given to person- 
ality variables, and when attention is given 
to them, complicated problems arise. Those 
experimenters (Sarason, 1958; Taffel, 1955) who 
have studied the relationship of personality 
variables to verbal conditioning effects have 
not employed an interview situation, nor have 
they selected a response class which appears 
to be meaningful to the subjects. Those experi- 
menters (Salzinger & Pisoni, 1958; Verplanck, 
1955) who have employed the interview or 
who have attempted to condition a response 
class which seems to be a more integral part 
of the subject’s verbal behavior have not 
studied the relationship of personality variables 
to verbal conditioning effects. 

The present study is an attempt to over- 
come these restrictions by using interviews 
with the subjects who vary along the measured 
personality dimension of Autonomy. It is an 
attempt to employ successfully the interview 
situation, to condition a response class that 
appears to be a meaningful aspect of subject’s 
verbal behavior (i.e., self-descriptive state- 
ments), and to demonstrate that a personality 
variable—Autonomy—imposes a limitation on 
verbal conditioning effects. The basic hypothe- 
sis is that the relationship between scores on 
Autonomy and verbal conditioning effects is 
linear and negative. 4 

Interaction and main effects were tested in 
an analysis of variance design. 


METHOD 


Edwards (1954) has given the following definition of 
my: 

OTs be able to come and go as desired, to say what 
one thinks about things, to be independent of others 
in making decisions, to feel free to do what one wants, 
to do things that are unconventional, to avoid situa- 
tions where one is expected to conform, to do things 
without regard to what others may think, to criticize 
those in positions are to avoid responsibil- 
iti obligations (p. 14). 

[eee scale of the Edwards Personal Prefer- 
ence Schedule is so constructed that a high scorer is one 
who is expected to manifest many of the characteristics 
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described and a low scorer is one who is expected to 
manifest few or none of them. The EPPS was adminis- 
tered to 102 students in undergraduate psychology 
courses at the University of Colorado, The Autonomy 
scale was arbitrarily divided into four segments by 
cutting the obtained distribution at the mean and at 
one standard deviation on either side. The tests of all 
102 subjects were sorted, on the basis of the scores, 
into one of these four segments. From a table of random 
numbers, 15 subjects from each range were selected, 
giving a total of 60 subjects. Thus, the entire range of 
the distribution was represented. 

‘The 15 subjects in each section were assigned to one of 
the following three conditions by random procedure: 
reinforcement of positive self-statements, reinforcement 
of negative self-statements, and control condition with 
random reinforcement. Thus, 5 subjects from each 
Autonomy group were included in each condition, giving 
a total of 20 subjects in each condition. 

The selected subjects were asked to participate in 
individual interviews. All interviews were conducted by 
the same interviewer and recorded on tape in the 
presence of the subject. The subjects were instructed to 
talk about themselves, covering both positive and 
negative characteristics. During the first 10 minutes, 
the interviewer said nothing and made no gestures, 
This was in order to establish the spontaneous or base 
rate emission of self-statements for each individual. 
During the second 10 minutes, the interviewer rein- 
forced the preselected response unit; the reinforcement 
cue was verbalization of “um-hmm” accompanied by a 
head nod. The control group received random rein- 
forcement at 10-second intervals during the experimen- 
tal period, regardless of the response of the subject. 
The interview was terminated at the end of 20 minutes, 
The tapes were transcribed with all comments of the 
interviewer omitted in the transcriptions in order to 
avoid any influence on the judges. 

Judges were two graduate students in clinical 
psychology. They were asked to rate the interviews for 
each of the 10-minute intervals as to whether or not a 
statement was a self-statement and if so, whether it was 
positive, negative, or neither. They were given explicit 
criteria for these ratings. The instructions to the judges 
were taken largely from Raimy (1948). The response 
unit was defined as a grammatical sentence, and each 
response unit was numbered. 


RESULTS AND Discussion 


The obtained reliabilities between judges 
and for the same judge on two separate occa- 
sions were high (91.3% and 97.1 %, respec- 
tively). In addition, the coder’s consistency 
was found to be high (98%). 

Analysis of the data revealed a negative 
linear correlation of .27, which does not reach 
significance at a .05 level. Because the direction 
Was as expected, the data were examined more 
thoroughly to gain a clearer Picture of what 
may be operative in the relation of Autonomy 


to verbal conditioning effects. The data were 
reanalyzed according to each type of state- 
ment reinforced. 

A significant difference (p < .01) was found 
between reinforcement conditions (i.e., whether 
Positive self-statements were reinforced, 
whether negative self-statements were rein- 
forced, or whether reinforcement was random) 
in their effects on the proportionate change in 
Positive self-statements. Specifically, the pro- 
portion of positive self-statements emitted 
increased significantly when positive self- 
statements were reinforced. The relationship 
of this increase to Autonomy scores (r = —.60) 
is significant beyond the .01 level. A similar 
increase in the proportion of negative self- 
statements when these were reinforced was not 
found. There were, of course, no significant 
changes in the proportions of either type of 
statement for the control group under random 
reinforcement. 

Because of the puzzling problem of why 
there was no significant increase in negative 
self-statements when they were reinforced, the 
data were reanalyzed to investigate the possi- 
bility that subjects scoring low in Autonomy 
gave significantly more negative self-state- 
ments in their base rates. No significant differ- 
ences were found in base rate emission of 
Positive or of negative self-statements among 
Autonomy levels. It is probable that our 
understanding of Autonomy as a personality 
variable is inadequate and that some other 
variable(s) is associated with high or low 
Autonomy scores, requiring differential predic- 
tions of conditioning effects on positive and 
negative self-statements. This suggestion 15 
consistent with the general expectation that 
personality variables impose a limitation oD 
conditioning effects. 

In a similar study, Rogers (1960) found 2 
significant conditioning effect for reinforced 
negative self-statements but not for reinforced 
positive self-statements. Clearly, some unspeci- 
fied variable(s) is exerting a differential effect 
in these two studies. It may be something aS 
relatively simple as sex differences betwee? 
experimenters, or it may be something a5 
complex as differences in total setting oF 
“atmosphere” induced. At least, it seems cleat 
that these apparently contradictory results 
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hardly indicate a straightforward conditioning 
process! 

It is interesting to note that Rogers’ control 
group (no reinforcement) showed a significant 
decrease in positive self-statements but not in 
negative self-statements. If the effect of no 
reinforcement is to decrease the subjects’ 
self-statements, then both positive and 
negative self-statements should decrease. If 
so, then something else was operative to 
arrest the decrease in negative self-statements. 
Since the control group used in the present 
investigation (receiving random reinforce- 
ment) showed no differential effects, the use of 
random rather than no reinforcement may be 
a more meaningful control in verbal condi- 
tioning studies. 

The results obtained from this experiment, 
then, indicate that by responding selectively to 
acertain class of verbal behavior, an interviewer 
Can influence the verbalizations of the subject. 
To be more specific, it was found that the 
Proportion of positive self-statements is 
icreased when they are reinforced and that 
the degree of the increase is inversely related 
to scores on Autonomy. It has not been demon- 
Strated, however, that verbal conditioning is 
What occurs in therapy, or that conditioning 
effects are stable over time, or that they are 
Correlated with changes in personality, or 
that they are related to extra-interview 
behavior. 

Until applied studies of the effects of verbal 
Conditioning are actually carried out, it cannot, 
of course, be more than conjectured that verbal 
Conditioning techniques bring about the 
therapeutic changes currently envisioned. 

hanges in verbal behavior may be necessary 
Soncomitants of psychotherapy, but are they 
either causal or sufficient? The results obtained 

om the present study in comparison with 
those obtained by Rogers argue against a 
Straightforward conditioning process. 


SUMMARY 


The purpose of this experiment was to 
explore the relationship between a personality 


E 
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variable—Autonomy—and verbal conditioning 
effects in a quasi-therapy situation when the 
interviewer employed simple reinforcing be- 
havior. A significant increase in the proportion 
of emitted positive self-statements was 
obtained when they were reinforced. The 
increase in the proportion of positive self- 
statements when they were reinforced was 
negatively related to scores on the Autonomy 
variable, but no similar changes in reinforced 
negative self-statements were found. Finally, 
it was suggested that the use of random rein- 
forcement constitutes a more meaningful 
control in verbal conditioning studies than 
does the use of no reinforcement. 
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discussion regarding the “minority group” 

status of old people in American society. 
Barron (1953), for example, has stressed some 
of the similarities between the position of the 
aged and of ethnic, racial, and religious minori- 
ties on the American scene. In particular, 
Barron refers to the stereotyping of the aged 
by other age groups and the discrimination 
against older people in employment and other 
areas of life. 

More recently, Drake (1958) has pointed 
to some of the major differences between the 
status of old people and that of the traditional 
minorities in the United States. These differ- 
ences stem from the fact that the aged do not 
constitute an independently functioning sub- 
group with a unique history, language, and 
culture. With these considerations in mind, 
Drake suggests that old people be designated a 
“quasi-minority.” 

Proceeding upon the assumption that old 
people in American society are devalued, 
Linden (1957a, 1957b) has outlined various 
cultural influences considered responsible for 
such devaluation. Illustrative of these influ- 
ences, according to Linden, are the diminishing 
acceptance of family responsibilities towards 
one’s elders, the decline in respect toward the 
aged as a consequence of loss of their position 
of authority, and the exaggerated premium 

* This study was supported by the National Institute 
of Mental Health, United States Public Health Service, 
under Research Grant M-1867, and was conducted 
under the auspices of the Age Center of New England, 
Inc. This work was done in part at the Littauer Statis- 
tical Laboratory, Harvard University, and in part at 
the MIT Computation Center, Cambridge, Massa- 
chusetts. A preliminary version of this paper was 


presented at the April 1959 meetings of the Eastern 
Psychological Association. 
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S. Couch, and Philip E. Slater for their assistance in 
machine processing of much of the data. The author 
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Florence C. Shelton, Judith W. Stephens, and Irving 
K. Zola for their assistance in the Preparation of the 
paper, and to A. Bertrand Warren and Robert Smart 
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T recent years there has been considerable 
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placed on the physical and psychological 
attributes of youth. 

Empirical support for a minority or quasi- 
minority view of old people is offered in a 
paper by Tuckman and Lorge (1953). They 
examined the incidence of yes and no re- 
sponses to a set of statements reflecting 
stereotypes and misconceptions about the 
aged in such general areas as physical change, 
family relationships, personality traits, etc. 
While this work is of interest as an initial 
empirical venture in the field, Tuckman and 
Lorge make no use of attitude scaling proce- 
dures and pay little attention to psychological 
correlates of attitudes toward old people. 

The present research is directly concerned 
with the development of a Likert scale to 
facilitate the study of attitudes toward old 
people with respect to both norms and indi- 
vidual differences. The search for correlates of 
these attitudes proceeded along several direc- 
tions. First, with a minority group model in 
mind, we asked whether attitudes toward old 
people would be related to authoritarianism, 
anomie, and ethnic prejudice. Second, we 
considered Barker’s (1948) discussion of the 
minority group aspects of physical disability 
in our society, and, accordingly, inquired 
whether attitudes toward old people would be 
associated with attitudes toward people distin- 
guished by various physical disabilities. Third, 
since mental deterioration is often alleged to be 
an important attribute of old age, the relation- 
ship between attitudes toward the mentally 
ill and toward old age was examined. Finally, 
we hypothesized that there would be significant 
relationships between attitudes toward ol 
people and such personality dimensions a5 
autonomy, achievement, nurturance, self- 
esteem, and misanthropy; subjects more 
favorably disposed toward old people were 

* A complete bibliography of this work through 1955 
is available in Shock (1957). In a more recent papel 
Tuckman and Lorge (1958) concern themselves we 
personal psychological and physiological symptoms aní 
their projection onto old people as stereotypes. 
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expected to exhibit stronger tendencies toward 
self-esteem and nurturance, whilesubjects more 
unfavorably disposed were expected to have 
stronger needs with respect to misanthropy, 
autonomy, and achievement. 


METHOD 
Subjects 


Students enrolled in introductory classes in psy- 
chology served as subjects. Two male samples (V = 128 
and 186) were obtained from Northeastern University. 
One sample (V = 168, 87 males and 81 females) was 
obtained from Boston University. 


Measuring Instruments 


A set of 17 items expressing negative sentiments 
about old people was constructed. A second set of 17 
items was then devised the content of which was the 
Teverse of the first set. There were, thus, 17 matched 
Positive-negative pairs, These are shown in Table 1. 

Some of the items were adapted from available 
ethnic minority items by the simple substitution of the 

old people” referent. Other statements derive from 
the author’s and others’ intuitions regarding stereotypes 
and feelings about old people in our society. Some a 
set Clustering of the items is possible on the basis of 

leit manifest content. Thus, Item Pairs 1, 5, and 12 
are all Concerned with the residential aspects of old 
People’s lives with special reference to segregation, 
‘Maintenance of home, and character of neighborhood, 
Shee Item Pairs 2 and 8 reflect the degree to 
Paca vague feelings of discomfort and tension are 
ae in the company of old people. The extent 
eee old people vary among one another is tapped 
Ae m Pairs 11 and 13. The nature of interpersonal 
is RRL ie age generations—conflicted or benign— 

eRe aa in Item Pairs 9, 10, and 16. The theme of 

aa ìs represented by Item Pairs 4 and 17. 
RN airs 3 and 6 refer to the cognitive style and 
PRA of old people. Qualities of old people with 
cite Tit. personal appearance and personality are 
Hak Rear Pairs 14 and 15. Finally, Item Pair 7 does 

ly cluster with any of the other items. 

Hy SS aig were interspersed among items from 
pow titude scales thereby partially disguising the 
vows of logical opposites among the “old people” 
study F and providing the necessary data for the 

a eee ates, Most of these additional items were 
Gone m the California F and anti-Negro scales 
Da Frenkel-Brunswik, Levinson, & Sanford, 

i Srole’s (1956) Antiminority and Anomie 


4 
i RU and NU-I samples, the following F 
25, 26 ae (Forms 45 and 40) were used: 1, 6, 9, 18, 21, 
ie ea 34, 37, 42. In addition, the following item 
ele tig 'uded: “The best teacher or boss is the one who 
about ane exactly what is to be done and how to go 
items with a the NU-II sample, we employed the five 
tale Of a highest loadings on a factorially pure 
1956) ee of authority” (Bales & Couch, 
aan ae Scale contains items Nos. 1, 21, and 27 from 
and 40 of the F Scale, item No. 3 from the 


scales,* Gilbert and Levinson’s (1956) scale of attitudes 
toward mental illness’ and the attitude-to-blindness 
scale of Cowen, Underberg, and Verrillo (1958).7 In 
addition, items measuring attitudes toward totally 
deaf and crippled persons were constructed, and, in 
some cases, adapted from the attitude-to-blindness 
scale cited above.® In the case of the items taken from 


Patriotism subscale of the E Scale, and the item “You 
have to respect authority and when you stop respecting 
authority, your situation isn’t worth much.” The follow- 
ing items were selected from the Negro subscale of the 
E Scale: 5, 8, 11, 14, 16, 22, 28, 31. 

5Srole’s Anomie Scale was expanded on the basis 
of prior factor analytic work (Zola & Scarr, 1958) to 
include the following items: “The way the world is 
today it is hard to find any meaning or purpose in life,” 
“Man is basically evil.” “In this world it’s dog eat 
dog.” “The forces operating in the world today are so 
powerful that no single person can do anything to 
help.” In addition, the item “Human nature being 
what it is, there will always be war and conflict” was 
added for the NU-II administration. This item, pur- 
portedly measuring authoritarianism in the F Scale, 
proved to be more highly correlated in the present case 
with anomie items, In general, the correlations in the 
present samples between the new items and the original 
Srole statements were as high as the intercorrelations 
among the latter. 

ê Seven items were selected from this scale on the 
basis of high DP values and applicability to college 
student samples. The following items were used: 5, 7, 
15, 17, 19, 25, and 31. 

7 The following 10 items were used: 1, 2, 8, 12, 13, 21, 
22, 23, 29, and 30. 

8 One-half of the items dealing with totally deaf and 
crippled persons were constructed by making the 
appropriate substitution of disability group in the 
Cowen, Underberg, and Verrillo (1958) items on blind- 
ness. The following items from Cowen et al. were con- 
verted into cripple items: 3, 10, 14, 17, 19. The following 
items from Cowen et al. were converted into deafness 
items: 4, 5, 9, 18, 26. The remaining half of the deafness 
and cripple items were constructed by the author on 
the basis of various stereotypes commonly attributed 
to deaf and crippled persons as discussed by Barker, 
Wright, Meyerson, and Gonick (1953). The deafness 
items read as follows: “The peculiar facial expressions 
of the totally deaf suggest that there is something 
very queer about them.” “I think I would find it 
intolerable to have to spend a considerable period of 
time with a totally deaf person.” “Most totally deaf 

rsons can be expected to have a rather sour disposi- 
tion.” “The best way to stamp out deafness and other 
disabilities is to prevent marriages of people with such 
disabilities.” “Totally deaf persons would probably be 
best off living among themselves away from the world 
of the hearing.” yaaa 

The cripple items read as follows: “Tt is very uncom- 
fortable to be with a crippled person because most of 
them have a chip on their shoulder.” “It would be best 


if crippled le were supported by the government 
Praet find jobs for them.” “Whenever I 


trying to tay i 
rather (han i a I try my best to avoid him.” “It 


ippled person, 4 
eas best if crippled children went to special 


MEANS, STANDARD Deviations, ITEM-SUM CORRELATIONS 


NATHAN KOGAN 


TABLE 1 


(rg), AND Marcuen Irem Pare CORRELATIONS (ryp) FOR 
“Orp Propre” (OP) Ireus 


Item 


1 N® It would probably be better if most old 
people lived in residential units with people 
of their own age. 

1 P* It would probably be better if most old 
people lived in residential units that also 
housed younger people. 

2N There is something different about most old 
people: it’s hard to figure out what makes 
them tick. 

2P Most old people are really no different from 
anybody else: they’re as easy to under- 
stand as younger people. 

3 N Mostold people get set in their ways and are 
unable to change. 

3 P Most old people are capable of new adjusi 
ments when the situation demands it. 

4N Most old people would prefer to quit work as 
soon as pensions or their children can sup- 
port them. 

4 P Most old people would prefer to continue 
working just as long as they possibly can 
rather than be dependent on anybody. 

SN Most old people tend to let their homes be- 
come shabby and unattractive. 

5 P Most old people can generally be counted on 
to maintain a clean, attractive home. 

6N Itis foolish to claim that wisdom comes with 
old age. 

6P People grow wiser with the coming of old age. 

7 N Old people have too much power in business 
and politics, 

7 P? Old people should have more power in busi- 
ness and politics. 

8 N Most old people make one feel ill at ease. 

8 P Most old people are very relaxing to be with. 

9N Most old people bore others by their insist- 
ence on talking about the “good old days.” 

9 PÈ One of themost interesting and entertaining 
qualities of most old people is their ac- 
counts of their past experiences. 

10N Most old people spend too much time prying 
into the affairs of others and giving un- 
sought advice, 

10 PP Most old people tend to keep to themselves 
and give advice only when asked. 

11N Tf old people expect to be liked, their first 
step is to try to get rid of their irritating 
faults. 

11 P When you think about it, old people have 
the same faults as anybody else. 

12N In order to maintain a nice residential 
neighborhood, it would be best if too many 

old people did not live in it. 

12 P You can count on finding a nice residential 
neighborhood when there is a sizeable 
number of old people living in it. 

13 N There are a few exceptions, but in general 
most old people are pretty much alike, 

13 P It is evident that most old people are very 
different from one another, 

14N Most old people should be more concerned 
with their personal appearance; they’re 
too untidy. 
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TABLE 1—(Continued) 
BU Sample (W = 168) NU-I Sample (V = 128) | NU-II Sample (N = 186) 
Item 
M° | SD) rig | typ | Æ | SD |r | typ | M | SD | ng | tee 
14 P Most old people seem to be quite clean and | 3.54 | 1.60) .57 | .48 | 3.27) 1.48| .56 | .44 | 3.32] 1.49) .59 | .6t 
neat in their personal appearance. 
15 N Most old people are irritable, grouchy, and | 2.34 94) .54 2.47 | 1.22] .46 2,69 | 1.45 | .55 
unpleasant. 
15 P Most old people are cheerful, agreeable, and | 3.62 | 1.40 | .66 | .27 | 3.35] 1.36 | .64 | .25 | 3.47 | 1.51 | .61 | .40 
good humored. 
16 N Most old people are constantly complaining | 4.89 | 1.62} .50 5.18 | 1.67 | .52 5.49 | 1.46 | „54 
about the behavior of the younger genera- 
tion, 
16 P One seldom hears old people complaining | 5.47 | 1.13 | .49 | .41 | 5.59] 1.19] .33 | .50 | 5.75 | 1.08] .25 | .34 
about the behavior of the younger genera- 
tion, 
17 N  Mostold people make excessive demands for | 3.96 | 1.63] .46 3.96 | 1.68 | .50 3.92 | 1.54] .52 
love and reassurance, 
17 P Most old people need no more love and reas- | 4.54 | 1.55| .39 | .19 | 4.41] 1.79] .27 | .27 | 4.46] 1.53] .43 | 11 
surance than anyone else. 
Total negative scale (OP—) 54.87. {11,04 56.84 |11.00 54.17 |12.28 
3.234 3.35 3.40 
Total positive scale (OP-+) 64.14 |10.90 62.13 | 9.70 60.42 |10.47 
3.77 3.65 3.77 


pean NDP ES ey ire LAL BE EM) eve ee oe 
Note.—The magnitude of r’s required for statistical significance is as follows: for N = 168, .15 at the .05 level and .20 at the .01 
level; for N = 128, .17 at the .05 level and .23 at the .01 level; for N = 186, .14 at the .05 level and .19 at the .0L level. 
® Items are listed in pairs, N representing the negatively worded form and P the positively worded form. 
> Revised form of the item used with NU-II sample. The revised items read as follows: 


7 P Old people have too little power in business and politics. 
9 P One of the more interesting qualities of most old people is their accounts of their past experiences. 
10 P Most old people respect others’ privacy and give advice only when asked. 
© Negative and positive means were made comparable by subtracting the positive means from 8.00. By this step, higher mean values 
reflect more unfavorable attitudes for both positive and negative items. 


d Per item means. 


she F Scale and the Gilbert and Levinson mental illness 
cale, a set of matched items of opposite sign was added 
ʻo the final battery.’ Six response categories were pro- 


schools, rather than have them try to keep up with 
normal children.” “I cannot imagine myself having a 
date with a crippled person.” 

° The F items used with the BU and NU-I samples 
were selected from Chapman and Campbell (1957) — 
reversals of Items 6, 9, 18, 21, 27, 34, and 44 from Form 
40 and Item 44 from Form 60 of the original F Scale— 
and from Jackson and Messick (1957)—Items 1, 15, 16, 
24, and 28 of their reversed F Scale. For the five-item 
factorially derived F Scale used with the NU-IL sample, 
the three F items in italics above were used. Reversed 
wording for the remaining two items was “The only way 
to have progress in this world is to reject authority when 
we disagree with it, and to go off on one’s own track,” 
and “There are qualities more important than patri- 
otism and loyalty in deciding what makes a good citi- 
zen.” 

The reversed (positively worded) MI items read 
as follows: “Most people who develop mental disturb- 
ances will get over them if treated with kindness, 
understanding, and sympathy.” “The behavior of 
most mental patients doesn’t seem so odd once you 
understand their childhood and family backgrounds.” 
“A religious person is just as likely to become mentally 
ill as one who is not religious.” “Mental patients deserve 


vided for all of the items: “strongly disagree,” “dis- 
agree,” “slightly disagree,” “slightly agree,” “agree,” 
and “strongly agree.” These categories were scored 1, 
2, 3, 5, 6, and 7, respectively, with a score of 4 assigned 
in the rare case of failure to respond to an item. 

All three samples received the F Scale items, two 
samples received the Anomie scale, and one sample re- 
ceived all of the remaining scales. Two of the three 
samples, in addition, were given a set of 30 true-false 
personality items taken largely from Murray's Explora- 
tions in Personality (1938) volume, and the Brodbeck 
and Perlmutter (1954) self-dislike scale.1° 
eee nnn 
the same kind of consideration that is given to patients 
with physical ailments.” “The large majority of dis- 
charged mental patients can be expected to make a 
satisfactory adjustment in their community.” One ad- 
ditional MI-+ item (No. 7) was selected from Gilbert and 
Levinson (1956), and in addition, the following MI— 
item was added: “Once a mental case, always a mental 
10 The following 20 questionnaire items were selected 
from the Murray volume: Achievement items 2 10; 
Autonomy items 2, 6; Narcism items 15, 16; Rejection 
items 1, 3, 4, 10; Nurturance items 3, 5, 6, 7, 9, 11, 13, 
14, 16, 20. The 10 items chosen from the Brodbeck and 
Perlmutter scale were as follows: 2, 5, 9, 10, 11, 14, 16, 


20, 23, 26. 
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RESULTS 
Old People Scales 


Table 1 presents the basic data for “old 
people” items and scales (hereafter referred to 
as OP items and scales). Consider first the 
scale means shown at the bottom of Table 1. 
A high degree of consistency among the sample 
means is evident. In all three samples, for 
example, the negatively worded statements 
elicit more favorable sentiments. In other 
words, subjects disagree more with statements 
commenting adversely on old people than they 
agree with statements praising old people. A 
test for the statistical significance of these 
overall mean differences yields ?’s significant 
beyond the .01 level for each of the samples. 
Note further that the per item means are 
uniformly smaller than 4.00—the hypotheti- 
cal indifference point. These college subjects, 
then, tend, in general, to be more favorable 
than unfavorable in their attitudes toward old 
people as manifested by responses to the OP 
items of the present battery. 

For the BU sample, it was possible to 
examine sex differences in OP scale means. For 
both the positive and negative OP scales, no 
significant differences obtained. This does not 
imply, of course, that there are no sex differ- 
ences for specific OP items. 

Reliability coefficients and interscale correla- 
tions are listed in Table 2. The former (in the 
diagonals of the table) range from .66 to .85. 
There is a trend toward greater reliability for 
the OP— as opposed to the OP+ scale. The 
magnitude of the OP— reliability coefficient for 
the NU-II sample suggests that few additional 
item modifications are required. The OP-+ 
scale, on the other hand, falls somewhat short 
of an acceptable reliability level, despite item 
revisions for the NU-II sample. 

Consider next the interscale correlations in 


TABLE 2 
Opp-Even RELIABILITY (COEFFICIENTS (Spearman- 


Brown) AND CORRELATIO: 
OP Sates NS FOR THE 
BU Sampl S 
N = 108) Sma | R Sale 
Scale | OP— | OP+ | op— | op+ | op- | op+ 
OPp— -76 ) .51 | .73 | .52 | .83 
OP+ 7 -66 % 


NATHAN KOGAN 


Table 2. In the construction of matched pairs, 
we attempted to build genuine logical opposites 
in terms of connotative meaning rather than 
retain virtually identical wording in both 
members of the item pair. This procedure was 
followed in order to disguise the presence of 
logical opposites among the items. We wanted 
to discourage consistency of response based on 
sheer recognition of almost identically worded 
opposites. 

There is, however, a price paid for such 
freedom in item wording when subjects endorse 
or reject two items of presumably opposite 
meaning. One might account for such findings 
in terms of response set, ambivalent feelings 
regarding the object of the attitude, or poorly 
constructed opposites; or, more likely, a com- 
bination of all of these might be involved. This 
problem has been cogently discussed in the 
case of F Scale items by Christie, Havel, and 
Seidenberg (1958). 

In the present case, product-moment coeffi- 
cients between positively and negatively 
worded scales are uniformly positive. As Table 
2 shows, r’s between OP+ and OP— range 
from .46 to .52 in the three samples, all signifi- 
cant beyond the .01 level. Given correlations 
that can theoretically range from +1.00 
(perfect logical consistency) to — 1.00 (perfect 
logical inconsistency), content must necessarily 
be a more powerful influence than response 
set for the OP items of the present battery- 


Old People Items 


Substantial variations in mean levels across 
OP items may be noted. It is especially note- 
worthy that the Item Pair 16 with the highest 
means—i.e., most unfavorable toward ol 
people—concerns old people’s feelings about 
the younger generation. No single item pair 
seems to stand out at the favorable end of the 
attitude continuum. Rather, there are 4 
number of cases where one or both members of 
an item pair yield means less than 3.00. 

Individual items range widely in the extent 
of their correlation (r;;) with positive and 
negative sum scores. It will be noted that 
Item Pair 6 yields very low rys values for both 
the BU and NU-I samples and was, accord- 
ingly, not used with the NU-II sample. Appat- 
ently, the association between wisdom and ol! 
age tapped by Item Pair 6 is off the favor- 
ability-unfavorability continuum represented 
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by the other “old people” items. Perhaps, the 
aphoristic quality of that item pair militates 
against adequate discriminability. 

Item Pair 4 also yields rather low item-sum 
correlation coefficients. Modification of this 
item pair is recommended in future work with 
the OP scales. A suggested modification would 
eliminate the combination of support by 
children and by pension in Item 4N. There 
is an obvious value distinction between support 
by one’s children and support by a pension 
plan to which one has financially contributed." 

An additional consideration in the revision of 
items is the size of ryp—the correlation between 
the matched positive and negative items. 
There is considerable variation among specific 
OP item pairs in the extent to which the 
subjects respond to the positive and negative 
member of the pair in a consistent fashion. 
Again we may note that such variation could 
well reflect success in constructing logically 
opposite item pairs. So, for example, Item 
Pair 7 does not meet specifications of logical 
oppositeness, for the negative member of the 
pair is a descriptive statement while the 
Positive member is a statement advocating 
change. It is hardly surprising under these 
circumstances that ryp values in the BU and 
NU-I samples are nonsignificant. However, 
when Item 7P was revised for the NU-II sam- 
ple (cf. Footnote c, Table 1), the rxp value 
Increased, but at the unfortunate price of a 
sharp drop in the item-sum correlation (rs). 
Clearly, further modifications would seem to 
be in order for Item Pair 7. 

Revision of Item 9P was also deemed advis- 
able in view of fairly low ryp (as well as low 
rs) values. We suspected that the reference to 
„entertaining qualities” in the item could be 
interpreted by subjects in a satirical vein and, 
accordingly, dropped the presumably offending 
Words in the revised version used with the 
NU-I sample. A slight increase in rs and a 
Substantial increase in ryp resulted. It should 

e noted, however, that rrs for Item 9P is 
considerably smaller than the corresponding 
value for its matched negative. 

? Finally, Item 10P was revised in an effort 
® make that item more positive in its con- 
ee In its original form, the item suggests 

ât old people are inclined to avoid inter- 


Co g êm grateful to Willis Saulnier of Raytheon Mig. 
©. for this suggestion, 


personal relations with others. While the 
revised wording no longer carries this connota- 
tion, no appreciable increase in ry and ryp 
was observed in the NU-II sample. 

In sum, it should be emphasized that the 
variability in ryp coefficients for specific item 
pairs is entirely within the positive band of the 
correlational spectrum. Further, of the 50 
fye values reported in Table 1, 43 are statisti- 
cally significant at the .05 level or better in the 
positive, i.e., logically consistent, direction. 
These results further emphasize that the OP 
items tap something more than stylistic 
response tendencies. 

Relations with Authoritarianism 

We turn next to the various correlates of 
attitudes toward old people. It is in the exami- 
nation of correlates that the advantage of 
using both positively and negatively worded 
items is especially evident. Several statistically 
significant relationships obtained between 
attitudes toward old people and other attitu- 
dinal variables based on items worded in the 
same direction might well be spurious relation- 
ships attributable largely to response set 
tendencies. This phenomenon is best illustrated 
by the results shown in Table 3. 

In each of the samples, statistically signifi- 
cant correlations ranging from .21 to .28 are 
obtained between F Scale scores for items 
worded in the authoritarian direction and 
scores based on negatively worded OP items. 
It should be noted that these results are 


TABLE 3 
Product-Moment r’s) or “OLD 
MNT r e AUTHORITARIANISM 
AND ANOMIE SCALES 


“Old People” Scales 
Scale Sample 

OP+ OP- 
i iani: Es 28** 
Authoritarianism BU -12 A 
nE | NEE |B A 
itariani 04 10 

Authoritarianism BU $, 
oppos (=t | Nair | 06 | cae 
Ld ,39** 
mwe N e BS 


a Scored in the content direction consistent with F+, ies 
higher scores reflect more authoritarianism. 
* Significant at the .05 level. 
** Significant at the .01 level. 
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consistent for both a conventional type of F 
Scale (administered to the BU and NU-I 
samples) and a “pure” factorially derived scale 
of “acceptance of authority” (administered to 
the NU-II sample). The correlations suggest 
that more authoritarian persons are more 
unfavorably disposed toward old people. When, 
however, these authoritarianism scores are 
related to scores based on positively worded 
“old people” items, the correlation coefficients 
are in the negative direction, i.e., more author- 
itarian individuals are more favorably disposed 
toward old people. In the latter case, however, 
only the r for the NU-II sample is significant. 

For items worded in the nonauthoritarian 
direction, the r’s tend toward low nonsignifi- 
cant values, The significant negative correlation 
between F— and OP— for the NU-II sam- 
ple is in the opposite direction and of approxi- 
mately the same magnitude as the r between 
F+ and OP— for that sample. There seems 
little doubt, then, that response set effects 
partially account for relations between F and 
OP. Even if F+ items constitute a more 
valid measure of authoritarianism than do F— 
items (Gage, Leavitt, & Stone, 1957), the 
fact remains that F+ correlates in opposite 
directions with positively and negatively 
worded OP items. It seems doubtful that the 
slightly lower reliability of the positive OP 
scale could account for the extent of this 
inconsistency. 

Correlations between F+ and F— were 
42, .27, and .26 for the BU, NU-I, and NU-IL 
samples, respectively. While these r’s—all 
significant beyond the .01 level in the content 
direction—suggest a reasonable degree of 
success in the selection of reversed F items, 
reliabilities for F— (.27, .10, .46) are consider- 
ably lower than for F+ (.62, .73, .60) in the 
BU, NU-I, and NU-II samples, respectively. 
Relations with Anomie 

Consider next the Anomie (A) scale. As 
defined operationally by Srole’s (1956) items, 
the anomie variable measures such charac- 
teristics as the subjects’ pessimism about the 
future, helplessness in the face of powerful 
social forces, and inability to find meaning or 
purpose in life. Here we did not attempt the 
construction of items of opposite sign. The 
influence of response set effects may be gauged, 
however, in terms of the direction and mag- 
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nitude of the correlations between A and the 
two OP scales. As Table 3 shows, the r’s are 
considerably higher with the negatively 
worded OP scale than with the positively 
worded version in the two samples tested, 
However, while the r’s drop in the latter case, 
they remain statistically significant in a 
logically consistent direction. In sum, subjects 
unfavorably disposed toward old people are 
more disposed toward anomie. 

Is anomie also related to attitudes toward 
the other minority groups represented in the 
present study? Correlations between anomie, 
on the one hand, and scales for attitudes 
toward ethnic and physically disabled minor- 
ities, on the other, range from .28 to .44 in 
the NU-II sample, all significant beyond the 
.01 level. Unfortunately, these r’s are based 
on items worded in but one direction, and, 
hence, are susceptible to the usual effects of 
response set. Indeed, when anomie is cor- 
related against the two “mental illness” scales, 
the r of 40 between A and MI— is highly 
significant, while r between MI+ and A is a 
mere 04. 


Relations with Antiminority Attitudes 
Correlations between the OP and the various 

antiminority scales are listed in Table 4. Since 

these results are based on a single sample, 


conclusions drawn from them must necessarily 
be considered tentative. This is particularly 


TABLE 4 
InTERCORRELATIONS (Product-Moment 1’s) or “OLD 
PEOPLE” SCALES WITH ANTIMINORITY AND 
Disasrity SCALES 


(NU-II Sample) 

Scale op+ OP- 
E AE Ein E aa Wei E 
Mental illness positive (MI-++)* 332. 108 
Mental illness EETA MMT) 11-46 
Antinicerity (AN) ar ER 

timinority i 

Blindness (B) 28 52 
Deafness (Dx)? 21 «48 
Deafness y)? .33 -50 
Cripple {cs 23 53 
Cripple (Cy)° <21 .42 


Note.—For N = 186, r’s of .14 and .19 are significant at the .05 
and .01 levels, respectively. 

® Scored in the content direction consistent with MI—, i.e» 
higher scores reflect more negative attitudes toward the mentally 
il. 


> Adapted from Cowen et al. (1958) blindness items by 2pp10- 
priate substitution of disability group. 
© Items constructed by the author (cf. Footnote 7 to text) 
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the case for the various scales in Table 4 
original to the present investigation. Here, we 
simply are dealing with sum scores for items 
grouped together on the basis of manifest 
content. The reliabilities of these newly con- 
structed ‘‘scales” are surprisingly high, 
however, given the small number of items they 
contain. Thus, the Spearman-Brown co- 
efficients for the five-item “deafness” and 
“cripple” scales range from .51 to .60. 

Inspection of Table 4 reveals that the large 
majority of 7’s are significant beyond the 01 
level. For the “mental illness” scales, highly 
significant positive correlations are found 
between items worded in the same direction 
on the two scales, but r’s drop to nonsignificant 
levels when obtained between scales worded 
in opposite directions. Table 4 further shows 
that Srole’s Antiminority scale, the California 
anti-Negro scale, and the scales comprising 
the various physical disability groupings are 
all significantly related to the OP scales when 
response set effects are held constant. While 
such effects may exert an influence, suggested 
by the larger 7’s in the OP negative as opposed 
to the OP positive column, item content would 
seem to be of major importance in the present 
case. In sum, subjects negatively disposed 
toward old people tend to hold unfavorable 
attitudes toward ethnic, physical disability, 
and (possibly) mental illness minorities. 


Normative Data for Correlates of the OP 
Scales? 


The F+ scale means and SDs for the BU 
and NU-I samples are very similar in 
magnitude to norms listed in The Authoritarian 
Personality for college student samples. The 
five-iten F Scale used with the NU-I 
sample seems to elicit stronger authoritarian 
sentiments. It will be recalled that these items 
dealt exclusively with “acceptance of author- 
ity,” and were very possibly less extreme in 
connotation than many of the F Scale items 
used with the first two samples. 

1 A one-page table giving the means, standard devia- 
tions, and reliability coefficients for correlates of the 
OP scales has been deposited with the American Docu- 
mentation Institute. Order Document No. 6691 from 
ADI Auxiliary Publications Project, Photoduplication 
Service, Library of Congress; Washington 25, D. C., 
Temitting in advance $1.25 for microfilm or $1.25 for 
Photocopies. Make checks payable to: Chief, Photo- 
duplication Service, Library of Congress. 


Anomie scores in the present samples are 
of lower magnitude than authoritarianism 
scores. Unfortunately, norms for college sub- 
Jects are not available for the Anomie scale. 
The various antiminority scales yield con- 
sistently low means. In general, subjects are 
more likely to hold favorable than unfavorable 
sentiments about the various “minorities” 
sampled by the items of the present study. 
Tt should be noted, in addition, that the means 
and SDs obtained for the AN and B scales 
correspond quite closely to those reported by 
Adorno et al. (1950) and Cowen et al. (1958), 
respectively. All of the “minority” scale per 
item means are lower than the per item means 
for the negative OP scale shown in Table 1. 
Comparison with the positive OP scale is not 
warranted in view of the uniformly negative 
direction of the antiminority items. 

For the particular items employed in the 
present study, then, subjects more readily ex- 
press negative sentiments toward old people 
than toward members of ethnic, mentally ill, 
and physically disabled groups. This finding 
must, of course, be considered highly tentative 
since items across minority clusters vary in con- 
tent and wording over and above the difference 
in minority group referent. Further work along 
these lines might well take the form of devel- 
oping a pool of items whose content would be 
applicable to a wide array of minorities. 


Personality Correlates of the OP Scales 


It will be recalled that certain explicit 
predictions were made regarding a number of 
personality dimensions. A factor analysis of 
the 30-item true-false personality inventory 
was carried out, combining the BU and NU-I 
samples and including sex of the subject as 
a variable. Seven factors were extracted by the 
Principal Axes method and these were then 
rotated to a quartimax solution (Neuhaus & 
Wrigley, 1954). The first three factors are 
relatively clear. Factor I has highest loadings 
on those items involving nurturant concern 
for others. Factor II yields highest loadings on 
items expressing feelings of self-doubt in 
relations with other people. Factor III appears 
to tap rejective misanthropic sentiments. The 
remaining factors account for relatively small 

A three-page table listing factor loadings (quarti- 
max) for the een eats items has been de- 
Posited with ADI (see Footnote 12). 
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portions of the overall variance. Sex of the 
subject yields the highest loading on Factor 
IV. Loadings of the other items on this factor 
are quite low. Factor V involves sentiments of 
yielding to or resisting others’ judgments or 
wishes. Factor VI shows highest loadings on 
items expressing doubt about personal achieve- 
ment. Factor VII accounts for a minute 
portion of the variance, and, hence, was not 
considered further. 

Factor scores for individuals were derived 
from the factor loadings and correlated with 
OP scale scores (Table 5). In general the 
results are in the predicted direction. Factor 
I—essentially a nurturance factor—is signifi- 
cantly correlated with OP scale scores. The 
more favorably subjects are disposed toward 
old people, the greater their score on Factor 
I in the nurturant direction. Rejective, mis- 
anthropic sentiments tapped by Factor III 
correlate significantly in the predicted direc- 
tion for only the positive OP scale, suggesting 
that the observed relation is partially attribut- 
able to response set effects. It should be noted 
that the factors containing the autonomy and 
achievement items did not correlate with 
attitudes toward old people, though this may 
be a reflection of the small number of items 
representing those dimensions in the inventory. 
Finally, there seems to be no simple relation 
between self-esteem tendencies and feelings 
about old people. 


DISCUSSION AND CONCLUSIONS 


That attitudes toward old people are scal- 
able is strongly indicated by the present data. 
The use of items worded in opposite directions 
has permitted us to examine the extent to 
which response set effects permeate the various 
scales. Correlations between the matched 


TABLE 5 


Corretations (Product-Moment r’s) BETWEEN OP 
S AND Factor SCORES DERIVED FROM 


PERSONALITY RY 
(N = 296) 
Scale g IL m> Iv v VI 
OP-+ | .14* -06 | .18** |—.07 | .03 |—.09 
OP— | .21** | —.04 | .07 .03 | .10 | .05 


® High scores are in the nonnurturant direction. 
P High scores are in the misanthropic direction. 
* Significant at the .05 level. 
** Significant at the .01 level. 


positive and negative “old people” scales ar 
in the content direction, but of moderate size 
Compared to the present attitude domain, 
undergraduate subjects seem to show greater 
logical consistency in responding to ethnic 
minority items (Prentice, 1956), but greater 
inconsistency in regard to authoritarianism 
(Christie, Havel, & Seidenberg, 1958). 
Consider next the relationships observed 
between attitudes toward old people and othe 
attitudinal variables. Of particular interest is 
the fact that attitudes toward old people ar 
related to feelings of anomie and apparently un: 
related to authoritarian tendencies. This resul 
is in marked contrast to the finding thal 
prejudice toward ethnic minorities is associated 
with both authoritarianism and anomie (Rob 
erts & Rokeach, 1956; Srole, 1956). Hence, 
while “old people” may constitute a minority, 
such status does not necessarily render then 
equivalent to ethnic minorities in their rela 
tionship to authoritarianism. This is hardly 
surprising when one considers the unique fea- 
tures of old people as attitude objects. Old 
people cannot be categorized in strictly out 
group terms, for most individuals will have 
old people as members of their families. In 
addition, old people constitute a minority o 
which most persons will eventually belong: 
Given these conditions, one might expect 4 
slight positive association between authori= 
tarianism and favorable attitudes toward old 
people. Note, however, that old people do not 


and, indeed, are often weak, exposed, and 
highly vulnerable. Given the alleged connec 
tion between authoritarianism and contemp 
for weakness, it would follow that the mor 
authoritarian individuals should be mo 
negatively disposed toward old people. Since 
no consistent significant relationship was foung 
between the two variables in question, counter- 
vailing forces of the kind described may be 
responsible. 

How can we account for the significant 
relationship between feelings of anomie and 
attitudes toward old people? Certainly, agree- 
ment with anomie items implies that the 
subject has a precarious and threat-oriente 
view of life. Old age, in the eyes of such” 
subjects, might well represent that period of 
life when their pessimistic predictions have 
been confirmed, i.e., old people have no future, 
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are helpless, and awaiting death without life’s 
meaning made clear; and, indeed, the views 
of these subjects have some basis in reality 
for many old people in American society. It 
seems reasonable, then, that feelings of anomie 
and negative attitudes toward old people would 
be related, for the more anomic subjects would 
be likely to perceive the aged as symbolic of 
that period of life when an individual is least 
able to cope with a hazardous environment. 

It will be recalled that the relation between 
attitudes toward old people and toward men- 
tally ill persons was not entirely free of re- 
sponse set effects. The consistent significant 
relationships found between attitudes toward 
old people and toward physically disabled 
groups, however, suggest that subjects are 
more likely to associate aging with a state 
of physical disability as opposed to one of 
mental illness. It should be noted, however, 
that the Gilbert and Levinson mental illness 
scale does not contain any items referring 
specifically to the mental deterioration asso- 
ciated with senility. Subjects may well con- 
sider the latter to have a physical locus, and, 
hence, to be distinguishable from mental 
illness conceived more generally. Finally, 
subjects who are prejudiced against ethnic 
minorities tend to be negatively disposed 
toward old people, though, as we have seen, 
the two sets of attitudes do not seem to have 
a common base in authoritarian tendencies. 

In sum, the present data suggest that there 
is a general trend for subjects to be positively 
or negatively disposed toward a wide variety 
of groups deviating in some respect from a 
hypothetical norm of similarity to self. Hence, 
for negatively disposed subjects, members of 
ethnic minorities, physically disabled persons, 
old people, and possibly mentally ill people 
are groups to be avoided or rejected. While 
parts of this broad attitude complex seem to 
relate directly to a concept of the authoritarian 
personality, other parts do not. There is some 
indication in the present data that Srole’s 
concept of anomie may be most relevant to 
the formation of a wide range of attitudes, 
though a definitive statement along these 
lines must necessarily await the development 
and application of reversed scales for the 
many variables in question. 

Much of the discussion thus far has been 
couched in terms of the negative end of the 


attitude continuum, The observed relation 
between positive attitudes toward old people 
and nurturant personality dispositions helps 
to clarify the opposite end of the attitude di- 
mension under consideration. If persons with 
strong nurturance needs are attracted to those 
in need of succorance, old people may be highly 
appropriate objects in this regard. For there is 
considerable evidence that dependency is one 
of the more salient qualities attributed to old 
people by the younger generation (Golde & 
Kogan, 1959). While such dependency would 
arouse positive sentiments in nurturant per- 
sons, the dependent state of the older person is 
likely to arouse ambivalence or conscious 
hostility in those less inclined in the nurturant 
direction. Unfortunately, we do not know 
whether nurturant needs play a part in positive 
orientations toward other kinds of minorities. 
It is conceivable, however, that the extent of 
the relationship would be a function of the 
prominence of dependency as a characteristic 
of the minority group in question. 


SUMMARY 


The present paper reports on the develop- 
ment of a Likert scale for assessing attitudes 
toward old people and discusses empirical 
relations found between such attitudes and 
other attitudinal and personality variables. 
Three samples of college undergraduates 
served as subjects. 

“Old people” items were constructed in the 
form of positive-negative pairs, yielding two 
“old people” (OP) scales: a scale containing 
items making unfavorable reference to old 
people and a scale containing matched favor- 
ably worded items. Item and scale means 
and standard deviations are reported, as are 
item-sum correlations and scale reliabilities. 
The use of matched positive-negative item 
pairs permitted a test of the extent to which 
response set effects permeate the scales, Cor- 
relations between positive and negative scales 
were significant in the direction of logical 
consistency of response. In addition, all cor- 
relations between the positive and negative 
members of specific item pairs were in the 
logically consistent direction, the large major- 
ity significantly so. 

A number of significant relationships ob- 
served between attitudes toward old people 
and other attitudinal and personality variables 
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proved to be partially attributable to response 
set effects. We were forced to conclude on this 
basis that no clear consistent relation obtained 
between authoritarianism as measured by the 
F Scale and attitudes toward old people. On 
the other hand, unfavorable attitudes toward 
old people were associated with feelings of 
anomie, and with negative dispositions toward 
ethnic minorities and a variety of physically 
disabled groups. A nurturance factor derived 
from a brief personality inventory given to 
the subjects was significantly correlated with 
OP scale scores, the more nurturant subjects 
being more positively disposed toward old 
people. Theoretical interpretations of observed 
associations between the OP scales and other 
measures are advanced in the body of the 
paper. 
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of medical symptoms, as reported by the 

patient, are related to patient charac- 
teristics other than simply his diagnosed 
medical, surgical, neurological, etc. disease or 
diseases. Thus, in developing the Cornell 
Medical Index (CMI) (Brodman, Erdmann, 
Lorge, & Wolff, 1949, 1953), its authors found 
that older people check more physical symp- 
toms than do younger people, and that women 
check more than men. An observation similar 
to the latter has been made independently 
(Boake, 1958) in a study of “common re- 
spiratory diseases.” Boake found that the 
females in his groups actually had slightly 
more illnesses on the average than the males, 
and also that the females had, on the average, 
slightly more symptoms per illness than the 
males. He did not, however, present the statis- 
tical significance of these differences, 

The authors of the CMI also suggest that 
when a large number of positive replies is made 
on their Health Questionnaire (including both 
medical and psychiatric symptoms), psychia- 
tric disturbance is indicated, as is the case 
when certain replies fall specifically in the 
‘fatigability” and “frequency of illness” 
categories. Thus, they found that emotional 
state or adjustment pattern are other patient 
characteristics which are associated with 
medical symptomatology (Brodman, Erd- 
mann, Lorge, Gershenson, & Wolff, 1952). 

Hinkle and Wolff (1957) have studied 
further the hypothesis that medical and psy- 
chiatric symptoms tend to occur together, 
They state: “... ‘Illness’ is a state of the 
organism and... when a human moves from 
a state of ‘health’ into a state of ‘illness,’ 
the ‘illness’ is likely to be manifested by a 
variety of syndromes appearing concurrently 
or consecutively, their nature being dependent 
upon the various factors acting upon the 
organism at that time.” In studying long-term 
health records they found that: 

1. Individuals having the greatest number 
of illnesses experienced a wide variety of them 
(both major and minor) in a number of body 


I has been observed that number and kind 


to the physical or emotio 
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systems. There was a relationship between 
number of major illnesses and number of 
minor illnesses experienced by an individual, 
and between number of illnesses and number 
of body systems involved. : 

2. Both in those individuals having many 
illnesses and those having few, illnesses were 
not scattered randomly through their lives, 
but usually appeared in clusters. 

3. In general, all illnesses were likely to 
occur in periods when the individual was hav- 
ing difficulty in adapting to the total configura- 
tion of his life. 

4. A large percentage of those with high 
susceptibility to illness had had a difficult 
early life, although this relationship was by 
no means absolute. 

While clusters of illness often could be 
related to adverse emotional circumstances, 
there were also periods of emotional stress 
which apparently were not associated with 
any significant somatic disturbances, No 
Statistical presentation of their data was 
made, however. j 

On the basis of all the above findings 
(Boake, 1958; Brodman et al, 1952, 1953; 
Hinkle & Wolff, 1957), then, one might expect 
to find that: 

1. Psychiatric patients, presumably under 
more intense, longer-lasting, and more faapa: 
stress, in general, than medical patients, woul 
have a greater number of bodily symptoms 
than would medical patients. | 

2. The number of psychiatric symptoms 
(degree of emotional distress) on various tests 
would be highly correlated with number if 
“physical” symptoms in both medical an 


Psychiatric groups, or that total number 
Physical symptoms is nearly as good a measure 
of psychiatric status as is number of psychia 
tric symptoms (although no assumption, i 
made as to which may be precursor © 
other), 
b 3. The total number of symptoms should 
€ Positively correlated with age: a 
En i men should rea 
Within both groups, yon a A 
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greater total number of symptoms than the 
men. 

Each of these hypotheses is either an in- 
dependent check on the actual findings or an 
implication derived from the results of the 
investigators cited. 


METHOD 


The 162 subjects used here consisted of 40 medical 
inpatients, 42 medical outpatients, 40 psychiatric in- 
patients, and 40 psychiatric outpatients (all white) who 
were under treatment at the University of Oregon 
Medical School Hospitals and Clinics. They were taken 
at random from among those coming through the 
services at the time the study was being carried out, 
provided they were willing to fill out the forms, had 
the education and intelligence to do so, and, for the 
medical inpatient group, were not too physically 
incapacitated. 

The patients were requested to fill out the test forms 
used in this research “in order to provide background 
information on their cases for the doctors.” Patients 
were asked to complete the CMI (long form), the 
Taylor Manifest Anxiety scale, and the Saslow Psycho- 
somatic Inventory, The CMI is a 195-item question- 
naire designed to obtain a standardized medical history 
(Brodman et al., 1949). It covers, separately, the differ- 
ent body systems; e.g., eye and ear, respiratory, cardio- 
vascular, digestive, musculoskeletal, skin, nervous 
system, and genitourinary; as well as frequency of ill- 
ness, miscellaneous diseases, and mood and feeling 
patterns. Sections A through K cover primarily physical 
manifestations in separate body systems, whereas L 
through R cover habits, and mood and feeling patterns. 
Thus, the total Cornell score was obtained and then 
broken down into a score for A-K (medical symptoms) 
and L-R (emotional symptoms), 

The MA (Taylor, 1953) is a 50-item questionnaire 
covering both psychological and physical symptoms of 
anxiety or “neuroticism.” It has been found to differ- 
entiate psychiatric from medical and nonpatient groups 
(Matarazzo, Guze, & Matarazzo, 1955). 

The Saslow Screening Inventory (Gleser & Ulett, 
1952; Saslow, Counts, & DuBois, 1951) is a measure of 
psychological and physical reactions to anger and to 
anxiety producing situations. 


RESULTS 


Analyses of variance were computed in 
order to ascertain comparability of the four 
samples in regard to age, which has been re- 
ported to be associated with symptomatology 
(Brodman et al., 1953). It was found that the 
medical inpatients were significantly older 
than the medical outpatients (48.35 vs. 27.76, 
p of .001),! and this was true for both males 

1 The age ranges for the four groups were as follows: 


medical outpatients 18-52; medical inpatients 14-68, 
eo outpatients 16-65, psychiatric inpatients 


and females. This difference of 20 years on 
the average is considerable and probably re- 
flects the general fact that the illnesses requiring 
hospitalization are usually encountered in 
older age groups. There is no such difference 
between the psychiatric in and outpatients, 
but the total psychiatric group is significantly 
younger than the medical group (33.05 vs. 
37.80, p of .05). This difference is due, however, 
to the advanced age of the medical inpatients, 
while the medical outpatients do not differ 
in age from the psychiatric groups. If increased 
age is associated with more symptoms, this 
age discrepancy should work against the 
hypothesis that the psychiatric population 
will have more medical symptoms than the 
medical group. 

Tt was also important to determine whether 
there was an age difference between the sexes 
because this might complicate the relationship 
between sex and number of symptoms. Within 
each of the four groups there were no statis- 
tically significant sex differences in age, al- 
though there was a consistent trend for the 
male medical patients to be older than the 
female medical patients. When the two medical 
groups (inpatient and outpatient) were com- 
bined, this difference was significant (43,71 
vs. 33.40, p of .05). There was no sex difference 
in age within the psychiatric groups. Thus, 
this population, it appears that medical 
treatment is likely to be required or sought 
at an earlier age in females than in males, 
generally speaking. There was no such sex 
difference in psychiatric illness. Since the age 
difference between the sexes in the medical 
groups was found to be approximately the 
same (8 years) for both in- and outpatients, 
it appears that both minor and major ailments 
were more prevalent among females who were 
still in their prime. This age difference, 
however, should also work against the hypoth- 
esis that the males will have fewer symp- 
toms than females, since the former were older. 

There were no significant educational differ- 
ences between sexes, nor was level of education 
associated with in- or outpatient status. The 
psychiatric group was, however, slightly supet- 
ior in education to the medical group (11. 
years vs. 10.00 years, p of .01). This was 
largely due to the superiority of the psychiatric 
inpatient sample, which had a mean of 12.00 
years of education. 


a 
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MEDICAL AND PSYCHIATRIC SYMPTOMS 


Table 1 presents the means of all groups on 
each of the scales, and the F values for 
differences among them, and indicates that 
four out of five measures significantly differ- 
entiate the groups, the psychiatric patients 
having the greater number of symptoms. The 
one measure which was not significant was the 
Cornell A-K (medical) items, while the Total 
Cornell and all the psychiatric measures 
attained statistical significance. Thus, number 
of medical symptoms does not discriminate 
among the groups as a whole. If one were to 
rank the groups on each of the scores, however, 
the medical outpatient group would, in every 
case but one (Cornell L-R), appear to be the 
lowest on all medical and psychiatric scales. 
Similarly, the medical inpatient group is, in 
almost every case, the next in rank, followed 
by the psychiatric inpatients and psychiatric 
outpatients, in that order. Thus, one may 
conclude that, in general, the medical out- 
patients presumably experienced, and for 
this reason reported, the least number of 
medical and psychiatric symptoms (and, thus, 
inferentially the least amount of stress), while 
the psychiatric outpatients suffered the most. 
One might ordinarily have expected the psy- 
chiatric inpatient sample to report the greatest 
number of symptoms, and there is no obvious 
explanation for the fact that they did not. 
However, the differences in means between 
this group and its outpatient counterpart is 
small on all measures. In regard to medical 
symptoms (A-K), there is a nonsignificant 
trend for the four groups to be differentiated, 
which, however, follows the identical pattern 
of all the psychiatric scales. All of the other 
scales differentiate the groups, three of them 
at the .001 level (Cornell L-R, Anxiety scale, 
and Psychosomatic Inventory). Analysis by 
1 tests established that, on these es, 
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each of the medical groups was significantly 
different from each of the psychiatric groups, 
but differences within medical and psychiatric 
groups were not significant. On the “medical” 
part of the Cornell Index (A-K), only the two 
extreme groups were statistically differentiated 
(p of .05)—the medical outpatients from the 
psychiatric outpatients. This discrimination, 
and the consistent trend for medical symptoms 
to differentiate the groups in the same way as 
did emotional symptoms, gave some support 
to Hypothesis 1: the psychiatric patients, 
presumably under greater total stress than 
the medical patients, experienced many more 
psychiatric symptoms and had a tendency 
toward more physical symptoms. It would 
appear from Table 1, however, that the number 
of psychiatric symptoms was & better means of 
establishing psychiatric status than was the 
number of physical symptoms. 

Table. 2 presents the Pearson product- 
moment correlations of all the measures for 
the total medical group, including the cor- 
relations with age. (Since the comparable 
correlations for the psychiatric sample in- 
dicated that the relationships were identical 
for both groups, the additional table has been 
omitted.) From Table 2 it can be seen, first of 
all, that the total CMI was highly correlated 
with both of its subparts (,94 with the “med- 
ical” A-K and .82 with the “psychiatric” 


parts, the total CMI was almost as highly 
correlated with the MA 
nificantly correlated with the Saslow Psy- 
Inventory (.55). Furthermore, 


slightly below those 
of the total index (.59 with L-R, .62 with the 
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Psychiatric Outpatients 


p level 
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TABLE 2 
INTERCORRELATIONS AMONG ÅGE AND SCORES FOR TOTAL MEDICAL GROUP 
(N = 82) 
E 
a Camel I i era a 
Age +14 17 .02 .17 .05 
Total Cornell Index +94 -82 78 55 
A-K (Medical) Cornell 59 +62 40 
L-R (Psychiatric) Cornell +82 61 
Taylor Manifest Anxiety scale 63 
Saslow Psychosomatic Inventory 
p .05-.22 
$ .01-.28 
Anxiety scale, and .40 with the Psychosomatic TABLE 3 


Inventory). Thus, the number of medical 
symptoms was highly related to each of the 
psychiatric scores. Hypothesis 2, then, was 
well substantiated; one can, with fair success, 
predict number of psychiatric symptoms from 
knowledge of the number of medical symptoms 
teported and vice versa. This is true, to a 
greater or lesser extent, depending upon the 
psychiatric scale used. The lower correlation 
of the psychosomatic inventory was probably 
associated with its smaller range of scores. 

The remaining correlations, ranging from 
-61 to .82, show the extent to which the psy- 
chiatric measures were correlated, indicating 
that there is high predictability from one to 
the other although there is little similarity 
among the items of which they are comprised. 
Interestingly, as shown in Table 2, age was 
found not be associated with number of symp- 
toms of any kind. 

Table 3 presents a comparison of male and 
female scores on all measures within each of 
the four subgroups. Within both medical 
subgroups, comparison of the males and fe- 
males indicated that there were no differences 
in number of either medical or psychiatric 
symptoms. Thus, contrary to the results of 
Brodman et al. (1953) and Boake (1958), the 
medically sick but presumably emotionally 
healthy women of this study did not indicate 
that they had a significantly greater number 
of medical ailments or concerns than did the 
men. It should be mentioned, nevertheless, 
that while no difference between groups at- 
tained significance, there was a consistent 
trend for the women to score higher on each 
scale, in agreement with the results of these 
investigators. 


COMPARISON OF MEAN SCORES OF MALES AND FEMALES 
WITHIN EACH OF THE Four Groups 


Saslow 
L-R, Taylor 
Total A-K 7 ylor Psycho 
Geel lata Cech esi Se 
Index Corell Cornell “scale i he, 
Medical Inpatients 
(W = 40) 
Males 35.70 28.30 7.40 15,00 5,05 
Females 42.25 30.30 11.95 19.10 8.30 
F 82 +03 3.03 1.81 2% 
Medical Outpatients 
W= 42) 
META AU YOO aa 
Males 31.07 24.07 10.00 14,53 48 
Females 34.85 24.85 10.00 14.96 5.85 
F -28 -61 +00 -01 -48 
Psychiatric Inpatients 
(W = 40) 
MEADS aaa ine in tl 
Males 38.61 23.00» 15.61 24.08 8.94 
Females 57.27 35.45 21.82 26.88 16.64 
F 4.85% 4,03 3.25 36 7.65 
Psychiatric Outpatients 
W = 40) 
Males 41.78 22.78 19,00 26.65 96t 
Females 61.86 38.41 23.45 26.04 14.64 
F 4.919 7.05% 1.32 „02 290 


* .05 level of probability, 
** .01 level of probability. 


Among the psychiatric groups, howevely 
the women scored significantly higher ot 
certain scales. Thus, in both psychiatric 
groups, the women obtained a higher total 
CMI score than did the men (p of .05). When 
the CMI was broken down into its “medical! 
and “psychiatric” subparts, it is apparent 
from Table 3 that, while there was a trend 
for the women to score higher than the men 
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-in both categories, it was mainly the “medical” 
category which differentiated them. This 
difference attained significance (p of .05) in 
the outpatient sample, and nearly reached this 
level for the inpatient group. While the two 
remaining psychiatric scales showed a con- 
sistent trend in both groups for the women 
to be higher, this difference reached signifi- 
cance only in the Saslow Psychosomatic In- 
ventory for the inpatient group ($ of .001). 
Thus, among the psychiatric patients, the 
women did have significantly more medical 
complaints than the men, as indicated by 
the Cornell A-K score, but also more com- 
plaints on the Saslow Psychosomatic Inven- 
tory, There was also, as in the medical group, 
a consistent trend for the women to report 
more psychiatric symptoms. 

In Table 4, a comparison of the four sub- 
groups is presented with the sexes separated. 
When only the males were compared, the 
difference (presented in Table 1) among 
groups on the Total Cornell Index no longer 
teached significance. In addition, the psy- 
chiatric scales, while still discriminating the 
groups, tended to be less efficient in their power 
to differentiate the samples. 

On the other hand (as seen in Table 4), 
when the females were compared, the differ- 
ence between groups on the Total Cornell 
Index was increased from p of .05 (Table 1) 
to $ of .01, and this greater difference was 
largely contributed by the medical (A-K) 
items. While A-K did not differentiate the 


groups when the sexes were combined (Table 
1), it did differentiate the females of the 
groups (p of .05). For the females, the other 
(psychiatric) measures were not essentially 
different in their level of statistical significance 
from those presented in Table 1 for the two 
sexes combined. It can be seen (Table 4), 
however, that in general the females con- 
tributed more to the group differences (Table 
1) than the males, and that female psychiatric 
patients differed more from their psychiatric- 
ally healthy (medically ill) counterparts 
than did the males. 


DISCUSSION 


It is interesting that the females tended to 
appear for treatment of both minor and major 
illnesses at an earlier age than did the men 
(although not for psychiatric treatment), and 
that the women psychiatric patients differed 
from all groups of males and from medically 
ill females in that they reported many more 
physical (medical) disturbances. An explana- 
tion for this discrepancy between the sexes 
is diffcult to find. A greater number of minor 
complaints could be attributed purely to 
greater awareness of, or concern with, physical 
symptoms. However, the same age difference 
prevailed in both medical outpatient and in- 
patient groups, the latter presumably having 
had to convince not only themselves but 
several physicians that they were seriously ill, 
(The sex ratio of approximately 1:1 in this 


TABLE 4 


Comparison oF MEAN SCORES AMONG THE Four GROUPS 


Total Cornell 
Index 
Males 
- .05 
Medical Inpatients 35.70 i 
Medical Outpatients 31.07 £54 
Psychiatric patients 38.61 9.61 
Peychiatric Outpatients 41 H 2.98 
? level $ 3 
Females 
7 .10 8.30 
Medical Inpatient 42.25 30.30 11.95 4 
Medical Outpatients 34.85 24.85 10,00, 26.88 16.64 
Psychiatric Inpatients 57.27 35.45 24.87 26.04 14.64 
Psychiatric Outpatients 61.86 38.41 3.84 5.95 9.04 
F 5.51 as “001 -01 001 


60 


study is artificial since an attempt was made 
to obtain equal numbers of men and women in 
each group.) It seems probable, then, that 
women tend to have a somewhat greater 
number of both major and minor illnesses. 
Adding weight to this conclusion is the fact 
that in both psychiatric groups the age differ- 
ence did not hold. If it were purely a matter of 
women seeking treatment earlier than men, 
this should be reflected in the psychiatric 
groups also. Again, consistent with the hypoth- 
esis of greater physical vulnerability among 
females, was the tendency for the latter to 
` report more symptoms under emotional stress 
(psychiatric group). Thus, under the same 
emotional stress (as measured by scores on 
psychiatric scales), the women reported more 
physical distress than the men. Again, if it 
were purely a matter of greater awareness of 
discomfort, the women would be expected to 
report significantly more psychiatric symptoms 
also. 
A paper by Mandler, Mandler, and Uviller 
(1958) is interesting in this regard. They 
studied a group of subjects which reported 
high awareness of autonomic changes under 
stress, and one which reported low awareness. 
Actual physiological measurements taken in a 
stress situation tended to corroborate the 
subjects’ reports; i.e., those who reported 
greater awareness of autonomic changes were 
those who, by measurement, were found to 
have the larger physiological responses. Also 
notable, however, was the fact that the sub- 
jects who were most physiologically responsive 
tended to overestimate their physiological 
changes, whereas those with low responsiveness 
tended to underestimate physiological changes. 
It is possible, then, that women are slightly 
more sensitive or more physiologically reactive 
to stress than men, but that they also tend to 
overestimate the extent of the disturbance, or 
to be more highly aware of it. This could be 
analogous to the apparent “conditioning” 
process involved in many psychological dis- 
turbances, such as the anxiety neurotic’s 
“fear of fear.” That is, after experiencing 
episodes of a particular kind of discomfort, one 
can learn to anticipate or dread further 
instances of discomfort while becoming pro- 
gressively sensitive to lower magnitudes of 
Pertinent cues (the “reaction-sensitization” of 
Cameron, 1947). That, for example, the anxiety 
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neurotic’s fear of the symptoms of fear may be 


at least partly determined by physiological 
preconditions seems indicated by the report of 
Cohen (1949) that the vital capacity of the 
anxious person tends to be lower than that of 
normals, and that, thus, his breathing, heart 
rate, etc. may be more vulnerable to stress, 
Lacey (1956), in demonstrating that each 
individual has a specific autonomic pattern of 
reactivity to stress, has provided the possibility 
of explaining the anxious patient’s fear of, or 
concentration upon, specific physiological 
symptoms; e.g., onè patient’s fear that he has 
heart trouble; another’s fear of vomiting, being 
unable to get his breath, or fainting; another's 
obsession with headache, constant fatigue, or 
bowel function. The results of Grossman and 
Greenberg (1957) suggest that the autonomic 
patterns described by Lacey are present in the 
newborn, and, thus, are apparently constitu- 
tional, at least in part. That physiological 
precondition may be an important factor in 
“psychiatric” symptomatology is further sug- 
gested by a report on bronchial asthma, allergic 
rhinitis, and pulmonary emphysema (Heiskell, 
Rhodes, & Thayer, 1959). The patients were 
rated as to severity of functional (breathing) 
impairment and compared on the Hypochon- 
driasis, Depression, and Hysteria scales of the 
MMPI. Whereas asthma has been said to be a 
“psychosomatic” disease, it was found that 
disease entity did not seem to account for 
differences in MMPI score, although severity 
of functional impairment (in all three diseases) 
did, Thus, the physiological changes, 
symptoms, or impairments associated with the 
illness seemed to be related to emotional 
distress as reflected in the MMPI. 

Regardless of etiology, however, the high 
Positive correlation found in the present study 
between number of medical and psychiatric 
symptoms (Table 2) tends to confirm the 
Hinkle and Wolff observation that the state of 
health or illness is a condition of the organism 
as a whole, and that when the organism 15 
under stress, it is likely to reflect this stress by 
symptoms of malfunction in a number of areas. 


SUMMARY 


This investigation has attempted to relate 
number of symptoms to patient characteristics 
other than diagnosed medical, neurological, etc- 
disease or diseases. Four groups of patients 
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were studied: psychiatric inpatient, psychiatric 
outpatient, medical inpatient, and medical 
outpatient samples, comprised of approxi- 
mately equal numbers of males and females. 
It was found that, when the groups were 
compared for number of symptoms on the 
Total Cornell Index, Cornell A-K (medical 
symptoms), Cornell L-R (psychiatric symp- 
toms), Taylor Manifest Anxiety scale, and 
Saslow Psychosomatic Screening Inventory, in 
general, the medical outpatients had the 
fewest symptoms, followed by the medical 
inpatients, psychiatric inpatients, and psy- 
chiatric outpatients, in that order. When medi- 
cal and psychiatric symptoms were correlated, 
it was found that number of medical symptoms 
(A-K) correlated .59 with (L-R) psychiatric 
symptoms, .62 with score on the anxiety scale, 
and .40 with number of symptoms on the psy- 
chosomatic inventory, Thus, number of 
medical symptoms was a good predictor of 
psychiatric symptomatology and vice versa. 
Female psychiatric patients were found to 
have many more medical symptoms than 
either male psychiatric patients or male or 
female medical patients. Males and females 
did not differ significantly in regard to number 
of psychiatric symptoms, however. This was 
considered in relation to the fact that the 
female medical patients were approximately 
10 years younger than the male medical 
patients, although there was no such difference 
in the psychiatric group. The results suggested 
the hypothesis that women are physically more 
reactive to stress, and that this physical sensi- 
tivity may then “condition” them to greater 
awareness of, or concern with, the physical 
manifestations of stress. 

In addition, the high relationship between 
medical and psychiatric symptoms was dis- 
cussed in relation to individual autonomic 
response patterns described by Lacey, and to 
Hinkle and Wolff's hypothesis that the state 
of health or illness, is a condition of the 
organism as a whole. 
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UMAN beings display a variety of 

conceptions of “the good life.” 

Some lay stress on the overcoming of 
obstacles through persistent striving; some 
favor the search for sensual comforts and 
pleasures; others emphasize the more intel- 
lectual joys to be obtained in passive con- 
templation and reflection. Whereas some 
people perceive values in terms of society, 
through participation with others toward 
group goals, others organize their value 
systems around the essential dignity and 
independence of the individual. Whether they 
be expressed explicitly in the doctrines of 
religion and the policies of political organiza- 
tion, or expressed implicitly in the everyday 
behaviors of ordinary people living their 
normal lives, such values are both deeply 
ingrained and pervasively influential in 
determining choices, both great and small. As 
Kluckhohn et al. (1951) put it, “a value- 
orientation (is) a generalized and organized 
conception, influencing behavior, of nature, 
of man’s place in it, of man’s relation to man, 
and of the desirable and nondesirable as 
they may relate to man-environment and 
interhuman relations” (p. 411), Although the 
preferential ordering of values probably varies 
most notably across different groups of men, 
€g., across different cultures, it also varies 
among the individuals composing such 
groups—for example, differences in the assess- 
ment of human values are implicit in the 
Policies favored by different Americans for 
survival in this nuclear age. 

In Paths of Life (1956) Morris distinguished 
three basic components of personality on 
philosophical grounds, these being presumed 
to combine in various ways to produce various 

1 The basic data used in this study were collected 
by Olga McNemar in 1957, Norma Mae Ross helped 
work on this material. I. Richard Savage, Lyle V. 
Jones, and George J. Suci made helpful suggestions 
at various stages of the work. 


value orientations: “Dionysian” (tendency to 
release and indulge existing desires), ‘““Prome- 
thean” (tendency to manipulate and remake 
the world), and “Buddhistic” (tendency to 
regulate the self by holding in check its 
desires). The combination of these tendencies 
in their possible orderings, including that 
combination in which all are moderately 
strong (Way 7), generated the 7 “ways to 
live” which were described and explored in 
Paths of Life. These are the first 7 Ways given 
in essence in Table 1. When single-paragraph 
descriptions of these “ways to live” were 
given to students who were asked to indicate 
their preferences about them, the need for 
several additional alternatives was demon- 
strated. Ways 8 through 13 were added, and 
this 13-alternative form is the one which 
was used by Jonesand Morris (1956; Morris & | 
Jones, 1955) in a series of empirical studies of 
human values. These and other studies, with 
their implications, are summarized in Varieties 
of Human Value (Morris, 1956b). Table 1 
attempts to give only the essential emphasis | 
of these Ways, for purposes of identification 
throughout this report; the more complete 
descriptive paragraphs used as bases for : 
judgments by subjects in Morris’ original © 
studies and in the present investigation may 
be found in Varieties of Human Value (Mortis, 
1956b, pp. 15-19) and in a paper by Mortis | 
and Jones (1955, pp. 524-525). 

Morris obtained preferential judgments | 
(“Indicate ... how much you yourself like or 
dislike each of them”) of the 13 ways, using # 
seven-step scale from “like very much” to 4 
“dislike very much.” Sizable samples %» 
American, Indian (Asian), and Chinese y 


college students, and smaller samples in othe! 
countries, were used. When the preferenti! M 
judgments were subjected to factor analys$ 
(Morris & Jones, 1955), evidence for com 
siderable agreement in factor structure actos 
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TABLE 1 
A SAMPŁE oF HUMAN VALUES 


Way 1 preserve the best in society 

Way 2 self-sufficiency, reflection, and medita- 
tion 

Way 3 altruistic affection and concern for others 

Way 4 abandonment to sensuous enjoyment 

Way 5 group action toward common goals 

Way 6 progress through realistic solution of 
problems 

Way 7 integration of diversity 

Way 8 Tome enjoyment of simple com- 
orts 

Way 9 quiet receptivity to rience 

Way 10 dignified ie contzol ai 

Way 11 contemplation of the rich inner life 

) Way 12 dynamic physical interaction with the 

environment 

Way 13 humble obedience to cosmic purposes 


Note.—These phrases attempt to epitomize the statements of 
the Ways given in Varieties of Human Value, pp. 15-19; they are 
given to provide easy identification of the Ways, but are not the 
statements the subjects judged. 


‘different cultures was obtained, even though 

the preferences for various “ways to live” 

_ differed in these cultures. Varieties of Human 
Value describes relationships between these 
value factors and a number of other types of 

_ individual differences, including differences in 
religious affiliation, in social caste and class, 
in temperament, in personality, and in body 

- size and build. The fact that such relationships 
do exist both testifies to the validity of the 
value dimensions and serves to illuminate 
some of the ways in which values relate to 
other aspects of human existence. 

- For a number of years Osgood and his 
associates have been working on a psycho- 
logical theory of meaning and the objective 

: measurement of certain (connotative) aspects 
of meaning. Application of factor analytic 

techniques to large numbers of descriptive 

dimensions, defined by polar adjectives, has 

provided evidence for the existence of a 

limited number of connotative semantic 

factors—‘“‘evaluation,” “potency,” and “ac- 
tivity” being most clearly defined, and 

“stability” and “receptivity” being less well 

defined. The logic of this type of measurement, 

the factorial studies, and applications of the 
technique to a variety of fields have been 
described in The Measurement of Meaning 

(Osgood, Suci, & Tannenbaum, 1957). A 
major purpose of this research has been to 

ee it possible to index the connotative 

- meanings of concepts as locations in a common- 


dimensional space, and hence to be able to 
specify similarities and differences in concept 
meanings as distances (cf. Osgood & Suci, 
1952) in this space. Application of this type 
of measurement has been made to the predic- 
tion and measurement of attitude change 
(Osgood & Tannenbaum, 1955), to a blind 
analysis of a case of triple personality (Osgood 
& Luria, 1954), to cross-cultural studies of 
semantic similarities (Kumata, 1957; Triandis 
& Osgood, 1958), and to various psycholin- 
guistic, mass communication, and esthetic 
problems. 

The present investigation was planned 
with the prime purpose of bringing together 
these two kinds of measurement—of values 
and of connotative meanings—and determining 
what relations, if any, exist among them. The 
study can also be conceived as a semantic 
differential analysis of Morris’ “ways to live,” 
i.e., of the connotations that the various Ways 
(as concepts) have for a sample of American 
college students. Another purpose was to 
provide evidence for concept-scale interaction 
(cf. Osgood, Suci, & Tannenbaum, 1957, 
pp. 176-188); factor analyses of single con- 
cepts have indicated that both interscale 
correlations and even factor structure, to a 
lesser extent, can vary as a function of the 
particular concepts being judged, and we 
wished to see if the same phenomenon would 
hold for a restricted class of concepts—here, 
a set of value statements. Another methodo- 
logical interest lay in the complexity of the 
“concept” being differentiated. Most studies 
with the semantic differential have required 
the subjects to react to single words (eg., 
hospital, tornado) or at most short phrases 
(e.g, my doctor, use of the atomic bomb), 
although complex visual stimuli such as 
modern paintings have been used; here 
subjects reacted to very complex verbal 
statements, averaging 100 or more words, as 
if to unitary “concepts.” 

METHOD 
Materials 

Any form of the semantic differential involves a set 
of “concepts” and a set of “scales” against which to 
judge them. In the present case the concepts are a set 
of 13 statements, each running about 100 words, 


describing different value-orientations. These state- 
ments are the same as those used in other studies of 
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values by Morris (cf. references above). The semantic 
scales used in the present study were taken in the main 
from the 76 used in the Thesaurus factor analysis (cf. 
Osgood, Suci, & Tannenbaum, 1957, pp. 47-66) to 
represent the general factors derived in that study: 
“evaluation” (timely-untimely, kind-cruel, successful- 
unsuccessful, good-bad, positive-negative, beautiful- 
ugly, important-unimportant); “potency” (hard-soft, 
masculine-feminine, strong-weak, powerful-powerless); 
“activity” (active-passive, quick-slow, warm-cool, 
excitable-calm); “stability” (true-false, stable-change- 
able, wise-foolish, predictable-unpredictable); “recep- 
tivity” (interesting-uninteresting, _savory-tasteless, 
colorful-colorless); “novelty” (new-old, unusual-usual) ; 
and “tautness” (angular-rounded, straight-curved). 
The actual order and direction, left to right, of these 
scales is shown in Table 2, this arrangement being 
maintained for all concepts and all subjects, A separate 
semantic-differential page was provided for each 
concept, €.g., 


way 1 


and the Ways were judged in the 1 to 13 order given 
in Table 1. 


Subjects 

The subjects used in this study were 55 male college 
students at Stanford University. They were all 20 or 
21 years of age, This is a rather homogeneous subsample 
from the American population, and any generalizations 
from the present data must be tempered by this fact. 
The subjects completed the experiment in one session. 


Procedure 


The written instructions adapted the usual semantic 
differential instructions for use with the more complex 
stimuli, paragraph descriptions of the 13 “ways to 
live.” (See Osgood, Suci, & Tannenbaum, 1957, pp. 
82-84, for an example of the instructions generally 
given for the semantic differential.) Following these 
instructions, the subject found 13 single-page semantic- 
differential forms, one for each Way. Upon completing 
his differentiations, he read the following: 

Turn back to the thirteen ways to live which 
iat asia times have advocated and 
followed [each subject a set of the h 
descriptions of the Ways). iea 

Indicate by numbers, which you are to write in 
the margin, how much you yourself like or dislike 
each of them. Do them in order. Do not read ahead. 

Remember that it is not a question of which 
kind of life you now lead, or the kind of life you 
think it prudent to live in our society, or the kind of 
life you think good for other persons, but simply the 
kind of life you personally would like to live. 

Subjects indicated their preferences by writing a 
number from 7 (I like it very much) to i (I dislike it 
very much) beside each Way. 
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RESULTS 


Preference for and Connotations of the Thirteen 
Ways of Life 


Table 2 gives the mean judgments of each 
Way against each semantic scale, as well as 
the mean preference ratings (bottom row), 
for the 55 Stanford college men. Looking 
first at the preferences, we see that Ways 7 
(integration of diversity), 1 (preserve the best 
in society), 6 (progress through realistic 
solution of problems), and 12 (dynamic 
physical interaction with the environment)— 
in that order—are the most preferred value- 
orientations, while Ways 13 (humble obedience 
to cosmic purposes), 9 (quiet receptivity to 
experience), and 11 (contemplation of the 
rich inner life)—in that order—are the least 
preferred. There are approximately three 
scale units on the seven-step preference scale 
separating these most and least liked values, 
so the preferential discrimination is marked. 
Stanford men clearly favor diversity, progress, 
and action in contrast to the more humble, 
contemplative, and receptive modes of life. 
That this ordering of value preferences is not 
unique to our Stanford group is shown by the 
rank order correlation of their mean preferences 
with those of Morris’ (Morris & Jones, 1955) 
much larger college student group—a correla- 
tion of +.93. The only rank difference of any 
magnitude is for Way 4 (abandonment to 
sensuous enjoyment), which is ranked ninth 
by the larger group and sixth by the Stanford 
students. 

Comparing the mean semantic profiles for 
the Ways in Table 2 with their verbal state- 
ment in Table 1 (and see Morris, 1956b, pP- 
15-19) provides rather impressive evidence 
for the face validity of the judgments on 
which the profiles are based. Those means 
that are at least one scale unit away from the 
midpoints of the scales (4.00), and hence 
presumably significant deviations from neutr: 
(cf. Osgood, Suci, & Tannenbaum, 1957, 
pp. 126-140, where significance levels for 
group data are discussed), are indicated by 
asterisks in Table 2. Considering these means, 
we find interesting semantic characterizations 
for each of the Ways. For example, Way 1 
(preserve the best in society) is characterize 
as old, kind, good, stable, important, calm, 
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TABLE 2 
MEAN VALUES FOR THE THIRTEEN “Ways TO LIVE” 

Seaton] EE | Ryo E E E | War wer | War | er | Opposes 
interesting 4.91%.) 3.67 | 4.45 | 5.315 | 4.85 | 5.76%] 5.55” | 4.58" | 3.20] 4.64 uninteresting 
untimely 3.29 | 5.04°| 4.11 | 3.95 | 3.78 | 2.33%] 2.85% | 4.07 | 5.20% 3.84 timely 
new 2,449 | 2.65"! 2.768| 3.47 | 3.75 | 4.87 | 4.18 | 2.95% | 2.40% 3.13 old 
kind 5.33" | 3.15 | 6.31» | 3.65 | 4.24 | 3.98 | 4.75 | 5.138] 4.58 | 3.71 cruel 
hard 3.67 | 5.558] 2.51%] 4.27 | 4.02 | 5.073 | 4.11 | 2.45% | 2,608] 5.20% soft 
unsuccessful | 3.04 | 4.87 | 4.29 | 4.16 | 3.51 | 2.228] 2.76"| 4.42 | 5.64%] 3.75 successful 
active 3.96 | 2.768] 3.00 | 5.25*| 5.51%] 6.15%] 5.04%] 2.33 | 1.65%) 4.11 passive 
true 4,82 | 3.31 | 4.36 | 3.82 | 4.27 | 5.445 | 5.028] 3.82 | 2.96% 4,24 false 
tasteless 3.93 | 5.18%] 3.93 | 3.16 | 3.67 | 3.04 | 2.91%] 4.02 | 5.05% 4.15 savory 
good 5.225 | 3,27 | 5.49 | 3.85 | 4.42 | 5.169] 5.078] 4.33 | 3.58] 4.51 bad 
feminine 4.25 | 3.58 | 5.04” | 3.35 | 3.75 | 2.60%] 3.78 | 4.75 | 4.84] 3.31 masculine 
slow 4.82 | 4.80 | 4.93 | 2.919 | 3.16 | 2.60%] 3.53 | 5.64% | 5.988) 4.07 fast 
stable 5.96" | 4.29 | 4.84 | 2.47 | 3.67 | 3.47 | 3.95 | 5.38" | 4.93 | 4.98 changeable 
angular 3.58 | 4.75 | 3.64 | 4.49 | 3.71 | 4.27 | 3.53 | 3.20 | 3.95] 4.18 rounded 
cool 4.47 | 5.20%) 2.629) 3.92 | 3.51 | 4.27 | 3.73 | 2.78°| 4.36 | 4.65 warm 
curved 3.36 | 3.44 | 3.76 | 4.53 | 3.85 | 3.33 | 4.24 | 4,02 | 4.18] 3.53 i straight 
colorful 3.31 | 2.71"| 3.85 | 5.679 | 5.073 | 4.89 | 4.98 | 3.55 | 2.56") 3.27 |2. 5 .38*| colorless 
negative 3.24 | 5.31 | 3.07 | 4.02 | 3.27 | 2.533 | 3.02 | 4.38 | 5.11%] 3.85 | 4.95 | 2.75" 5.40%) positive 
unusual 3.33 | 5.33 | 4.67 | 4.16 | 3.33 | 3.53 | 3.60 | 3.69 | 4.75 | 4.44 | 4.69 | 4.05 | 4.45 | usual 
ugly 3.33 | 4.65 | 2.65" | 3.78 | 3.78 | 3.58 | 3.16 | 3.25 | 3.91] 4.16 | 4.04 | 3.58 | 4.20 | beautiful 
strong 4.40 | 4.07 | 3.84 | 3.98 | 4.35 | 5.69%] 4.73 | 2.89 | 2.36% 5.119] 2.759] 5.515] 2.134) weak 
important 5.07 | 4.20 | 4.96 | 4.15 | 4.55 | 5.64%] 5.07 | 3.65 | 3.11 | 4.78 | 3.18 | 4.84 | 2.98! unimportant 
excitable 2.40" | 3.20 | 2.76 | 5.315 | 4.85 | 4.67 | 4.20 | 2.56" | 2.05%) 3.09 | 2.40%) 5.444) 2.444! calm 
foolish 2.78" | 4.98 | 3.42 | 4.76 | 4.18 | 2.645 | 3.04 | 4.22 | 5.00% 3.47 | 4.91 | 3.82 | 5.42%] wise 
unpredictable | 2.07 | 3.56 | 3.22 | 4.96 | 3.93 | 3.82 | 3.98 | 2.69 | 3.45 | 3.18 | 3.67 | 4.80 | 3.65 | predictable 
powerless 3.36 | 4.27 | 3.96 | 4.07 | 3.24 | 2.20%} 3.33 | 4.96 | 5.56") 3.16 | 5.31%) 2.89%) 5.96") powerful 
Preference: 

5.11 | 3.02 | 4.18 | 4.29 | 4.06 | 5.07 | 5.16 | 4.36 | 2.51 | 4.16] 2.66 | 5.04 | 2.06 | dislike 


® These means deviate from the scale midpoint (4.00) by at least one scale unit and determine semantic characterizations for each 


of the Ways (see text). 


> For values >4.00, the mean judgment is toward the left-hand term; for values <4.00, the mean judgment is toward the right-hand 


term. 


wise, and predictable; Way 5 (group action 
toward common goals) is active and colorful; 
and Way 11 (contemplation of the rich inner 
life) is untimely, old, unsuccessful, passive, 
tasteless, slow, colorless, weak, calm, and 
powerless. The connotations of the other 
Ways can be determined from Table 2 by the 
interested reader. 


Factor Structure of Value Connotations 


Since we have each of the 13 Ways judged 
against the same 26-scale semantic differential, 
we can determine the structuring of the Ways 
in terms of their connotative meanings. Two 

_ Separate analyses were made, one in which 
intersubject variance was excluded (by using 
the mean judgments on scales, V = 26, as the 
basis for correlations) and one in which 
intersubject variance was included (by corre- 
lating over both individual subjects and 
scales, W = 1430). In both cases 13/13 corre- 
lation matrices, each Way versus every other 


Way, were computed; these correlation 
matrices were factored by the centroid 
method, again using 1.00 communalities, and 
then were rotated toward simple structure by 
the quartimax method.? 

Table 3 gives the quartimax rotation of the 
first four factors for both types of analysis, 
based on means (m) and on individual data 
(i); corresponding factors in the two analyses 
are placed in adjoining columns for ready 
comparison. The correspondence of the factor 
loadings obtained by these two methods is 
extremely high, as can be seen from inspection 


2 To conserve space, only the rotated factor loading 
tables are given here. The original correlation matrices 
and the unrotated centroid loading tables have been 
filed with the American Documentation Institute, 
Order Document No. 6692 from ADI Auxiliary Publi- 
cations Project, Photoduplication Service, Library of 
Congress; Washington 25, D. C., remitting in advance 
$1.25 for microfilm or $1.25 for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 
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of the paired columns in Table 3. Looking at 
the absolute size of the loadings, and at the 
percentage of total variance extracted (bottom 
row), it can be seen that the analysis based 
on means extracts more of the total variance 
on the first two factors while the analysis 
that includes the intersubject variance spreads 
the proportion of variance extracted more 
evenly over the factors, actually extracting 
more on the fourth factor. These four factors 
account for 95% of the total variance in the 
factor analysis of means and for 61% of the 
total variance in the analysis based on indi- 
vidual data. It is clear that in this case elimi- 
nation of intersubject variance by use of 
group means serves to amplify the common 
variance without seriously distorting the 
nature of the underlying factors. 

What are the common factors in the Ways 
as revealed through their connotative mean- 
ings? Here we infer the nature of a factor 
from inspection of those Ways which have high 
versus low loadings on it, and we then check 
this inference by noting which descriptive 
scales order the Ways as does the factor. 
Factor I most clearly divides Ways 9, 11, and 
13 (quiet receptivity to experience, contempla- 
tion of the rich inner life, obedience to cosmic 
purposes) from Ways 5, 6, 7, and 12 (group 
action toward common goals, progress through 
solution of problems, integration of diversity, 
dynamic physical interaction); the semantic 
scales which order the Ways much like this 
factor are uninteresting-interesting, passive- 
active, colorless-colorful, and negative-positive. 
We identify this factor as “dynamism” (the 
negative pole being dynamic and the positive 
pole passive), Factor II has Ways 2 and 4 
toward the positive end (self-sufficiency, 
reflection, and meditation; abandonment to 
Sensuous enjoyment) and Ways 1, 3, and 8 
toward the negative end (preserve the best in 
Society, altruistic affection and concern, 
wholesome enjoyment of simple comforts); 
the most nearly parallel scales are cruel-kind, 
hard-soft, and angular-rounded. We identify 
this factor as “socialization” (again reversing 
the polarity, so that the negative pole is 
highly socialized and the positive pole highly 
egocentric). 

Factor III segregates Ways 1, 2, and 10 
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(preserve the best in society; self-sufficiency, 
reflection, and meditation; dignified self- 
control) from Ways 4, 8, and 13 (abandon- 
ment to sensuous enjoyment, wholesome 
enjoyment of simple comforts, humble obedi- 
ence to cosmic purposes); the related descrip- 
tive scales are hard-soft and cool-warm. We 
identify this as a “control” factor (the positive 
pole being highly controlled and the negative 
uncontrolled or “abandoned”). Factor IV 
shows Ways 4, 5, 7, and 12 as considerably 
positive (abandonment to sensuous enjoyment, 
group action toward common goals, integration 
of diversity, dynamic physical interaction) 
and Ways 2, 8, and 10 as relatively negative 
(self-sufficiency, wholesome enjoyment of 


simple comforts, dignified self-control); parallel — 


scales are active-passive, fast-slow, colorful- 
colorless, and excitable-calm. We more tenta- 
tively identify this factor as “‘venturousness” 
(the positive pole being the more adventure- 
some, and the negative pole the more cautious 
and colorless). It is the least clear of the factors, 
and differs in some respects in terms of means 
and individual data. 

We are now in a position to characterize 
these 13 “ways to live” in terms of their 
locations within this four-factor connotative 
meaning space, and we can check these fac- 
torial characterizations against the verbal 
statements of the Ways. In other words, we 
utilize the interpretive economy provided by 
the factor analysis to describe the 13 value- 
orientations in terms of four bipolar factors. 
Table 3 gives these characterizations in the 
last column, Thus Way 1 (preserve the best 
in society) is characterized as “controlled 
socialization” because it has its highest 
loading on the “socialization” factor and 
subsidiary loading on the “control” factor. 
Way 2 (self-sufficiency, reflection, and medita- 
tion) is characterized as “egocentric controlled 
passivity” because it shows loadings on Factors 
I, I, and III, the highest being on the first. 
Way 8 is “easy passive socialization” because 
its highest loading is on the “socialization” 
factor, its next on “passivity” (nondynamism), 
and it shows residual loading on “noncontrol” 
(hence “easy”). Way 13 (humble obedience 
to cosmic purposes) is characterized as “pure 
passivity (somewhat abandoned)” because of 
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TABLE 3 


QUARTIMAX ROTATIONS OF THE First Four FACTORS FOR CONNOTATIVE MEANINGS OF THE Ways: ANALYSIS 
Basen ON Means (m) versus InprvipvaL Data (i) 


Factor I Factor I Factor II Factor IV 
(Dynamism) Gocinlira tid (Control) (Venturousness) | Factorial Characterization of 
Way Ways 
m $ m t m $ m + 
1 —.09 -12 | —.82 | —.S2 48 -44 | —.16 .22 | “controlled socialization” 
2 -68 50 30 ae: 63 43 +13 | —.08 “egocentric controlled pas- 
sivity” 
3 17 :20 | —.93 | —.77 | —.01 08 -10 | —.04 | “pure socialization” 
4 —.54 | —.04 35 35 | —.26 | —.34 -69 -56 | “abandoned ego-gratifying 
dynamic venturousness” 
5 —.87 | —.26 | —.14 | —.20 | —.12 03 31 57 “socialized, venturous dy- 
namism” 
6 —.95 | —.63 | —.07 | —.03 27 .42 | —.04 .41 | “controlled dynamism” 
7 —.86 | —.32 | —.44 | —.25 01 .09 -04 54 “venturous dynamism” 
8 60 -35 | —.74 | —.68 | —.15 | —.09 | —.01 | —.08 | “easy passive socialization” 
9 96 -82 | —.23 | —.17 .01 | —.03 -09 | —.01 “somewhat socialized pas- 
y. 
10 —.17 | —.07 | —.21 | —.07 +94 -82 | —.11 | —.07 Gir rina socialized con- 
trol” 
11 97 -83 | —.13 | —.06 12 14 .09 | —.01 jpa] ied (slightly con- 
12 —.92 | —.51 17 05 06 07 27 58 “egocentric venturous dy- 
nal 
13 97 -81 | —.13 | —.16 | —.12 | —.05 -10 | —.04 | “pure passivity (slightly 
abandoned)” 
Percentage 55.7 |25.2 | 20.5 | 13.5 | 13.2 | 10.7 5.7 | 11.5 
of Total 
Variance 


extremely high loading on Factor I, yet with a 
subsidiary loading relatively on Factor III. 
The reader can check the other characteriza- 
tions in Table 3 against the factor loadings 
and against the descriptions of the Ways. 


The Semantic Space within Which the Ways Are 
Judged 

The set of 13 value statements used as 
“concepts” in this study are interesting from a 
methodological point of view. In the first 
place, they are a restricted sample of con- 
cepts—all of the same class and all highly 
evaluative (“oughts”). The scales selected for 
this study provide ample opportunity for at 
least the three general factors usually ob- 
tained, “evaluation,” “potency,” and “ac- 
tivity,” to appear. But do the scale factors 
show the effects of interaction with the 
Particular set of concepts being judged and 
differ from those typically obtained with 
‘wider samples of concepts? In the second 
Place, these “concepts” are extremely compli- 
cated verbal constructions—running 100 words 


or more in length and obviously shifting in 
semantic content from phrase to phrase. Do 
subjects respond to quite different facets of 
these messages, and hence tend to cancel each 
other out toward meaninglessness (and an 
unclear factor structure), or do they distill 
from each complex the connotative redun- 
dancy, and hence reinforce each other toward 
consistent meaningfulness (and a clear factor 
structure)? 

To get at the factor structure of the semantic 
scales, we begin by correlating each scale 
with every other scale across the 13 concepts, 
generating a 26/26 matrix of intercorrelations. 
Again, as was the case with the concept 
factorizations, we make two separate analyses, 
one based on the means of subject judgments 
given in Table 2 (where N for the correlations 
is 13) and the other based on summations 
across both individual subjects and concepts 
(where N for the correlations is 715). These 
correlation matrices are then factored, using 
Thurstone’s centroid method with communal- 
ities fixed at 1.00, and rotated toward simple 
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TABLE 4 


Quartaax ROTATIONS OF THE First Four FACTORS FOR SEMANTIC SCALE RELATIONS: ANALYSIS BASED ON 


Means (m) versus INDIVIDUAL DATA (i) 


‘actor II Factor III Factor IV 
(Successfulness) (Predictability) (Kindness) @) 
Scales 
m + m $ m $ m + 
interesting-uninteresting —.96 —.76 —.15 —.10 —.08 -02 18 
untimely-timely -97 -79 —.02 .02 10 -00 —.22 
new-old —.81 —.48 — .43 —.40 13 —.01 —.18 
kind-cruel +06 —.25 34 30 —.92 —.74 -00 
hard-soft —.54 — .36 —.02 —.21 .82 -68 —.10 
unsuccessful-successful .98 85 —.08 —.03 .00 —.07 —.15 
active-passive —.91 —.72 —.32 —.47 18 20 pp 
true-fi —.96 —.85 -20 .09 —.14 —.10 10 
tasteless-savory -92 75 -30 Fyi 18 16 —.13 
good-bad —.79 —.79 .33 -22 —.49 —.30 —.05 
feminine-masculine -76 57 25 .34 —.56 —.49 -03 
slow-quick +84 59 -45 59 —.26 —.21 —.05 
stable-changeable 40 01 84 76 —.30 —.18 — .06 
angular-rounded —.02 160 —.30 112 «83 48 —.37 
cool-warm 18 32 31 12 85 64 -01 
curved-straight —.43 +22), —.70, -=—15. =.35  =—.28 -46 
colorful-colorless —.80 =—.68 —.56 —.48 —.09 —.16 — .03 
negative-positive .95 -81 —.02 -02 27 -22 -06 
unusual-usual -66 ll .03 .07 -36 .11 —.45 
ugly-beautiful 46 53° -.09  —.07 -85 -62 -06 
strong-weak —.93 —.85 -08 —.08 31 19 —.22 
important-unimportant —.93  —.83 .25 09 01 =—.03 —.14 
excitable-calm -1 —.40 —.64 -—71 +23 18 -08 
foolish-wise +85 4 -4 -.25 .21 „17 -06 
unpredictable-predictable —.20 .04  —.89 ~.68 -29 «03 — 18 
powerless-powerful -96 400 =. 11 05) OSAR) ay +16 
Percentage of Total 56.8 38.4 15.8 11.7 19.7 10.8 3.3 3.4 


Variance 


structure, using the quartimax method? 
Again, as in the factorization of concept 
matrices above, it was immediately apparent 
from inspection that the correlations obtained 
from mean judgments were significantly 
higher than those obtained with intersubject 
variance included in the analysis. 

Table 4 presents the loadings on the first 
four rotated factors, the analysis based on 
means (m) and that based on individual data 
(i) appearing in paired columns. Inspection 
of the figures for percentage of total variance, 
in the bottom row of this table, confirms the 
impression from the correlation matrices that 

*Only the rotated factor loading tables are given 
here to conserve space, The original correlation matrices 
and the unrotated centroid loading tables have been 
filed with the American Documentation Institute. 
Order Document No. 6692 from ADI Auxiliary Publi- 
cations Project, Photoduplication Service, Library of 
Congress; Washington 25, D, C., remitting in advance 
$1.25 for microfilm or $1.25 for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress. 


the analysis based on means sharpens the 
common or shared factor structure and ac- 
counts for a larger portion of the total variance 
(by virtue of eliminating the intersubject 
differences). Only three factors account for 
92% of the total variance in the case of means; 
three factors account for 61% of the total 
variance in the case of individual data—which 
is still sizable, of course. Inspection of the 
values in the paired columns indicates that the 
two analyses yield exactly the same pattern of 
loadings. Hence it is again true that no change 
in the nature of factors is introduced by chang- 
ing methods of analysis, 

The semantic space of connotative meanings 
generated when these value statements (the 
Ways) are judged is clearly not the same as 
that obtained when more varied samples of 
concepts are used. Although Factor I 1$ 
clearly “evaluative” (timely, successful, true, 
good, positive, and wise versus untimely, 
unsuccessful, false, bad, negative, and foolish), 
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it also includes what are usually independent 
dimensions—‘‘potency” (strong-weak and 
powerful-powerless), “activity”? (active-pas- 
sive, quick-slow), and “receptivity” (savory- 
tasteless, interesting-uninteresting, colorful- 
colorless). In other words, when different 
“ways to live” are being judged by American 
college students, good, strong, active, and 
stimulating (as opposed to bad, weak, passive, 
and dull) form a single, overwhelmingly 
dominant factor in terms of which such 
value-orientations are discriminated. This 
prime factor is probably best defined by the 
single scale, successful-unsuccessful (with 
loadings of .98 and .85, respectively, in the 
two analyses). Comparing these results with 
those of other factor analyses (cf. Osgood, 
Suci, & Tannenbaum, 1957, Ch. 2), then, we 
have clear evidence for concept-scale interac- 
tion. In the case of value statements (the 
Ways) being judged as concepts, by American 
students, “evaluation,” “potency,” “activity,” 
and “receptivity” fuse together as a single 
“successfulness” factor. 

Two other factors are clearly defined in this 
analysis, even though their weights are about 
4 that of the first factor. Factor II can be 
identified as a “predictability” dimension 
from the nature of the scales that load most 
highly on it: predictable, stable, calm, colorless, 
slow, old, straight, and wise versus unpredict- 
able, changeable, excitable, colorful, quick, 
new, curved, and foolish. Factor III appears 
from the scales that load most highly on it to 
be a “kindness” dimension: kind, soft, warm, 
rounded, beautiful, and feminine versus cruel, 
hard, cold, angular, ugly, and masculine, 
Factor IV accounts for very little of the 
variance (3% in both analyses) and is not 
readily interpretable. Since this was an 
orthogonal analysis, and since the first three 
factors together account for 92% (mean data) 
or 61% (individual data) of the total variance, 
we may conclude that the semantic space used 
to characterize the connotations of these 13 
“ways of life” is structured along three 
independent dimensions by American college 
students—“‘successfulness,” “predictability,” 
and “kindness.” It will also be noted that 
these scale factors are similar in meaningful 
content to the first three factors derived 


from the Ways, which were “dynamism,” 
“control,” and “socialization.” 

Prediction of Value Preference from Value 
Connotation 


In this study value preferences and value 
connotative meanings have been measured 
independently, in the sense that subjects indi- 
cated their preferences by numbers from 7 
(“like very much”) to 1 (“dislike very much”) 
on the margins of the “Ways” sheets after 
they had filled out semantic differentials. To 
what extent can one predict value preferences 
from the clearly identified connotative meaning 
factors? 

To quantify the relation we must project 
the preferences and connotative meanings of 
the 13 Ways in to the same space, and this is 
possible because the same subjects made 
both types of judgments. We use the semantic 
space determined by interscale correlations, 
which, it will be recalled, was quite exhaus- 
tively described by three factors—‘“success- 
fulness,” “predictability,” and “kindness.” 
To locate the Ways in this space, we compute 
the factor scores for each; these are given in 
Table 5. It can be seen, for example, that 
Way 1 scores slightly negative on Factor I 
(slightly successful), considerably positive on 
Factor II (quite predictable), and slightly 
negative on Factor III (slightly kind). The 
first column in Table 5 gives the mean prefer- 
ence ratings of the Ways (taken from the 
bottom row of Table 2). To the extent that 
connotations predict preferences, the factor 
scores (taken either singly or multiply) should 
correlate with the preference judgments. 

These correlations were computed both 
for the mean preference ratings (V = 13) 
and for the preference ratings given by 
individual subjects (V = 55 X 13, or 715). 
The single and multiple 7’s obtained in both 
cases are given in the lower portion of Table 5. 
It can be seen that in terms of both means 
and individual data, the first factor (“‘success- 
fulness”) correlates highly (—.93 and —.66) 
with expressed preferences (Note: the negative 
correlation simply reflects the way the factor 
scores are ordered; the actual relation is such 
that high “successfulness” connotation is 
associated with high preference). The other 
two factors display practically no relation to 
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preference (.09 and —.05 with “predictability” 
and —.18 and —.04 with “kindness”). As 
might be expected in this case, the multiple 
r’s (.96 and .66) indicate that the addition of 
Factors II and III adds little or nothing to 
the prediction of preferences. We can conclude 
that the dominant first factor of the semantic 
space within which the value-statements (the 
Ways) are discriminated by the Stanford 
students is the one that predicts value prefer- 
ences, and that therefore it is, indeed, an 
evaluative factor. 


Discussion 


Preferences among various “ways to live” 
for American college students prove to be 
very stable in the cultural sense. This was 
shown by the fact that the mean preference 
ratings obtained here correspond almost 
perfectly with those obtained earlier by Morris 
for other groups of American college students. 
However, in Morris’ earlier studies the correla- 
tions between Ways, taken across individual 
subjects, were uniformly low and the factor 
structures accounted for only a small part 
(approximately 34%) of the total variance, 
This is one respect in which judgment of the 


TABLE 5 
MEAN PREFERENCE RATINGS AND FACTOR SCORES ON 
SCALE MATRICES FOR Ways 


Factor Scores of Ws 
(from analysis in Table 4 based on means) 
eat eae S FPE A) 


M 

Ways Prelec I pia m 
(Unsuccessful) Credit-  (Unkind) 
1 5.11 —0.585 1.994 —0.453 
2 3.02 0.758 0.664 2.077 
3 4.18 —0. 103 0.630 —2.030 
4 4.29 —0.130 —2.006 0.278 
5 4.06 —0.534 —0.804 —0.264 
6 5.07 —1.663 0.296 0.671 
7 5.16 —0.965 —0.361 —0.590 
8 4.36 0.442 0.242 —1.288 
9 2.51 1.517 —0.031 —0.116 
10 4.16 —0.310 1.175 1.194 
11 2.66 1.191 0.029 0.317 
12 5.04 —1.215 —1.207 0.419 
13 2.06 1.598 —0.623 —0.216 


I/P = —.93 Rigs 
II/P = .09 ip = 08 
Ill/P = —.18 IN/P = — 04 

1, I, I/P = .96 I, II, MP = 66 


C. E. Oscoop, E. E. WARE, AND C. Morris 


Ways against the semantic differential has 


provided clarification: it has accounted for 
almost all of the total variance and it has 
indicated that preferences are strongly related 
to an evaluative factor, which for American 
college students generally corresponds to the 
amount of “successfulness” attributed to the 


alternative Ways. This is especially true when 


means are used (r = .93), When individual 
data are used, the r of .66, while still high, 
shows that preference and “successfulness” 
may at times diverge. (For a further discussion 
of the relation of preference ratings to evalu- 
ations see Morris, 1956b, pp. 163-171.) 
Another respect in which this semantic 
analysis provides clarification is the picture 
one gets of the connotative framework within 
which various “ways to live” are considered. 
Whether one looks at the space generated by 
the intercorrelations among the Ways them- 
selves or at the space generated by the inter- 
correlations among the semantic scales, 
essentially similar connotative factors are 
sharply etched. Statements of “ways to live” 
are most clearly discriminated in terms of their 
“dynamism” (Ways analysis) or “successful- 
ness” (scale analysis); it is this connotation 
that seems to come through most intensely 
when subjects read the value statements. 
Quite independently, subjects discriminate 
among the value statements in terms of how 
“controlled” (Ways analysis) or “predictable” 
(scale analysis) the ways of life seem to be; 
according to these subjects, there can be 
“dynamic” Ways that are relatively “con 
trolled” (e.g, Way 6, progress through 
realistic solution of problems) and “dynamic” 
Ways that are relatively “uncontrolled” oF 
“unpredictable” (e.g., Way 4, abandonment 
to sensuous enjoyment). Third, and again 
independently, subjects discriminate among 
value statements on the basis of their “sociali- 
zation” (Ways analysis) or “kindness” (scale 
analysis). These three general attributes 


Rae ee 


account largely for the differences people see — 


in the connotative meanings of the alternative 
ways to live. 

How adequate a sample of conceivable 
“ways to live” is the set of 13 used in these 
studies? The factoring of Ways in terms of 


their connotative meanings sheds some light | 


on this question, and it becomes most evident 


| 
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when we construct a three-dimensional model 
from the data for means given in Table 3. 
This model, shown in Figure 1, distributes 
the 13 Ways in the space according to their 
loadings on the three factors. We can think of 
each factor as dividing the space into two 
regions (or hemispheres) established by a 
hyperplane passing through the origin at 
right angles to the factor. The three factors 
slice our spherical space in three different 


ways, and, with respect to each way, we can 
ask if the distribution of Ways is equable. 
The “dynamism” factor divides the space into 
upper and lower halves, in terms of the model, 
and it can be seen that there are about as 
many Ways above as below this hyperplane. 
The “socialization” factor divides the space 
into left- and right-hand halves, and we see 
that only Ways 2 and 4 (self-sufficiency, 
reflection, and meditation; abandonment to 


DYNAMISM 


PASSIVITY 
Fic. 1. Three dimensional representation of the ways. 
Ways to Live* i 
1. preserve the best 8. simple comforts 
2. self-sufficiency 9. quiet receptivity 
3. altruism 10. self-control 
4. sensuous abandonment 11. contemplation _ 
5. group action 12. physi interaction 
6. progress through realism 13. humble obedience 
7. integration 


"See Table 1 and Varieties of Human Value, pp. 15-19, for more complete description. 
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sensuous enjoyment) fall in the right-hand, 
“egocentric” region. The “control” factor 
divides the space into front and rear halves, 
as the model is viewed, and it is apparent 
that only Way 4 (abandonment to sensuous 
enjoyment), and to a slight extent Way 8 
(wholesome enjoyment of simple comforts), fall 
in the “noncontrol” or “abandonment” region. 

It is clear, then, that the connotative 
meaning space for “ways to live? is not 
uniformly populated by the Ways treated in 
this and Morris’ earlier studies. Specifically, it 
is empty of conceptions of “the good life” 
which stress extremes of “egocentrism” and/or 
“noncontrol,” We can imagine Ways that 
might fit these criteria—“The only way to 
live which is both rational and maximally 
gratifying is narcissism; one should live so as 
to appreciate and glorify the self, since Ego is 
the only true reality...” or “The goal of 
life should be social, emotional, and intellectual 
anarchy; each individual is an isolated island 
in fact and should be in spirit, always resisting 
social conformity, emotional dependence, and 
intellectual tyranny... .” But as soon as we 
do try to imagine Ways falling in these regions, 
it becomes apparent why they are not likely 
to appear in the value-orientations of organized 
groups of humans. The establishment and 
maintainance of societies almost of necessity 
implies some degree of both socialization and 
control over the individual. Hence conceptions 
of “the good life” that characterize societies 
do not include narcissism or complete anarchy. 
On the other hand, for the purposes of meas- 
uring values, it would be wise to include 
value statements like the foregoing to “round 
out” the space. The model also suggests 
another possible inadequacy. At least for 
American college students, Ways 5, 6, and 12, 
on the one hand, and Ways 9, 11, and 13, on 
the other, are connotatively redundant 
(undifferentiated) within the system of 26 
scales used here, 

However, all of what has been said so far 
in this discussion takes no account of either 
individual or cultural differences, Magnitudes 
of correlations, of factor loadings, and of 
Percentages of total variance accounted for by 
factors were found to decrease sharply when 
interindividual variance was included in the 
analysis. Although this, in part, reflects the 


greater reliability of mean judgments as 
compared with individual judgments (and 
thus decrease in the error variance), it also 
may reflect individual differences in the 
assessment of values. One of the important 
directions which research on the connotative 
meanings of the Ways should take in the 
future is into individual differences—in 
demographic status (age, sex, socioeconomic 
level, etc.), in personality, in temperament, and 
the like. 

The lack of cross-cultural comparative 
data also places limits on the generality of our 
findings. Although Morris (1956b, Ch. 3) has 
found evidence for common factors in value 
preference, despite differences in the prefer- 
ential ordering of the Ways, for cultures as 
disparate as India and the United States, we 
cannot assume that this will hold for the 
connotative framework within which such 
value-statements are interpreted. In fact, it 
seems unlikely that the dominance of a 
“successfulness” factor (combining evaluation, 
potency, activity, and receptivity) would hold 
for certain other cultures, such as, say, India. 
And would subjects drawn from other cultures 
display the same clustering of Ways according 
to connotative meanings, (even though 
differently preferred), as shown in Figure 1 for 
American college students? Would the tight 
bundles composed of Ways 5, 6, and 12 versus 
Ways 9, 11, and 13 prove to be more widely 
discriminated, say, by Moslem Arabs? Com- 
parative cross-cultural study of value prefet- 
ences and connotative meanings is another 
direction for future research on values— 
research that could have significant bearing 
on international communication and under- 
standing. 

Finally, there are three matters of methodo- 
logical interest that deserve mention. First, 
the results of the factor analysis of scale 
relations make it clear that factors derived 
from more “representative” samples of con- 
cepts are not necessarily independent, an 
hence visible, when some specific subset of 
concepts is judged. The semantic scales us 
in this study were selected as representing 
seven independent factors obtained in & 
previous analysis based on 100 subjects 
judging 20 varied concepts against 76 scales; 
in the present study we find a quite different 


ConNOTATIVE MEANINGS OF HUMAN VALUES 73 


factor structure, for the same scales and 
type of subject. Concept/scale interaction 
thus holds for groups of related concepts as 
well as for single concepts, and this may 
mean that we must devise different semantic 
differentials for different classes of concepts— 
value concepts, political concepts, personality 
concepts, esthetic concepts, and so on. Second, 
we have found that analyses based on correla- 
tions over means and analyses based on corre- 
lations over individual subjects yield almost 
identically the same factors. Although this 
relationship need not always be the case 
(indeed, one can arrange hypothetical data in 
such a way that it would not be true), it does 
increase the likelihood that comparisons made 
in the past by Osgood and his collaborators 
between factor analyses of these two types are 
feasible. Third, the fact that we have obtained 
clear-cut factor structures in both Ways and 
scale analyses, despite the length and com- 
plexity of the “concepts” being judged, 
clearly indicates that complex verbal state- 
ments of this sort can be studied effectively 
with the semantic differential. 


SUMMARY 


Standard statements of Morris’ 13 “ways 
to live” were judged against a 26-scale form 
of the semantic differential by 55 Stanford 
college men. These subjects also indicated 
their preferences among these “ways to live.” 
The average ranking of preferences for the 
Stanford group was almost identical with that 
for other American college student samples 
studied by Morris and his associates. Factor 
analyses of the connotative meanings of these 
“ways to live,” as measured by the semantic 
differential, yielded a reasonably clear, four- 
factor system: “dynamism,” “control,” “so- 
cialization,” and perhaps ‘“venturousness.” 
Factor analyses of the semantic scale relations 
yielded three clear factors that accounted for 
large portions of the total variance: “success- 


fulness,” “predictability,” and “kindness.” 
Two types of factor analyses, one based on 
correlations over means and the other on 
correlations over individual subjects, were 
made, and for both the concept and the scale 
factorizations identical factors were obtained. 
It was found that the preferences for “ways 
to live” could be predicted from the factor 
scores on the first connotative factor (“success- 
fulness”) to the extent of a correlation of .93 
(for average preferences) and .66 (for indi- 
vidual preferences). Substantive and methodo- 
logical implications of these findings are 
discussed. 
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REDUCTION OF COGNITIVE DISSONANCE BY 
SEEKING CONSONANT INFORMATION: 


J. STACY ADAMS? 
Stanford University 


N a person holding an opinion on 
a matter of importance is exposed 
to an authoritative communication 


advocating a contrary opinion, cognitive 
dissonance and a need to reduce the dissonance 
are produced (Festinger, 1957). Several 
means of dissonance reduction are possible: 
The person may change his opinion to conform 
more closely with the communication; he 
may attempt to persuade the communicator 
to alter his opinion; he may distort the content 
of the communication; he may discredit the 
communicator; he may reduce the importance 
of his opinion; or he may attempt to obtain 
support for his opinion. It is with the last of 
these modes of reducing dissonance that we 
are concerned in this study. 

Previous studies have shown this mode in 
operation, Festinger, Riecken, and Schachter 
(1956) found that a group of people, in whom 
strong dissonance had been produced by the 
disconfirmation of their beliefs, began to 
recruit adherents to their cause. Their efforts 
to obtain social support were in marked 
contrast to their behavior prior to the onset of 
dissonance, when they were reluctant to take 
in new members and reticent in releasing news 
about their beliefs to the press. Maccoby 
(1959) found evidence of seeking consonant 
social support in two field experiments con- 
ducted with mothers. Her data show that 
mothers in whom dissonance was greatest, as 
a result of exposure to a contrary communica- 
tion on child training, were most likely to 
have discussed the topic of the communica- 
tion with like-minded persons. In an experi- 
mental study of the effects of group discussion 
on counteracting propaganda, Brodbeck (1956) 
showed that persons in whom dissonance had 
been produced displayed a preference to 

1 The research was supported in whole by United 
States Public Health Service Grant 3M-9122C. 

„>I wish to express my thanks to Leon Festinger for 
his continuing interest, support, and advice during 
the study; to A. K. Romney for his assistance in a 


preliminary study; and to Nathan Maccoby for givi 
the post-experimental talk, tad 
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engage in discussion with nonexpert peers 
whose beliefs were consonant with theirs, thus 
obtaining social support and restoring confi- 
dence in their beliefs to prepropaganda levels. 
Because of the experimental design, Brodbeck 
was not able to show that seeking social 
support increased as a function of magnitude 
of dissonance. 

In a different, but related, vein Ehrlich, 
Guttman, Schonbach, and Mills (1957) found 
that persons with postdecision dissonance re- 
sulting from the purchase of an automobile were 
highly selective in the car advertisements they 
noticed and read. Specifically, they were more 
likely to notice and read ads about their own 
make of car than about other makes, and were 
more likely to read ads about their car than 
were car owners with old cars. In this case, 
seeking consonant information rather than 
seeking social support was the approach taken 
to reduce dissonance. In a further study Mills, 
Aronson, and Robinson (1959) found dis- 


$ 


sonance-reducing selectivity to positive infor- 


mation among students who had made a de- 
cision to take one of two alternative types of 
examinations. J 
From Festinger’s theory and the foregoing 
data the following two hypotheses are derived 
for testing. First, a person in whom dissonance 
has been produced by exposure to an author 
tative communication espousing a contrary 
point of view on an important subject, is more 
likely to seek authoritative information 0? 
the subject than a person who has been ex- 
posed to a consonant communication. This 
prediction is expected to hold only if othet 
means of reducing dissonance are relatively 
less available than seeking informational 
support. Furthermore, the prediction $ 
expected to hold only if the communication 1$ 
authoritative and if its subject matter 1$ 


important to the subject. If the communica- — 


tion were not perceived as at least minimally 


authoritative (trustworthy, credible, expert), 


or, similarly, if the subject of the person's 
opinion were not important to him, little 
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dissonance should result. The hypothesis 
receives some, though unsatisfactory, support 
from the field studies of Festinger et al. 
(1956) and Maccoby (1959). In the first a 
control group not experiencing dissonance was 
lacking. In the second study it was only shown 
that seeking consonant support varied as a 
function of dissonance; it was not shown that 
seeking support per se varied with dissonance. 

The second hypothesis concerns the selec- 
tivity of information-seeking. A person in 
whom dissonance has been produced by 
exposure to a contrary communication seeks 
support, if such is available, from a source 
that is perceived to agree with his opinion. If 
the person’s opinion is not substantially 
changed by the communication, he should 
seek support from a source perceived to agree 
with his precommunication opinion. If, on 
the other hand, his opinion is changed sub- 
stantially to conform with the communica- 
tion, he should seek support from a source 
perceived to agree with his new opinion. This 
is derived from the fact that, once the person 
has partly reduced dissonance by adopting 
the opinion advocated by the communication, 
new dissonance results, produced largely by 
the psychological incompatibility of the new 
opinion with cognitions related to the original 
opinion. In sum, changing opinion produces 
new cognitive dissonance that can effectively 
be reduced by obtaining support for the new 
opinion. This hypothesis receives direct and 
indirect support from the data of Ehrlich et al. 
(1957), Maccoby (1959), Mills et al. (1959), 
and Brodbeck (1956). However, in the Brod- 
beck study, which is the most relevant to the 
present experiment, subjects were constrained 
to communicate either with a subject who held 
consonant opinions or one who held dissonant 
opinions, whereas in the present study subjects 
were free to choose whether they wished 
information or not, in addition to being free to 
select the source of information. In addition, 
the discussants in Brodbeck’s study were 
nonexpert peers, whereas in the experiment 
reported here the sources available to the 
subjects were experts. 


METHOD 


‘The subjects were 100 mothers whose youngest 
child was in the first grade of five Palo Alto, California, 


public schools. Fifty-three of these, randomly chosen, 
were experimental subjects; 47 were control subjects.* 
Each subject was put through the following experi- 
mental procedure in her home by one of four trained 
interviewers.‘ 

1. The subject was asked a series of opinion ques- 
tions on the relative importance of hereditary and 
environmental factors in child development, in addition 
to a few background questions and some opinion 
questions on the issue of permissiveness in child rearing. 
The key question for the purpose of the experiment 
was a 10-point scale that followed eight questions on 
the issue of heredity-environment: 

On the whole, would you say that a child’s be- 
havior is mostly learned or would you say that it is 
mostly inborn? Show your opinion by putting a 
cross (X) at either end, or any place in between, on 
the line below. 

Mostly Mostly 

Learned /}—-+—+-—+—+-+—+—_++H Inborn 
Following this was the question, “How sure are you of 
the answer you gave to the last question?” to which 
five answers ranging from “very sure” to “not at all 
sure” could be given. 

2. The subject was then asked to listen to an 18- 
minute tape-recorded talk by a fictitious expert, Dr. 
Wilkins, emphasizing either hereditary or environ- 
mental factors in child development. Both the heredi- 
tary and the environmental talk were attributed to the 
same fictitious expert, for whom a brief biographical 
sketch was given, and were recorded by the same 
person. Of approximately the same duration and 
making about the same number of points, the talks 
differed only in the point of view advocated—pro- 
heredity or pro-environment. All factual data referred 
to had origins in the published literature; no fictitious 
information or misinformation was given. 

Experimental subjects were given whichever of the 
two talks was contrary to the opinion they expressed 
on the 10-point scale, For this purpose the subject was 
considered as having an environmental opinion if she 
checked any of the five intervals on the left of the 
scale (‘Learned” pole) and as having a hereditary 
opinion if she checked any of the five intervals on the 
right of the scale (“Inborn” pole), If the subject placed 
her check precisely in the middle of the scale, a syste- 
matic scheme determined which talk she heard, Control 
subjects were given whichever of the two talks agreed 
with their opinion. 

3. Following the talk, the subjects were told: 

You have heard Dr. Wilkins’ views on “Heredity 
and Environment.” There are, of course, other views 
that are just as authoritative. Now, we’d like to 
give you the opportunity to discuss some of the 
things you heard in the talk, if you want to. Of 


peice asin Ace TREE SRE SDT is PES tS RAP 
2A few mothers gave birth to a child between the 
time the sample was drawn and the study was con- 
ducted. Three mothers were mistakenly treated as 
experimental subjects instead of as control subjects. 

4 Approximately equal numbers of experimental 
and control subjects were assigned at random to each 
interviewer. The interviewers were unaware of the 
specific hypotheses under test. 
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course, it’s up to you. We thought some of you 
might want to have a discussion, so two people have 
agreed to talk further on the subject. Both are 
psychologists with interests relevant to the talk you 
heard and will be at Stanford in about two weeks.” 

A brief biographical sketch of each psychologist, 
emphasizing his expertness, was then read. Except for 
their names, the speakers were differentiated only by 
the title of a noted article they had allegedly written. 
For the pro-heredity speaker the article was entitled, 
“The Importance of Inheritance and Maturational 
Factors in the Development of Children”; for the pro- 
environment speaker it was “The Role of Training and 
Environmental Factors in Child-Rearing.” Both 
speakers were to give their talks in different rooms, but 
on the same evening, at the same time, and in the same 
building. 

The subject was then asked: “Since for planning 
purposes we need to know approximately how many 
people will attend each of the talks, would you tell me 
if you want to attend either of the talks and, if so, 
which one?” The interviewer recorded the subject’s 
preference for either (or neither) speaker and, in 
addition, recorded all comments made by the subject 
that would permit an estimation of the degree of the 
subject’s desire to attend the talk of her preference. 

4. At the conclusion of the session the subject was 
asked for her opinion again and her confidence in it. 
This was done under the guise of the interviewer's 
checking over the original questionnaire and discovering 
she had “failed” to record the subject’s answers to two 
questions. At this point the interviewer apologized to 
the subject and asked her, “Would you please answer 
these now?” 

Approximately 2 weeks following the experimental 
sessions, a talk on the question of hereditary versus 
environmental factors in child rearing was actually 
given for those subjects who availed themselves of the 
opportunity. At this time the object of the experiment 
was also explained. Both the talk and the description 
of the experiment were received with considerable 
interest. However, as is the case in many field experi- 
ments in which there are no constraints, very few 
subjects attended the talk. 

The subjects, taped talks, and experimental pro- 
cedure described above were well-suited to a test of the 
hypotheses. Dissonance was assured by a topic for the 
talk that would be important to the mothers of 6- or 
7-year-old children, The taped talks, however, were 
not so rabidly partisan on either side as to be easily 
discredited. Nor were they so persuasive as to produce 
substantial opinion change, thus minimizing opinion 
change and maximizing information-seeking as modes 
of dissonance reduction. On the other hand, the talks 
were explicit enough in the points of view advocated 
so that distortion of content would be difficult. At- 
tempts on the part of the subjects to reduce dissonance 
by trying to alter the speaker’s opinion were, of course, 
impossible, and seeking social support from the inter- 
viewer was effectively denied by the latter’s not inter- 
acting with the subject. 


RESULTS AND Discussion 


With the exception of four experimental 
subjects and two control subjects, all subjects 
expressed an initial opinion on the environ- 
mental (“Learned”) side of the scale. Among 
the 94 subjects indicating an environmental 
opinion, 34 experimental and 32 control 
subjects checked one of the three most extreme 
intervals on the scale. It is with these subjects 
that the main analysis will proceed, as this 
maximizes the opportunity of revealing 
differences in the groups as a function of the 
experimental manipulation. Since these sub- 
jects all had environmental opinions, it follows 
that experimental subjects heard a hereditary 
communication and that control subjects 
heard an environmental communication. 


Success of the Experimental Manipulation 


Though the communications were selected 
because they would not be so persuasive as to 
produce large amounts of opinion change, it 
might be expected that if the communications 
were indeed different and productive of 
dissonance in the experimental group, the 
latter would show a significantly greater 
amount of opinion change in the direction of 
heredity than would the control group. For 
this purpose the number of subjects in each 
condition changing opinion in a hereditary 
direction may be compared. Sixteen experi- 
mental subjects as contrasted to five control 
subjects thus changed. By chi square test the 
difference is significant at the .01 level.’ As 4 
further indication of whether or not dissonance 
was produced among experimental subjects 
changes in confidence following the communi- 
cation may be observed. If the hereditary 
communication was productive of dissonance 
for experimental subjects, more experimental 
subjects should show decreases in confidence 
in their opinion. The data bear this out, 3$ 
six experimental subjects but only one control 
subject showed a decrease in confidence. The 
difference is significant at the .06 level by 
Fisher’s exact test but is difficult to interpret 
because all subjects but one showing decreases 
in confidence also changed opinion slightly- 


* All chi square tests of significance are two-tailed. 
The null hypothesis is rejected when p> .05. 
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We may, nevertheless, conclude that the 
experimental manipulation was successful. 


Information Seeking 


The first hypothesis to be tested stated, 
briefly, that persons in whom dissonance had 
been produced by a communication would be 
more likely to seek information than persons in 
whom a communication had produced no 
(or little) dissonance. An appropriate test 
consists of a comparison of the number of 
subjects in each condition who expressed 
some or no desire to attend a talk, irrespective 
of any consideration of the alleged point of 
view of the speaker. Each subject was therefore 
tated by two coders independently as to 
whether she expressed some or no desire to 
attend one of the talks. The coders had no 
knowledge of the hypothesis being tested and 
agreed on 65 of their 66 ratings. The single 
disagreement was resolved by the experimenter 
against the hypothesis. The data in Table 1 
show that experimental subjects were more 
likely than control subjects to want to attend 
an authoritative talk. The difference between 
the groups is significant at the .02 level by 
Fisher’s exact test. It is clear, therefore, that 
under conditions of free choice subjects attempt 
to reduce dissonance by seeking information. 


Selection of Information Source 


When the subjects were asked if they wished 
to attend the talk, they indicated which of 
two speakers they would prefer to hear—one 
of them by implication holding a hereditary 
view and the other an environmental view. 
It was hypothesized that the experimental sub- 
jects would elect to hear a talk consonant with 
their opinion following the communication. 
Since only one subject changed opinion 
entirely, going from an environmental to a 
hereditary opinion, the data will be analyzed 
only for subjects who initially had extreme 


environmental opinions, who either maintained 
their initial opinion or changed opinion by a 
small amount only, who wished to hear a talk, 
and expressed a preference for one of the 
speakers. To test the hypothesis we may 
compare the observed distribution of prefer- 
ences of experimental subjects against an 
equally split distribution which would be 
expected if no differential preferences existed. 
This is shown in Table 2. The difference is 
significant at the .06 level only. 

Though a greater preference for a consonant 
(i.e., environmental) talk among experimental 
subjects is suggested by the data, is this 
greater preference a function of the amount of 
dissonance produced? In answer we may 
compare the preferences of experimental and 
control subjects. The data are given in Table 
3. No significant difference in the preferences 
of the two groups exists; both clearly prefer to 
hear the consonant, environmental talk. 
This is contrary to the hypothesis and to 
previous findings (Brodbeck, 1956; Ehrlich 
et al., 1957; Maccoby, 1959; and Mills et al., 
1959), but an explanation is available if we 
assume that the subjects who had no desire to 
hear a talk were those who had no, or at 
least very little, dissonance. Thus, all or 
most subjects in Table 3 have some dissonance. 
As a result, we may expect that both groups 
should tend to avoid exposure to the disson- 
ance-increasing hereditary talk and seek 
exposure to the consonant, environmental 


TABLE 2 
OBSERVED AND HYPOTHETICAL PREFERENCES OF 
EXPERIMENTAL SUBJECTS FOR PosTCOMMUNICATION 
TALK 


Environmental Hereditary 
Talk Talk 


Observed preference 22 8 
Equal preference 15 15 


= .06 by chi square. 


TABLE 3 


PREFERENCES FOR POSTCOMMUNICATION TALK BY 
EXPERIMENTAL AND CONTROL SUBJECTS 


TABLE 1 
DESIRE To ATTEND TALK 
Desire Experimental Control 
Some desire 33 25 
No desire 1 T 


? = .02, by Fisher’s exact test. 


Environmental Hereditary 
Talk Talk 


Experimental Subjects 22 8 
Control Subjects 19 5 
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talk, and that the preferences of the two groups 
should not differ greatly. Nevertheless, since 
the experimentals should have more disson- 
ance on the average than the controls, we 
would expect them to show a somewhat 
greater relative liking for the environmental 
talk. The tendency is actually in the opposite 
direction, as Table 3 shows. 


SUMMARY 


An experiment was conducted to test two 
hypotheses about the reduction of cognitive 
dissonance by seeking information. The 
hypotheses were: (a) A person in whom dis- 
sonance has been produced by exposure to a 
communication advocating an opinion contrary 
to the person’s is more likely to seek informa- 
tion than a person exposed to a compatible 
communication, and (b) a person in whom 
dissonance has been produced by a contrary 
communication tends to seek information 
from a source agreeing with his opinion. 

The opinions of 100 mothers on the impor- 
tance of hereditary and environmental factors 
in child rearing were ascertained by personal 
interview; they were then exposed to a tape 
recorded, authoritative communication espous- 
ing a hereditary or an environmental point of 
view. Following the communication they 
were asked if they wished to attend a talk by 


an expert allegedly favoring an environmental 
view or one favoring a hereditary view. For 
approximately half the subjects the tape 
recorded talk was contrary to their opinion; 
for the remaining subjects the talk agreed 
with their opinion. 

The results supported the first hypothesis. 
The second hypothesis was not sustained, 
though most subjects indicated a preference 
to hear a talk by a consonant source. A possible 
explanation for the failure of the data to 
support the second hypothesis was given. 
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ONSERVATIVE social philosophers from 
Hobbes through Ortega y Gasset 
have had a profound suspicion of the 

motives and capacities of the “mob.” Their 
dislike for the mass has been premised on a 
series of assumptions concerning human 
nature. One of the most fundamental of these 
axioms has been the conviction that man is 
innately aggressive. Hence, in the opinion of 
these theorists, the instinctive cruelty and 
barbarism of mankind must be kept under the 
strict control of a more enlightened “‘élite”’—a 
class, who, through long training in the ways 
of civilization, has inhibited its own aggressive 
impulses. 

This jaundiced view of human motivation 
has been buttressed by the Freudian belief 
that aggression is a universal outcome of the 
“thanatos instinct.” In Freud’s view—at 
least, in one of its several versions—all men 
strive not only for libidinal satisfactions, but 
also for return to a state of nothingness, of 
“nirvana.” Freud argued that aggression, a 
desire to destroy, was the “natural” energy 
emerging from the death instinct; a drive 
which makes its appearance in infancy and is 
either fused with “Eros,” directed toward the 
destruction of others, or aimed at self-annihila- 
tion. 

On a general level, this view of aggression as 
an inevitable element in the make-up of man 
has been challenged by such writers as Fromm, 
Horney, Allport, and Maslow. These observers 
have put forward the position that aggression 
is the result of specific types of social milieu, 
rather than being an instinctual drive. 

In an empirical fashion, other investigators 
have produced evidence which—at least, 
implicitly—contradicts the conservative Freud- 
lan opinions about aggression. Many an- 
alysts have demonstrated the variability of 
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the level and expression of aggression in 
response to various environmental conditions 
(cf. Weiss & Fine, 1956). Others have con- 
centrated on the child rearing antecedents of 
extremely aggressive antisocial behavior. For 
example, the Gluecks (1950), examining a 
lower class sample, and Bandura and Walters 
(1959), concentrating on a middle class group, 
have independently produced a similar portrait 
of the backgrounds of such individuals. Both 
studies found that the aggressively antisocial 
person emerged from an environment char- 
acterized by parental rejection, familial 
discord, punitive discipline, and inconsistency. 

Perhaps the most comprehensive study of 
the early determinants of aggression can be 
found in the work of Sears, Maccoby, and 
Levin (1957). From carefully executed inter- 
views with 379 New England mothers (and 
other sources of data) these researchers con- 
cluded that aggression in small children— 
defined as “behavior intended to hurt or 
injure someone”—was associated with such 
environmental antecedents as parental per- 
missiveness for aggression, the use of physically 
punitive discipline, and maternal lack of 
self-esteem. These results led the investigators 
to reject the instinctual view of aggression 
and, instead, to affirm: “Today, with our 
better understanding of the ruling influence of 
culture as a source of behavioral universals, 
we are inclined to suspect that the deter- 
mination of which stimuli will arouse rage, 
and the focusing of the emotion into an 
actual desire and intent to hurt others (or the 
self), are products of learning experiences 
begun in early infancy” (p. 222). 

The purpose of this paper is to report a 
study which supports the view that aggression 
is a form of behavior which is learned by the 
child during his early experiences within the 
family. This research, concerned with 174 
boys and their families, differs from certain of 
the previous studies: none of the subjects 
were known delinquents; the sample was 
drawn from a relatively lower class, urban 
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environment; and the information on child 
rearing practices and on the boy’s aggression 
was gathered through direct observation of 
both parents and child over more than a 5-year 
period. Each of these differences is important 
in assessing the significance of the research: 

1. The fact that delinquents were excluded 
from the sample means that this study of 
“normal” aggression is not obscured by a 
simultaneous examination of the causes of 
delinquency. 

2. The fact that the sample is drawn in 
disproportionate numbers from the lower class 
has two advantages: the sample serves as 
counterbalance to those findings which have 
emerged from analyses of primarily middle 
class children; and, at the same time, it 
counterbalances studies of delinquency which 
have usually concentrated on lower class 
samples. 

3. The fact that the material was gathered 
through direct observation eliminates the 
difficulties that hamper studies based on 
interviews, experimental situations, or thera- 
peutic sessions. 


BACKGROUND OF THE RESEARCH 


The opportunity to investigate the genesis 
of aggression came from the Cambridge- 
Somerville Youth Study of the 1930s. This 
extensive project, aimed primarily at the 
prevention of delinquency, has been described 
in numerous other publications (McCord, 
McCord, & Gudeman, 1960; McCord, Mc- 
Cord, & Zola, 1959; Powers & Witmer, 1950); 
consequently, we will outline its history only 
briefly. 


Six-hundred and fifty boys, referred by their 
teachers, ministers, or by social agencies, were chosen 
for Participation in the experiment. At the time of 
selection, the boys averaged 9 years in age and were 
drawn primarily from lower class districts in Cambridge 
and Somerville, Massachusetts. In composition, the 
sample contained a relatively high proportion of 
Catholics and recent immigrants, approximately 25% 
of the families were on relief during the depression, 
and only 4% of the parents had attended college. It 
should be noted, in passing, that these “peculiarities” 
of the sample did not generally affect the results. 
Neither social class, immigration patterns, ethnic 
group, nor paternal occupation was significantly related 
to the child’s aggression. 

Since the subjects were selected from a restricted 
Socioeconomic level, the lack of association between 


the child’s behavior and more general social factors is 
hardly surprising. 

Half of the boys were selected because they were 
judged potentially delinquent, “maladjusted” children; 
the remaining cases were regarded as “normal,” 
nondelinquent boys. 

Between 1939 and 1945, 325 children (composed 
equally of delinquent and normal boys) were given 
medical, educational, and social aid; all of these boys 
were assigned to a counselor who visited and advised 
both the boys and their families. The remaining 325 
cases, closely matched to those who received treatment, 
formed a control group. 

In the process of the study, the boys were subjected 
to physical, psychiatric, psychological, and intelligence 
examinations. Reports on the boys and their families 
were systematically gathered from social agencies, the 
schools, and various community agencies. Detailed 
reports on the behavior of the boy and of his family 
were recorded by the staff after each contact. 

In 1956, the authors and their colleagues initiated a 
re-evaluation of the Cambridge-Somerville project, 
aimed at the solution of three problems: 

1, What effect did the treatment have on the adult 
behavior of the subjects? 

2. What correlations could be detected between the 
early familial experiences of the boys and their later 
behavior as adults? 

3. What relationships could be noted between the 
familial environments and the boys’ behavior in 
childhood? 

As a first step toward resolving these issues, the 
cumulative case histories of the subjects were read by 
a trained staff (who did not have knowledge as to the 
adult behavior of the boys). A rating system concerned 
with 150 aspects of the boys’ families, physical status, 
general social condition, and personality was developed. 
Each category as far as possible was defined in terms 
of observable behavior. 


The Measure of Aggression 


“Aggression” is an amorphous term. It has been 
used to describe such seemingly disparate phenomena 
as gossip, jokes, suicidal tendencies, hostile fantasies, 
as well as directly destructive acts. A simple event— 
slapping a person on the back—can be viewed by one 
observer as a hostile attack, while another individual, 
aware of the context surrounding the action, would 
regard it as evidence of exuberant friendship. Thus, 
the first task in any study of aggression is to define 
the concept in behavioral terms which can be replicated 
by other investigators. For the purposes of this re- 
search, we followed Sears, Maccoby, and Levin in 
defining aggression as “behavior intended to hurt or 
injure someone.” The word “intention” is, itself, 
tricky; it is notoriously difficult to establish the under- 
lying desires which prompt an action. Consequently, we 
sought to limit the term aggression to those acts which, 
objectively, “hurt or injured someone.” 

Fortunately, the staff of the Cambridge-Somerville 
study had ample opportunity to observe the boys’ 
behavior. Counselors visited their homes and neighbor- 
hoods and watched their behavior in relation to parents, 
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friends, neighbors, and siblings. The psychiatrists, 
physicians, and psychologists submitted reports on 
the boys’ reactions under relatively stressful conditions. 
The boys’ teachers and other school officials were 
systematically questioned concerning the children’s 
activities. Camp counselors, social agencies, ministers, 
police, YMCA directors, and employers submitted 
additional information. Not only did the information 
come from diverse sources, but it was gathered over an 
average period of 534 years—thus making it possible 
to observe the boy under different conditions, at 
different ages. 

From these various sources of information, the 174 
boys were placed into one of three categories.* 

Twenty-five boys were consistently and overtly 
aggressive. These individuals participated in the whole 
range of aggressive acts; they were involved in fist- 
fighting, bullying of smaller children, attacks on their 
teachers, or destructive actions in camp or in the 
community. They frequently threatened other children 
with violence—and often carried out their threats. 
In response to almost any form of frustration, they 
reacted with verbal abuse, open rage, and attempts to 
destroy the frustrating object. 

Ninety-seven boys were, by the norms of American 
society, normally assertive. They participated, at times, 
in fights, acts of destruction aimed at other children 
or adults, occasional battles with teachers or other 
community officials, and bullying of weaker children 
or girls, They differed from the openly aggressive 
boys, however, in that their hostile responses were 
sporadic exceptions to the general pattern of their 
lives. Normally, in the face of frustration, they re- 
sponded realistically; sometimes with anger, but 
seldom with aggressive actions, In the classroom, they 
sometimes created disciplinary problems for the 
teacher, but usually they conformed to the standards 
of the school. 

Fifty-two boys were consistently monaggressive. 
Outbursts of rage and directly aggressive attacks were 
exceptionally rare in their lives; and, indeed, in some 
cases, nonexistent. In the classroom, they did not 
actively participate in aggressive disturbances. In 


2 From the original sample of 650, 476 were elimi- 
nated for the following reasons: 

70 died, moved from Massachusetts, or were dropped 
at an early stage from the treatment program. The 70 
matched mates of those who were eliminated from 
treatment group were omitted from the control group. 

255 boys formed the remainder of the control group. 
Because information on them was less extensive (and, 
we believe, less valid), they were dropped from this 
research. 

74 boys were known delinquents who had been 
convicted of serious crimes, These boys were eliminated 
for we did not wish the analysis of aggression to be 
obscured by a simultaneous analysis of the causes of 
crime. 

7 boys from the treatment group were dropped 
either because we possessed insufficient information on 
them or their behavior was so highly contradictory 
that they could not be placed in one of the three 
categories. 


relations with other children, they were placid and 
friendly. They could absorb the usual frustrations of 
childhood—failure in school, loss of a game, discipline 
from an adult—with apparent calm and realism. Thus, 
this categorization of aggression is descriptive not only 
of isolated acts, but of the entire pattern of the child’s 
behavior over a period of 5 years. 

Two raters, independently categorizing the behavior 
of 30 cases randomly selected from the sample, reached 
86.7% agreement in their judgments of aggression. 


The Measures of Family Environment 


Our task was to assess the relevance of the child’s 
early environment in determining whether he became 
“aggressive,” “assertive,” or “nonaggressive.” To 
achieve this goal, the “raw” information gathered by 
the staff in their 5 years of direct observation of the 
families was systematically categorized into a set of 
variables, descriptive of the child’s environment. 
These ratings ranged from such relatively objective 
matters as the occupation of the parents, their educa- 
tion, and ethnic group, through more subjective judg- 
ments concerning emotional relationships within the 
family. Categories were developed concerning the 
parents’ attitudes toward one another, their role in the 
family, their affective relationship with the son, and 
their personality. The parental methods and consistency 
of discipline were considered, as well as the specific 
values which they wished to inculcate. The child’s 
physical status and aspects of his behavior, other than 
aggression, were also evaluated. (It should be noted 
that none of the measures of the child’s physical 
status—the conditions of his birth, neurological or 
glandular disorder, early childhood diseases, body 
deformities, etc.—was significantly associated with the 
level of aggression.) 

Although the specific definitions and reliability of 
each category will be discussed in the following pages, 
certain general characteristics of the ratings should be 
mentioned: 

1. The observations were made in the home of the 
child, over an average period of 534 years, by a number 
of different people (usually, each child was exposed to 
at least three counselors), These observers did not 
know that their material would be used for a study of 
aggression; hence, the possibility of conscious bias or 
of a “halo effect” was minimized. 


3 One general issue deserves to be discussed at this 
point. In the next pages, we will report a number of 
significant relationships between the child’s environ- 
ment and his level of aggression. Can these relationships 
be attributed to a “halo effect”? Critics might reason- 
ably argue that a specific association—say, between 
the father’s level of aggression and his son’s behavior— 
might be illusory and simply the result of the observers’ 
preconceived biases. A counselor (or other observer), 
noting that a boy was highly aggressive, might un- 
consciously tend to report a number of “negative” 
observations about the child’s environment, thereby 
falsifying the “real” situation. 

For four reasons, however, we are convinced that 
the relationship between familial experiences and 
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2, The raters who, between 1956 and 1958, re- 
categorized this information were also unaware that 
the data would be used for a study of aggression and 
were not acquainted with the hypotheses to be tested. 


aggression cannot reasonably be credited to a mere 
“halo effect”: 

A. As we have previously noted, the ratings of the 
home environment have proven predictive ability. In 
previous research, we have found that this set of 
ratings was strongly correlated with adult crime and 
adult alcoholism, Since the general predictive validity 
of these categorizations has been established, there 
would seem to be little reason to question their applica- 
bility to a study of aggression in childhood. 

B. There is some empirical evidence that certain of 
the more obvious biases did not contaminate the data. 
One such bias is the possibility that middle class 
“Yankee” observers would tend to interpret behavior 
solely in terms of middle class standards. One might 
argue, for example, that a middle class person might, 
in watching lower class behavior, “read in” an aggres- 
sive intent, where none was intended—simply because 
the researcher was unacquainted with the customs of 
lower class culture. Similarly, an observer raised in an 
American milieu might tend to distort the purpose and 
significance of the actions of an Italian, Puerto Rican, 
or French-Canadian immigrant. We know that these 
possible biases did not significantly affect the data: 
there were no major differences in the ratings of 
aggression according to the social class of the child 
(of the middle class children, 16% were “aggressive,” 
56% were “assertive,” and 28% were “nonaggressive”; 
of the lower class children, 12% were “aggressive,” 
57% were “assertive,” and 31% were “nonaggressive”). 
Similarly, there were no significant differences in the 
ratings of aggression according to the child’s ethnic 
group (of the sons of immigrants, 11% were rated as 
“aggressive,” 55% as “assertive,” and 34% as “non- 
aggressive”; among the sons of native Americans, 17% 
were regarded as “aggressive,” 56% as “assertive,” 
and 27% as “nonaggressive”). Further evidence 
against an operation of the “halo effect” is indicated 
by the fact that the staff did not rate the more intel- 
ligent boys in a positive fashion; equal proportions of 
intelligent and unintelligent boys were placed in each 
category of aggression. 

C. Moreover, the original reports on both the boy 
and the home came from a variety of individuals (the 
typical case history included reports from an average 
of three counselors, four teachers, a psychiatrist, a 
Physician, a psychologist, and various community 
officials); it would be remarkable if all of these people 
held the same unconscious biases. 

D. And, as we have noted, the possibility that 
conscious biases pervaded the data was largely, perhaps 
entirely, eliminated. The original observers of the 
child and his environment did not know that their 
Teports would be used for a study of aggression. (They 
were aware, of course, that the data would be used for 
a study of delinquency; but, as we have previously 
noted, known criminals have been eliminated from 
this study.) The later raters, who categorized the 
material in 1956, were also unaware that the informa- 


3. These ratings have previously been used in 
follow-up studies of those boys who became criminals, 
alcoholics, and psychotics in adulthood. The tatings 
successfully differentiated between these deviants and 
the nondeviant cases in the sample. Thus, in terms of 
certain forms of deviant behavior, the “predictive 
validity” of the categorizations has been established 
(cf. McCord, McCord, & Gudeman, 1960; McCord, 
McCord, & Zola, 1959). 


RESULTS 


We assumed, as the basic premises of this 
research, that early environmental experiences 
might affect a child’s level of aggression in 
four conceptually distinct ways: 

1. Emotional relationships between the boy 
and his parents would be fundamental in two 
ways: first, by affecting the child’s level of 
frustration (and, thus, indirectly, his aggres- 
sive desires) and secondly, by molding his 
conception of the nature of human interaction, 

2. The parental methods of disciplining and 
controlling the child would affect his willing- 
ness to inhibit whatever aggressive desires he 
felt. 

3. The example of the parents (regardless of 
their direct relationship with the son) would 
affect the son by offering him an immediate, 
intimate model of human reactions to frustra- 
tion. 

4, The degree to which the parents rein- 
forced each other’s values would affect the 
“intensity” with which the son internalized 
parental demands (that is, we believed that 
parents who disagreed on a variety of issues, 
including the behavior which they expected 
of the child would be likely to produce a 
confused, relatively uninhibited child). 

Let us examine the actual effects of each of 
these four general influences. 


The Instigation of Aggression 


Direct parental attacks on the boy— 
whether manifested in physically punitive 
discipline, the frequent use of threats, or 
constant unfavorable comments on the boy’s 
worth—were strongly associated with a high 
level of aggressive behavior. Presumably, 
this kind of parental behavior is highly frus- 


tion would be used to study aggression. Hence, the 
chance that either the original observers or the later 
researchers would consciously allow their own theories 
to contaminate the data was minimized. 
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trating to the boy’s dependency needs; parental 
rejection teaches him early in life that other 
human beings are threatening and aggressive. 
One indication of this relationship between 
parental punitiveness and aggression among 
the sons can be found in an examination of 
parental methods of discipline. Some of the 
parents employed “nonpunitive” means of 
disciplining their children; they reasoned with 
the son, withdrew certain privileges, or indi- 
cated their verbal disapproval. Other parents 
depended on directly “punitive” techniques: 
if the child disobeyed, they responded with 
angry abuse, slaps, or beatings. A significantly 
lower proportion of the nonaggressive boys 
than of the aggressive or assertive children 
were raised by “punitive” mothers. (Through- 
out the paper, the Scott ~ reliability coefficient 
was used to estimate interrater reliability. 
Reliability on judgments of maternal punitive- 
ness was .772, on father’s punitiveness, .581. A 
comparison of the nonaggressive boys to the 
others in terms of maternal punitiveness was 
significant at the .025 level; x? = 5.09; df = 1. 
A comparison of the effects of the fathers’ 
techniques was not significant. Throughout 
the paper, the chi square test, two-tailed, was 
applied.) 

Another indication that parental attacks on 
the child led to more aggressive behavior was 
that those parents who frequently threatened 
the child, who verbally frightened him with 
dire consequences if he disobeyed, were more 


TABLE 1 
PARENTAL METHODS oF DISCIPLINE AND THE 
CHILD’S AGGRESSION 


(in percentages) 
npe | arar | Mater 
Mother’s method | (V = 24) (N = 95) | (N = 49) 
of discipline: 
Punitive 54 48 31 
Nonpunitive 46 52 69 
100 100 100 
Father’s method | «wv =19) | W=75 | (WV = 41) 
of discipline: 
Punitive 58 49 41 
Nonpunitive 42 51 59 
100 100 100 


TABLE 2 


PARENTAL Use oF THREATS AND THE 
CHILD’S AGGRESSION 
(in percentages) 


Frequent use of threats 64 44 32 
by the parents 

Infrequent use of 36 56 68 
threats by the par- — yr fore 
ents 100 100 100 


TABLE 3 


PARENTAL RELATION WITH THE Boy AND THE 
CHILD’S AGGRESSION 


(in percentages) 
Parental Attitudes To- ive Assertive Nor - 
ward the Child w To) a Si siye Boys 
Aecom mother 
p father 5 60 68 
rejecting father 37 19 17 
Rejecting mother 
eo father 47 5 10 ` 
rejecting father 11 16 AS 
100 100 100 


likely to produce aggressive sons than were 
parents who rarely threatened the boy (p < 
.05; x2 = 6.11; df = 2; Scott x reliability = 
-734), 

As one might predict from the preceding 
evidence, those parents who generally disliked 
and rejected their sons were most likely to 
produce aggressive boys. In terms of their 
overall emotional relationship with the boy, 
the parents in the study were described as 
either “affectionate” or “rejecting.” Affec- 
tionate parents were individuals who talked of 
their love for the son, who never voiced overtly 
rejecting statements about the son, and who 
demonstrated (at least, to some extent) their 
regard for the boy through kisses, hugs, gifts, 
proud statements about his accomplishments, 
etc. Rejecting parents, in contrast did not 
hesitate to express a fundamental dislike for 
their sons; they sometimes mentioned that 
they wished that the boy had never been 
born; they treated the boy either with open 
cruelty, neglect, or an erratic alternation 
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between demonstrations of affection and 
blatant dislike. 

As Table 3 demonstrates, the parents of the 
aggressive boys had an emotionally meager, 
cold relationship with their sons: The great 
majority of the aggressive boys, 95%, were 
raised in homes where at least one parent was 
emotionally rejecting; the majority of the 
assertive boys and of the nonaggressive 
children were raised by affectionate parents 
(p < .001; x? = 22.28; df = 1; Scott r relia- 
bility on mothers was .757, on fathers, .710). 

The variables which we have discussed up 
to this point—parental rejection, punitiveness, 
and parental threats—have certain features in 
common. These attitudes and actions are all, 
presumably, direct attacks on the child’s 
sense of security; they all tend to undermine 
the boy’s conception of himself as a person of 
worth and significance; and they all carry the 
implication that the world is a dangerous, 
hostile environment. It would seem reasonable 
to suppose that these influences serve to 
arouse pervasive aggressive tendencies in the 
child; he is not only extremely frustrated by 
these attacks but he is implicitly taught that 
aggression is the “way of the world.” If one 
views these variables in combination, an 
expected pattern appears: the aggressive 
children usually came from environments 
where several “drive-instigating” influences 
were present, while the assertive and non- 
aggressive children typically emerged from 
relatively “positive” homes. 


TABLE 4 


DrivE-InstiGATING VARIABLES AND THE 
Boy’s AGGRESSION* 


(in percentages) 
Number of ive Assertive Noni ive 
“Neestive” Tig S| Bo, 
Factora wew wes) was 
0 20 22 40 
1 0 28 27 
2 28 19 21 
3 44 19 6 
4 4 8 4 
5 4 4 2 
100 100 100 


"Each “negative” factor—maternal rejection, paternal rejec- 
tion, maternal punitiveness, paternal punitiveness, and parental 
use of threats—was given a weight of one. If there was insufficient 
information, the factor was regarded as a “positive” element in 
the child’s background. 


Thus, 80% of the aggressive boys, 50% of 
the assertive boys, and only 33% of the non- 
aggressive boys had been exposed to two or 
more “negative” influences (p < .001; x? = 
15.12; df = 2). It should be noted, in passing, 
that the delinquents (who were not included 
in Table 4) were usually exposed to at least 
three “negative” conditions. 

In summary, this evidence leads to the 
conclusion that: 

1. The aggressive boys and the assertive 
boys were more likely to have been disciplined 
in a punitive fashion by their mothers than 
were the nonaggressive boys ( < .025). 

2. The aggressive boys were more likely to 
have been frequently threatened by their 
parents than were the assertive boys and the 
nonaggressive boys (p < .05). 

3. The aggressive boys were more likely to 
have been raised by parents who rejected them 
than were the assertive or the nonaggressive 
boys (p < .001). 

4. The three types of boys differed signifi- 
cantly in the total number of “aggression- 
instigating” elements in their backgrounds 
(p < .001). 


The Direct Instillation of Controls 


The preceding evidence suggests that a 
high level of aggression was originally insti- 
gated by parental rejection, punitiveness, and 
threats. Another general set of factors—the 
extent to which the parents exerted immediate 
controls over the boy’s behavior—also played 
an important part. We assumed that such 
factors as the kind of demands imposed on 
the child, the degree to which his actions were 
supervised by the parents, and the extent to 
which the parents imposed their values in a 
consistent fashion could serve to modify the 
boy’s expression of aggressive urges. Even a 
child who was plagued by intense aggressive 
urges, we assumed, might not express these 
desires overtly, if he was subjected to consist- 
ent controls by his parents. Several measures 
revealed the influence of this aspect of the 
boy’s environment in the molding of aggressive 
behavior. 

The parents differed strikingly in the kinds 
of demands which they imposed on their sons. 
In some families, the parents placed “high 
demands” on the child for conformity; the 
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son was expected to clean his room, do chores 
around the house, perform well in school, be 
polite, attend Sunday school—in other words, 
to conform to the dominant customs of 
American society. In other families, the 
parents did not require this high degree of 
“polite behavior.” In some cases, they simply 
ignored the child; in other cases, they de- 
manded a high level of performance in one 
area of behavior (e.g., wearing clean clothes) 
but gave the child complete freedom in other 
ways (e.g., behavior in the classroom). 

The nonaggressive boys were more likely 
than either the assertive or the aggressive 
children to emerge from families that placed 
high demands on them for “polite,” ‘“‘respon- 
sible” behavior (p < .005; x? = 9.09; df = 1; 
Scott r reliability = .535). 

As in the kind of demands imposed on the 
child, the families also differed in the degree to 
which they directly supervised the boy. Some 
of the parents (or other adults) kept a close 
eye on the boy’s activities; they did not 
hesitate to advise the child or to intervene 
directly in behavior which they disapproved; 
they wished to know where he was and what 
he was doing most of the time. They were 
anxious to hear reports of his behavior away 
from the home. Other parents provided little, 
if any, supervision over the child; he was 


TABLE 5 


PARENTAL DEMANDS ON THE CHILD AND THE 
Boy’s AGGRESSION 


(in percentages) 
Parental Aggressive Assertive Non: ive 
patios w=) (w= 56) woth) 
High 16 24 45 
Low 84 16 55 
100 100 100 
TABLE 6 


SUPERVISION OF THE CHILD AND THE 
Boy’s AGGRESSION 
(in percentages) 


S iz ssive Assertive Nox ive 
jupervision A wiy j w idh E: ie x 
Present 52 61 75 
Absent ' 48 39 25 

100 100 100 


TABLE 7 


MOTHER’S CONTROL OF THE Boy AND 
Boy's AGGRESSION 
(in percentages) 


ressive Assertive Nonaggres- 
Mother’s Control joys: Boys sive Boys 
(N = 25) (VN =96) (N = Si) 


Overcontrolled 40 27 53 
Normal control 16 40 31 
Subnormal control 44 33 16 


10 10 100 


allowed to roam at will, to choose his own 
activities and friends. The nonaggressive 
children were significantly more likely to have 
been closely supervised (p < .05; x? = 4.0; 
df = 1; Scott x reliability = .710). 

Another measure, a rating of the mother’s 
control of the child, was also related to the 
level of aggressive behavior. The boys’ mothers 
were divided into three categories: those who 
“overcontrolled” their children (who de- 
manded that the boy should be close to the 
mother at all times and submit completely to 
her direction); those who exerted “normal 
controls” over the child (women who were 
concerned with the boy’s activities but gave 
him freedom in certain areas, such as the 
choice of playmates or the type of clothes 
which he would wear); and those who “sub- 
normally” controlled the boy (such mothers 
frequently were neglecting’ or unconcerned 
with the child’s behavior). The aggressive 
boys tended to be either “overcontrolled” or 
“subnormally controlled” by their mothers; 
the assertive boys were raised in approximately 
equal proportions by each type of mother; 
and the nonaggressive children were most 
often reared by mothers who “overcontrolled” 
or “normally controlled” them (p < .005; x? 
= 15.46; df = 4; Scott r reliability = .771). 

One other element, the consistency of 
parental discipline, deserves attention. The 
majority of the parents in the study disciplined 
their children in an “erratic” fashion: for the 
same action, the child was sometimes severely 
punished while, at other times, he was allowed 
(or even encouraged) to continue in his be- 
havior. In these inconsistent families, parental 
discipline was strongly affected by swings in 
mood or by the immediate situation. Other 
parents were relatively consistent; the child 
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could be sure that if he was caught committing 
a certain action, he would receive punishment. 
The nonaggressive children were raised by 
parents (particularly by mothers) who used 
consistent methods of discipline, while the 
converse tended to be true of the assertive 
and the aggressive boys (interrater reliability 
on ratings of parental consistency was .772, 
for the mothers, and .581 for the fathers, 
Comparison of nonaggressive boys to others 
in terms of consistency of maternal discipline 
was significant at the .01 level; x? = 7.03; 
df = 1; in terms of paternal consistency, the 
comparison was not significant). 

It is clear that those variables which deal 
with direct parental control over the child— 
the kind of demands that they imposed on 
the son, the degree of supervision that they 
gave him, and the consistency of their disci- 
pline—have an important relation to the 
boy’s aggression. These influences in the 
child’s environment are only theoretically 
distinct from those factors which are related 
to the instigation of a high degree of aggression. 
In practice, for example, one would expect 
that parental rejection would usually be 
combined with a lack of direct parental 
supervision. It is useful, therefore, to view the 
impact of these conditions in combination. 
In Table 9, one can observe the interaction 
between one factor that seems to be primarily 
related to the instigation of aggression (paren- 
tal punitiveness) and another factor that 
appears to be principally related to the 


TABLE 8 
Consistency or PARENTAL DISCIPLINE AND 
Y’S AGGRESSION 
(in percentages) 
Angressive Assertive | No 
wets) | a Sn ope 
Consistency of 
mother’s discipline; 
Consistent 28 42 52 
Erratic 72 58 48 
100 100 100 
Consistency of 
father’s discipline 
istent 28 30 40 
Erratic 72 70 60 
100 100 100 


TABLE 9 
PARENTAL PUNITIVENESS, PARENTAL DEMANDS ON 
THE CHILD, AND THE Boy’s AGGRESSION 
(in percentages) 


Assertive} "onag- 
sive Boys} Boys | Stessive 
W = 24)| CW = 95) POS, 


The families imposed hii 
Semanis on the child bei 


th parents were puni- 4 3 8 
tive 

One parent was punitive 8 14 18 
but the other parent 
was nonpunitive 

Neither parent was puni- 4 6 19 
tive 


instillation of controls (parental demands on 
the child), 

Both factors seem to be relevant to the 
expression of aggression. The aggressive boys 
tended to come from homes in which at least 
one parent was physically punitive and the 
demands on the child were low: 67% of the 
aggressive boys, 55% of the assertive children, 
and only 22% of the nonaggressive cases 
emerged from this type of family (b < .001; 
xX = 17.74; df = 2). Nonaggressive behavior 
appears to be associated with a lack of puni- 
tiveness on the parents’ part and with the 
imposition of high demands. Of those children 
who came from homes in which both parents 
were nonpunitive and placed high requirements 
on their children’s behavior, 56% were non- 
aggressive. At the opposite extreme (homes in 
which both parents were punitive and did not 
impose high demands), only 15% were non- 
aggressive (p < .05; x2 = 6.63; df = 2). In 
general, if demands were high, a lack of 
aggressive behavior resulted; if demands were 
low, parental punitiveness tended to increase 
the level of aggressive behavior. 

In summary, this material has indicated 
that: 

1. The aggressive and the assertive children 
were more likely to have been raised by 


FAMILIAL CORRELATES OF AGGRESSION 87 


parents who did not impose high demands on 
them than were the nonaggressive boys (p < 
.005). 

2. The aggressive boys and the assertive 
boys were less likely to have been closely 
supervised by their parents than were the 
nonaggressive boys (p < .05). 

3. More often than the assertive boys, the 
nonaggressive children were raised by mothers 
who “overcontrolled” them while the aggres- 
sive boys tended to be reared by mothers who 
either “overcontrolled” or ‘subnormally” 
controlled their sons (p < .005). 

4, The nonaggressive boys (and to a lesser 
extent, the assertive boys) were more likely 
to have been disciplined by their mothers in a 
consistent fashion than were the aggressive 
children (p < .01). 

5. The aggressive boys and the assertive 
boys were more likely than the nonaggressive 
boys to have been raised by parents who 
were both punitive and placed low demands 
on their sons (p < .001). 


The Parental Model 


It would seem reasonable to expect that 
highly aggressive parents would be likely to 
produce aggressive children. Presumably, 
a boy exposed to a father who consistently 
reacted with aggression to crises in his home, 
his work, or his neighborhood would tend to 
emulate the parental model. Surprisingly, the 
data do not confirm this “common-sense” 
prediction. Although there was a slight 
tendency for aggressive parents to produce 
aggressive children, this trend was not statis- 
tically significant at the 5% level. 

Three measures—ratings of the father’s 
aggression, his reaction to specific crises, and 
his role in the family—were used to estimate 
the effect of the parental model on the boy’s 
behavior. 

In terms of the same definition that was 
applied to their sons, the fathers in the study 
were divided into “aggressive,” “assertive,” 
and “nonaggressive” men. Fifteen percent of 
the aggressive boys were raised by aggressive 
fathers, but so were 18% of the assertive 
children, and 10% of the nonaggressive 
children. At the opposite extreme, 10% of the 
aggressive boys, as opposed to 6% of the 
assertive and 18% of the nonaggressive cases, 


were reared by nonaggressive fathers (Scott 
r reliability = .619). 

Judgments were also made concerning the 
reactions of the fathers to specific crises: loss 
of a job, failure to gain a promotion, loss of 
“face” in the community, legal involvements, 
etc. Most of the fathers responded “realisti- 
cally” by attempting to correct the situation 
in an effective fashion. Other men reacted in 
an “escapist” way: they went on a drinking or 
eating binge or withdrew into total passivity, 
A third set of fathers responded with overt 
“aggression” (verbal or physical attacks on the 
source of their frustration). 

Although one might expect that the aggres- 
sive fathers would “teach” their sons to 
respond in a similar fashion, this did not 
occur: almost equal proportions of the aggres- 
sive, assertive, and nonaggressive boys were 
raised by fathers who responded aggressively 
to crises (17%, 16%, and 11%, respectively). 
“Realistic” fathers, on the other hand, tended 
to have nonaggressive children; 33% of the 
aggressive boys, 39% of the assertive, and 
56% of the nonaggressive boys were raised by 
such men (p < .05; x? = 3.84; df = 1; Scott 
v reliability = .678). 

The fathers differed, too, in terms of their 
role within the family. Some fathers were 
“dictators”; they made all the decisions in the 
family and ruled the home with an iron grip; 
others were passive, ineffectual men, who left 
the guidance of the family to the mother; 
and still others were “leaders”—they made the 
basic decisions for the family, but only after 
consultation. Although one might expect that 
the aggressive boys would come most often 
from “dictatorial” fathers, this did not prove 
to be the case. Nineteen percent of the aggres- 
sive boys, as compared to 8% and 7% of the 
assertive and nonaggressive boys were reared 
by dictatorial fathers; 38% of the aggressive 
boys, as opposed to 51% of the assertive and 
55% of the nonaggressive were raised by 
fathers who played the role of the leader; the 
remainder came from passive fathers (Scott + 
reliability = .632). 

One can draw a general conclusion from 
these data: paternal aggression did not 
significantly heighten the son’s aggression; 
rather, it would appear that any form of 
paternal “deviance” whether expressed in 
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aggression, “escapism,” or an eccentric role 
within the family—tended to be associated 
with a higher level of aggression in the son. 
The nonaggressive boys were more often 
reared by nonaggressive fathers who responded 
to crises in a realistic fashion and acted the 
role of “leader” within the family; one can 
assume, therefore, that these children were 
offered a parental model of responsibility and 
relatively stronger control over deviant im- 
pulses. The aggressive children often lacked 
such a paternal model of inner control; it would 
seem to be this deficiency in their background 
(rather than a specific example of aggression) 
that may have contributed to their unre- 
strained behavior. 

Two other pieces of data support this 
interpretation. Many of the fathers in the 
study participated in various forms of deviant 
behavior (criminality, alcoholism, psychoses, 
desertion, sexual promiscuity, etc.). Only 
rarely was this behavior directly aggressive in 
nature. The deviant men, however, were 
relatively uninhibited, escapist, and irrespon- 
sible; they failed to offer their sons an example 
of moderation. The aggressive boys and the 
assertive cases were more often raised by such 
deviant fathers; 48% of the aggressive boys, 
52% of the assertive cases, and 39% of the 
nonaggressive children were reared by deviant 
fathers (p < .05; x? = 3.84; df = 1; Scott x 
reliability = .827). 

The strength of religion within the family 
may also have played a part. The nonaggres- 
sive boys were more often reared in homes 
which were “strongly” religious (that is, 
families in which the parents attended church 
at least once a week and gave other signs of 
devotion to their faith). Forty-eight percent of 
the nonaggressive children, as compared to 
34% of the assertive and only 21% of the 
aggressive boys, were raised by strongly 
religious mothers (p < .10; x? = 5.68; df = 3; 
Scott x reliability for father = -856, for mother, 
-676). The specific religious affiliation of the 
family did not significantly affect the level of 
the boy’s aggression, The strongly religious 
parents were people who, in general, valued 
inner control, conformity, “humility,” and 
nonaggressiveness more than did those who 
were simply nominal members of a religious 
faith. Thus, one can assume that the strongly 


religious parents more often offered a model of 
inner control to their sons. 

These data suggest that a parental model of 
conformity and inner control (whether ex- 
pressed in realistic reactions to crises, the 
fulfillment of a mature role within the family, 
strong adherence to a religious faith, or a lack 
of obvious deviance) may be associated with a 
low level of aggressive behavior on the son’s 
part. More strikingly, the material indicates 
that an aggressive parental model is, in itself, 
no more productive of aggression in the child, 
than are other “milder” forms of parental 
deviance, irresponsibility, or escapism. 

Even these tentative generalizations should 
be further qualified. Within the sample, there 
is a general correlation between parental 
deviance and such other factors as parental 
rejection, punitiveness, and lack of control of 
the child. Thus, the slight association between 
parental deviance and the boy’s aggression 
may well be simply a reflection of these other 
factors. 


Parental Agreement on Values 


One final set of conditions within the home, 
the amount of agreement between the parents 
concerning their basic values, has bearing on 
understanding the origins of aggression, In 
some homes, the parents were in constant 
conflict about each other’s attitudes and 
actions or were fundamentally dissatisfied 
with their place in the family; in other words, 
such parents tended to undermine, rather than 
to reinforce the attitudes of one another. We 
assumed that a child raised in such an environ- 
ment would tend to become confused con- 
cerning parental expectations and would be 
likely to “write off” parental demands. Thus, 
it would seem reasonable to assume that these 
children would be less governable by their 
parents, and, specifically, less inhibited in the 
expression of their aggressive urges. 

The data support this interpretation. The 
parents’ relation to one another, as well as 
their immediate relation to the child himself, 
had bearing on the boy’s level of aggression. 
As a group the aggressive boys more often 
came from homes that were disordered by 
parental conflict, lack of respect of the parents 
for each other, disagreement within the 
family concerning methods of child rearing, 
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and antagonism between the child and his 
relatives. The aggressive boys were, therefore, 
often raised in a milieu in which one parent 
undermined (or directly attacked) the other; 
an environment which was not conducive to 
the establishment of a consistent set of values. 

A variety of measures, all tapping the same 
phenomena, suggested that the aggressive 
children came from homes which were funda- 
mentally disrupted by parental disagreements. 
The specific figures are presented in Table 10. 

In summary, these conclusions can be 
drawn from the material: 

1. The aggressive and the assertive children 
came more often from families characterized 
by “intense” conflict between the parents— 


“constant bickering, recriminations, and, at 


times, overt physical attacks (p < .005; 
x? = 9.16; df = 1; Scott r reliability = .647). 

2. The aggressive and the assertive children 
more often came from families in which one 
parent held the other in “low esteem.” These 
were homes in which the parents expressed, 
with complete openness, their lack of respect 
for the other person (p < .005; x? = 8.65; 
df = 1; Scott x reliability = .833). 

3. The aggressive boys more often came 
from homes in which both parents were 
dissatisfied with their role in life. In these 
families, the mothers complained of the 
burdens of being a housewife and mother and 
the fathers expressed their disgust with their 
job, their role in the home, and their relation 
to society (p < .02; x? = 8.37; df = 2; Scott 
m reliability = .679). 

4. The aggressive boys were more likely 
to come from homes in which the parents 
disagreed concerning methods of child rearing 
(p< ay x? = 3.67; df = 1; Scott = reliability 
= .620). 

5. The aggressive boys were more likely to 
come from homes in which the parents were 
not demonstratively affectionate toward one 
another (p < .05; x? = 4.64; df = 1; Scott 
m reliability = .631). 

Taken together, these various measures of 
the “milieu” of socialization reveal that the 
aggressive boys (and, to a lesser extent the 
assertive children) were reared in families 
that had been disordered by a high degree 
of parental conflict and antagonism. The 
nonaggressive boys, in contrast, more often 


TABLE 10 


PARENTAL RELATIONSHIPS AND THE 
Boy’s AGGRESSION 


(in percentages) 
Anger’ Assertive | Nonageres- 
Boys sive Boys 
Parental conflict: | (V = 21) | W = 83) | (WV = 43) 
Intense 38 36 12 
Some 38 27 47 
None 24 37 31 
100 100 100 
Parental esteem: | (N = 20) | (V = 81) | (V = 38) 
High mutual 50 52 79 
esteem 
One parent dis- 25 20 13 
respected the 
other parent 
Mutually low 25 28 8 
esteem — — — 
100 100 100 
Parental role sat- | (N = 20) | (V = 84) | W = 42) 
isfaction: 
Both parents 45 26 12 
dissatisfied 
One parent dis- 5 25 19 
satisfied 
Neither parent 50 49 69 
dissatisfied — — — 
100 100 100 
Parental conflict | (N = 21) | (V = 76) | W = 41) 
about the child: 
Intense conflict 24 10 10 
Some conflict 38 30 24 
No conflict 38 60 66 
100 100 100 
Parental affection | (V = 20) | (N = 83) | (WV = 39) 
for each other: 
Affectionate 15 38 H 
Sporadically 40 27 36 
affectionate 
Indifferent 10 8 10 
Antagonistic 35 27 10 
100 100 100 


emerged from environments in which the 
parents respected one another, agreed on basic 
issues (including the ways in which their sons 
should be raised), and were relatively satisfied 
with their lot in life. 

A natural effect of such parental conflict 
would be that the son tended to reject his 
family and the values that the parents repre- 
sented. As one would expect, the aggressive 
boys more often chose a “reference group” 
(delinquent gang, neighborhood group, etc.) 
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outside of their home. Thirty-two percent of 
the aggressive children, as contrasted to only 
14% of the assertive boys and 6% of the non- 
aggressive boys consciously oriented their 
behavior to an “outside” reference group 
rather than to their parents’ wishes (p < .02; 
x? = 6.58; df = 1). 

This tendency for the aggressive children 
to separate themselves from their families 
was also revealed in their attitudes towards 
their siblings. Only 42% of the aggressive 
children had a cooperative, friendly relation- 
ship with their siblings. In contrast, 70% of 
the assertive boys and 66% of the nonaggres- 
sive boys had an affectionate regard for their 
brothers and sisters (6 < .02; x? = 5.56; 
df = 1). 


CONCLUSIONS 


In this paper, we have argued that three 
major elements in a child’s environment affect 
his subsequent level of aggressive behavior: 

1. The emotional relationship between the 
boy and his parents (the degree to which the 
family treats the child in a punitive, threaten- 
ing, or rejecting fashion). 

2. The direct controls exerted by the par- 
ents over the boy’s behavior (the degree to 
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which the parents guide the child in a con- 
sistent, rigorous fashion). 

3. The pervasive milieu of the family (the 
extent to which the parents reinforced or 
undermined the values and significance of oni 
another). 

A fourth factor—the conformity or devia 
of the parental model—may also be weak 
associated with the child’s level of aggression. 

The data presented in this study—data 
gathered from the direct observation of 174 
nondelinquent boys and their families over a 
period of more than 5 years—indicate that 
“aggressive,” “assertive,” and “nonaggres- 
sive” boys emerged from radically different 
environments. 

These contrasts in background can perhaps 
best be viewed as summarized in Table 11. 

These variables in the child’s environment 
are not, of course, completely independent. 
We have treated particular conditions in the 
home, such as parental rejection of the child, 
as primarily related to the instigation of ag- 
gressive desires. Clearly, however, a factor 
such as parental rejection has other effects: 
it offers the child a model of aggression and it 
presumably weakens the child’s desire to 
follow the demands of his parents. Certain 


BACKGROUNDS OF AGGRESSIVE, ASSERTIVE, AND NONAGGRESSIVE Boys 


Environmental Conditions Aggressive Boys 
Parents’ emotional rela- Releceing 
tionship with the boy 
Grout use of threats 
Tnstillation of direct | Overcontrolled or sub- 
controls normally controlled by 
mothers 
Low demands on the 
child 
Lack of supervision 
Inconsistency in disci- 
pline 
Parental model Socially deviant 
Parental relationshi A high d i al 
to each other i Bah tee Sa ae 
Lack of mutual esteem 
Dissatisfaction with role 
in life 
Unaffectionate 


= In comparison to the background of nonaggressive boys. 


Assertive Boys Nonaggressive Boys 
Affectionate Affectionate 
Relatively punitive* Nonpunitive 
Infrequent use of threats Little use of threats 
Normally or subnormally Normally or overcon: 
controlled by mothers trolled by mothers — 
Low demands High demands 
Relatively little supervi- Firm supervision of th 
sion of the child® child J 
Relatively inconsistentin Consistent in discipline 
iscipline* 4 
Relatively deviant* Socially conformist 


Highs g Sete of general Low mrieceatee of general 


Relatively low mutual High mutual esteem 
Relatively dissatisfied Satisfied with role 

with role* N 
Affectionate Affectionate i 
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other variables, however, do not seem to have 
this many-faceted influence. The degree to 
which the child has been subjected to adult 
supervision, for example, would seem to have 
direct relevance to controlling the boy’s ex- 
pression of aggression, but little or no bearing 
on the original establishment of aggressive 
desires. 

To construct a comprehensive picture of the 
origins of aggression, one must view these 
various conditions in the child’s environment 
in terms of their total impact. In an attempt 
to develop a general portrait of the origins of 
aggression, 10 variables in the background of 
every child in the study were categorized. 

First, the children were divided according 
to the number of “aggression-creating” factors 
in their environment. For this purpose, five 
variables were used: paternal attitude toward 
the boy, maternal attitude toward the child, 
paternal punitiveness, maternal punitiveness, 
and parental use of threats. For each factor 
that was present in the child’s home, a “nega- 
tive” score of one was given. Thus, the child 
who was rejected by both parents, who was 
treated punitively by both parents, and was 
subjected to frequent threats received a score 
of “—5.” At the opposite extreme, the child 
who had not been subjected to any of these 
influences received a score of “0.” If we did 
not possess sufficient information on a particu- 
lar variable, this factor in the child’s back- 
ground was, de facto, considered as “positive.” 

Secondly, five variables relating to the 
general milieu of the home, the parental model, 
and the controls exercised by the parent over 
the child were considered—in other words, 
those factors which were presumably related 
to modifying the expression of aggressive 
desires. These conditions included parental 
supervision of the child, maternal consistency 
of discipline, parental demands on the boy, 
the strength of the maternal religion, and the 
‘level of general conflict between the parents. 
Each factor was given a “positive” weight of 
one; thus, the boy who came from an environ- 
ment characterized by maternal consistency, 
firm supervision, a strong religious faith, high 
demands, and parental agreement was awarded 
a score of “+5.” If we did not possess suffi- 
cient information on a particular condition, 
the factor was considered as “negative.” 


Since we knew that each of these 10 vari- 
ables was independently related to aggression, 
our supposition was that their combined in- 
fluence would account for much of the varia- 
tion in aggressive behavior. We assumed, in 
particular, that the aggressive boys would be 
most likely to emerge from an environment 
that simultaneously instilled a high level of 
aggressive urges and failed to provide the 
conditions for controlling aggression. The 
nonaggressive boys, in contrast, would be 
expected to come from homes characterized 
by a high level of controls and a low degree of 
aggression-creating influences. As Table 12 
indicates, these expectations were generally 
fulfilled. 

In a general sense, therefore, we can argue 
that the aggressive boys were most likely to 
have been raised by parents who (a) treated 
the boy in a rejecting, punitive fashion; (b) 
failed to impose direct controls on his be- 
havior; (c) offered him an example of deviance; 
and (d) were often involved in intense conflict. 

The nonaggressive boys came from a strik- 
ingly contrasting environment—a home in 
which they were (a) treated in an affectionate, 


TABLE 12 
THE COMBINED Impact OF ENVIRONMENTAL 
CONDITIONS ON CHILDHOOD AGGRESSION 
(in percentages) 


Aggres- 
sive Boys! 8 
QV = 25) | = 91) iy’ = 52) 


An environment in which 
the parents im “high” 
controls over son’s be- 
havior (+ 3, + 4, and + 5) 
and the home was: 
Highly conducive to ag- 4 1 2 
gression (—5, — 
Moderately conducive to 4 11 13 
aggression (—3, 2) 
Not conducive to ag- 8 24 39 
gression (—1, 0) 
An environment in ite pae 
the parents imposed few 
controls over the son’s be- 
havior (+ 2, + 1, and 0) 
and the home was: 
Highly conducive to ag- 
gression (—5, —4) 
Moderately conducive to 68 26 15 
ion (—3, —2) 
Not conducive to aggres- 12 27 27 
sion (—1, 0) — 
100 100 
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nonpunitive manner; (b) guided by a consist- 
ent set of controls; (c) exposed to examples 
of social conformity; and (d) reared by affec- 
tionate, satisfied parents. 

The assertive boys generally resembled the 
nonaggressive children in that they were 
reared by relatively affectionate, nonthreaten- 
ing parents; thus, they were not victimized 
by the rejection characteristic of the families 
of aggressive boys (51% came from families 
which were nonpunitive and affectionate as 
opposed to only 20% of the aggressive boys). 
They resembled the aggressive boys, however, 
in that their parents (a) often failed to impose 
consistent controls; (6) were deviant models; 
and (c) often were in open conflict. (64% of 
the assertive boys came from an environment 
where these conditions were present.) 

It would appear, therefore, that the aggres- 
sive boys had a high level of aggressive urges 
and were uncontrolled; the nonaggressive 
boys had a low degree of aggressive desires 
and, in addition, were well-controlled; the 
assertive boys had a few basically aggressive 
urges but were relatively uncontrolled indi- 
viduals. 

It is encouraging to note that the results of 
this study, despite methodological differences, 
are in fundamental agreement with the con- 
clusions which have emerged from prior re- 
search. For example, Bandura and Walters 
(1959) have studied the childhood antecedents 
of adolescent aggression. Their sample, drawn 
from a group of middle class, antisocial, delin- 
quents differed strikingly from the boys studied 
in this research (in age, in social class, in 
geographic area, and in illegal behavior). 
Yet the results are basically similar; the anti- 
social, aggressive boys came from homes in 
which they were rejected and treated in an 
inconsistent fashion. 

Again, the results achieved by Sears, Mac- 
coby, and Levin (1957) in their recent investi- 
gations of childhood aggression bear a basic 
resemblance to the findings of this research. 
Among other relationships, Sears and his 
colleagues found that aggression in childhood 
was associated with the use of physical punish- 
ment, low esteem of the mother for the father, 
a high degree of permissiveness for the expres- 
sion of aggression, disagreement between the 
parents, and dissatisfaction on the mother’s 
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part with her role in life. And, indeed, these 
same relationships appeared in this present 
study—even though the sample used in this 
research came from a generally lower social 
class than did the Sears’ sample and the meth- 
ods of observation in the two studies were quite 
different. 

These various relationships between the 
nature of a child’s family and his level of 
aggressive behavior could be interpreted in 
at least four different ways: 

1, Some analysts might contend that aggres- 
sion on the child’s part actually provokes 
certain “typical” responses from the parents. 
By this interpretation, the child’s innate ag- 
gressive behavior evokes rejecting attitudes, 
punitiveness, and inconsistency from the 
parents; the behavior itself is regarded as 
creating new conditions within the family. 
While it seems highly probable that the child’s 
behavior reciprocally affects parental attitudes 
towards him, it would be stretching this inter- 
pretation unduly to argue that the child’s 
behavior also causes such diverse phenomena 
as conflict between the parents, lack of super- 
vision, parental deviance, and low esteem on 
the parents’ part for each other. 

2. Some might argue that the parents’ 
behavior and the child’s aggression are pro- 
duced by common genetic factors and that the 
apparent environmental relationships are illu- 
sory. Within the confines of this study, it is 
impossible to evaluate this opinion directly. 
In the absence of definitive genetic evidence, 
however, it seems most reasonable to view 
this interpretation as a rather remote pos- 
sibility. 

3. Some social scientists, particularly sociol- 
ogists, might well contend that both the 
parents’ and the boys’ behavior result from 


similar vectors in the social environment. — 


These analysts would place special emphasis 
upon the similar experiences undergone by 
the parents’ and the boy in their relations 
with the “outside” world: the school, the 
occupational environment, the neighborhood, 


or the ethnic culture. Since there was no inde- | 


pendent relation between the child’s level of 
aggression and the family ethnic group or 


social class, this explanation does not appear _ 


to do justice to the findings. 
4. Although the previous interpretations 
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cannot be dismissed, it seems most reasonable 
to assume that the familial environment is an 
independent variable affecting the child’s 
behavior. The evidence presented in this study 
(and the facts accumulated by previous work) 
suggest that aggression in childhood is a form 
of behavior developed in response to specific 
environmental conditions; conditions created 
by man and thus, potentially changeable by 
man. Clearly, aggression is a universal capacity 
of human nature, a capacity which is first 
expressed in the unfocused rage of infants. 
But the development of this trait—whether 
it is transformed into a pervasively destructive 
syndrome of behavior or whether it goes 
fallow—seems to lie well within the realm of 
human culture, as this culture is mediated 
through early familial experiences. 
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the social learning of the individual, as 

most personality theorists assert, then it 
should be possible to demonstrate systematic 
differences in the self-concepts of persons whose 
social experiences have differed in funda- 
mental ways. We have focused in this study 
on three variables that should reflect differ- 
ential social learning experiences, in an ex- 
ploratory attempt to determine how these 
variables relate to two aspects of the self-con- 
cept, dominance and love. Dominance includes 
reported self-behaviors involving assertion, 
control over others, and self-confidence, while 
love includes characteristics such as friendly, 
warm, and cooperative. 

Most generally, males might be expected to 
develop differing self-concepts depending upon 
whether their identification is primarily with 
the father or the mother. If we consider domi- 
nance and love behaviors as cultural modal 
points in the role behavior of fathers and 
mothers, respectively, we would expect father 
identifiers to emphasize self-conceptions of 
dominance and mother identifiers to empha- 
size self-conceptions of love. That parental 
behavior may cluster about these modal points 
is suggested by our previous work (Bieri & 
Lobeck, 1959). Older adolescents tended to 
perceive their parents in terms of an active, 
controlling, and teaching role (dominance) or 
in terms of a role emphasizing emotional 
closeness and warmth (love). Similarly, the 
love and dominance dimensions of this study 
resemble, respectively, the love-hate and con- 
trol-autonomy dimensions described by 
Schaefer (1959) in his factor analyses of re- 
sults from parent behavior studies. 

A second major socializing influence on how 
a person’s self-concept develops is his religious 
affiliation. Religion may exert an influence on 
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Į the self-concept is a product primarily of 
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one’s behavior by defining the available soci 
and economic opportunities, by structuring 
certain values and attitudes, and by struc- 
turing social relationships within the family. 
We have selected two religious faiths, 
Catholicism and Judaism, about which, from 
the existing literature, one might assume cer- 
tain differences in socialization that should 
affect the self-concept. Perhaps most crucial 
in terms of the dominance aspect of the self- 
concept are the value differences often at- 
tributed to these two groups (Sklare, 1958; 
Strodtbeck, 1958). A strong value in the Jewish 
group has been considered to be the emphasis 
upon the rational mastery of one’s environ- 
ment with concomitant emphasis upon 
achievement and intellectual attainment. In 
contrast, values associated with the Catholi- 
cism of immigrant groups from Ireland and 
Italy have included an emphasis on fatalism 
and the role of destiny in controlling one’s 
life, as well as dependence upon external 
authority. Such considerations are akin to 
Florence Kluckhohn’s categories of value 
orientations in which the Jewish group would 
be considered to stress man’s mastery over 
nature while the Catholic group stresses man’s 
subjugation to nature (Kluckhohn, 1953). 

The third variable assumed to affect the 
self-concept is social class status. It seems 
reasonable to expect that the degree of domi- 
nance in one’s self-concept varies directly with 
one’s social class background. Persons from 
lower class backgrounds have been exposed to 
more learning conditions in which the relative 
inferiority of their physical and social situation 
may be translated into self-feelings of in- 
feriority and submissiveness. Conversely, thé 
upper status individual has been associat 
with conditions of occupational and social 
prestige and power, promoting a greater feeling 
of self-confidence and dominance, 


| 


METHOD 


Subjects. Members of an enlisted men’s Army 
Teserve unit served as subjects for the research. This 


| 
| 
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group of 89 males, residents of the Bronx, ranged in 
age from 19 to 40 years, with a mean age of 23.5 years, 
The majority of the subjects identified themselves as 
either Jewish or Catholic. Approximately one-third of 
each religious group were married, The ethnic origin 
of the Catholic group was mixed, being predominantly 
Italian or Irish, while the Jewish group had predomi- 
nantly Eastern European origin. Approximately 50% 
of the subjects in each religious group had fathers who 
were foreign born, ° 

‘Self-concept, The two aspects of the self-concept, 
dominance and love, were measured by the Inter- 
personal Checklist (Leary, 1957). This method con- 
sists of 128 adjectives or phrases grouped into eight 
behavioral categories or octants. These octants 
are: managerial-autocratic, competitive-narcissistic, 
aggressive-sadistic, rebellious-distrustful, self-effacing- 
masochistic, docile-dependent, cooperative-overcon- 
ventional, and responsible-hypernormal. The subject 
was instructed to check those adjectives or phrases 
which best described himself. Dominance and love 
scores were obtained by means of formulae de- 
veloped by LaForge, Leary, Naboisek, and Coffey 
(1954). The dominance score is most positively weighted 
on the managerial-autocratic and competitive-narcis- 
sistic octants, and negatively weighted on the self- 
effacing-masochistic octant. The love score is most 
positively weighted on the cooperative-overconven- 
tional and docile-dependent octants, and negatively 
weighted on the aggressive-sadistic octant. Dominance 
and love scores were converted to standard scores with 
a mean of 50 and a standard deviation of 10, These 
two scores are not correlated (r = —.03) for the 
subjects in this study. 

Identification. The measure of identification used 
here is intended to reflect the subject’s perception of 
greater similarity to one parent than to the other. A 
more detailed analysis of this measure of one aspect of 
identification and its personality correlates may be 
found in previous publications (Bieri, 1960; Bieri & 
Lobeck, 1959; Bieri, Lobeck, & Galinsky, 1959). 
Each subject rated himself, his mother, and his father 
on 14 10-point scales between each of the following 
bipolar adjectives: carefree-serious, dependable- 
undependable, passive-active, happy-sad, delicate- 
tugged, loud-quiet, friendly-unfriendly, meek-bossy, 
unpleasant-pleasant, strong-weak, cold-warm, con- 
fident-unsure, excitable-calm, and smart-dumb. The 
Perceived similarity to each parent is measured by 
squaring the distance between one’s self-rating and the 
rating given the parent on each 10-point scale. The 
sum of these 14 distance scores (D?) represents an 
index of the perceived similarity to each parent, and 
the subject is assumed to identify with (i.e., perceive 
himself as more like) that parent with whom he has 
the lower D? score. The independence of this identifica- 
tion measure and the checklist measure of the self- 
concept will be considered below. 

Social class. The social class background of each 
subject was determined on the basis of Hollingshead’s 
two-factor index (Hollingshead, 1957). This index 

ssifies the subject into one of five categories on the 
basis of the father’s occupation and education. The 
numerical scheme used places greater weight upon the 
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father’s occupation than upon his education. The 84 
subjects used in the analyses involving social class 
were distributed into five classes as follows: Class I 
(highest), 3; Class II, 7; Class II, 18; Class IV, 42; and 
Class V, 14. Thus, the modal class of the sample is 
Class IV, the lower middle class. While the class distri- 
bution of the Jewish subjects indicated somewhat 
higher class backgrounds than those of the Catholic 
group, this difference was not significant. We have 
grouped these five classes into two overall class group- 
ings, which we call the upper class group (I, II, and 
IM) and the lower class group (IV and V). 


RESULTS AND Discussion 


Table 1 presents the mean scores on domi- 
nance and love as a function of the three so- 
cialization variables. Considering identifica- 
tion first, no significant differences within or 
between identification groups are found. How- 
ever, the differences are in the direction an- 
ticipated in our earlier discussion, with father 
identifiers having higher dominance than love 
scores, and mother identifiers having higher 
love than dominance scores. The most striking 
difference is the higher love score of mother 
identifiers compared to father identifiers 
(p= .11). 

When the results in Table 1 are analyzed 
in relation to religious affiliation, it is found 
that while Jewish and Catholic subjects have 
almost identical mean dominance scores, 
Catholic subjects have a significantly higher 
mean love score than do Jewish subjects. 
Within religious groups, there is not a sig- 
nificant difference between mean dominance 
and love scores for Jewish subjects, but Catho- 


TABLE 1 
MEAN DIFFERENCES IN DOMINANCE AND Love SELF- 
Concept Scores IN RELATION TO PARENTAL 
IDENTIFICATION, RELIGIOUS AFFILIATION, AND 


Crass STATUS 
Subgroup N Dominance Love $ 
Pa aan Re A A SE 

F 39 49.62 48,59 ns 
$ ns ns 

M 47 49.08 52.49 ns 

J 48 49.79 48.08 ns 
$ ns 02, 

RC 41 49.83 53.90.01 

HI 28 54.14 47.96 05 
$ -01 ns 

IV-V 56 47.23 51.34 ns 


Note.—In this and subsequent tables, F = father identifiers, 
M = mother identifiers, J = Jewish subjects, RC = Catholic 
subjects, I-III = upper class subjects, and IV-V = lower class 
subjects. All probability values are based upon ¢ tests using a two- 
tailed test. Incomplete data on some subjects causes the Ns to 
vary. 
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lic subjects have a significantly higher mean 
love score than mean dominance score. Both 
significant findings in relation to religious 
affiliation involve the elevated love scores in 
the Catholic group. 

Finally, in relation to social class, we observe 
in Table 1 that upper class subjects do have a 
significantly higher mean dominance score 
than lower class subjects. While lower class 
subjects have a higher mean love score than 
upper class subjects, this difference is not sig- 
nificant. Within class groups, upper class sub- 
jects have a higher mean score on dominance 
than on love, and lower class subjects have a 
higher mean score on love than on dominance. 
Only the former difference is significant, how- 
ever. 

Further analyses of these self-concept differ- 
ences are possible holding the effects of two of 
the variables constant when studying the 
third. However, a problem arises in this re- 
gard because of the lack of independence 
among many of the comparisons that could 
be made. To avoid having possible a posteriori 
judgments dictate what independent compari- 
sons are made in an exploratory study such 
as this, we present and discuss these subgroup 
findings as suggestions for further study in 
which a priori considerations would indicate 
the specific relationships to be tested. 

Since our self-concept measure and our 
identification measure both contain self- 
ratings, it is necessary to analyze the possible 
confounding of these two measures. As would 
be expected, there is some association between 
the self-ratings on the checklist and the self- 
ratings on the semantic differential. If we iso- 
late a “warmth” cluster of three scales on the 
semantic differential (warm-cold, friendly- 
unfriendly, and pleasant-unpleasant), self- 
scores on this cluster correlate significantly 
with the love scores (r = .48, p < .01). Simi- 
larly, self-scores on a “strength” cluster of the 
semantic differential scales (strong-weak, 
bossy-meek, confident-unsure, and rugged- 
delicate) correlate significantly with the domi- 
nance scores (r = .47, p < .01). Moreover, 
item analysis shows these “warmth” and 

“strength” scales to be the best predictors of 
the overall direction’ of identification. Thus, 
for example, on the warm-cold scale mother 
identifiers score themselves closer to the 
mother than to the father while father iden- 


tifiers score themselves closer to the father 
than to the mother (chi square = 17.66, p 
< .01). That is, identification as we measure 
it appears to reflect to a great extent perceived 
similarities in terms of warmth and strength, 
Now, we also find that subjects in both identi- 
fication groups tend to rate mother higher 
than father on the “warmth” scales, and father 
higher than mother on the “strength” scales, 
Does it follow, then, that subjects who score 
closer to the mother than to the father 
(mother identifiers) rate themselves higher on 
“Warmth” scales and lower on “strength” 
scales than subjects who score closer to the 
father than to the mother (father identifiers)? 
If so, it is obviously redundant to say that the 
subject perceives himself to be high on love 
and that he identifies with the parent who is 
considered to be more loving (the mother), 
However, such a confounding of identification 
and self-concept requires that it is the self- 
ratings on the semantic differential and not the 
ratings given the two parents which differen- 
tiate mother from father identifiers. When we 
examine the relative mean ratings given the 
mother, father, and self in the two groups, we 
find that on the scales of the “warmth” cluster 
it is the difference in the rating given the father 
which distinguishes mother from father identi- 
fiers. On the “strength” scales, the two identi- 
fication groups are distinguished by their 
positioning of the mother. In neither cluster 
does a difference in self-rating distinguish 
between mother and father identifiers. Thus, 
even though the self-concept is involved in 
obtaining the measure of identification, identi- 
fication per se does not predict the subject's 
self-concept. 

As noted above, the significant findings rela- 
tive to religious affiliation (Table 1) centet 
around the elevated mean love score in the _ 
Catholic group. Reference to Table 2 indicates _ 
that in every subgroup comparison between 
subjects in the two religious groups, Catholic | 
subjects have higher love scores than do Jewish | 
subjects. However, within the Catholic group; 
it appears that the mother identifiers are con- 
tributing most to these differences in love f 
self-conceptions. Thus, the more numerous 
Catholic mother identifiers have a mean love | 
score of 55.56, while the less numerous Catho- | 
lic father identifiers have a mean love score | 
of 50.60. Further, the mean love scores | 


| j 
| 
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TABLE 2 
Mean SELF-CONCEPT Scores CLASSIFIED BY 
RELIGION, IDENTIFICATION, AND SOCIAL CLASS 
BACKGROUND 


Social Class 


Subgroup Domi- Love TII IV-V 


nance 


Dominance Love Dominance Love 


All 49.80 50.76 54.14 47.96 47.23 51.34 
subjects (N = 89) (N = 28) (N = 56) 
49.62 48.59 57.23 45.92 45,38 49.48 
(WN = 39) (W = 13) W= 25) 
M 49.08 52.49 50,93 49.43 47.45 52.97 
W = 47) WW = 14) (WW = 29) 
J 49.79 48.08 55.16 46.00 46.00 49.04 
(N = 48) (N = 19) (N = 28) 
JF $1.04 47.33 57.70 45.30 45.69 47,85 
WN = 24) (N = 10) (N = 13) 
JM 47,50 49.00 51.50 45.87 45.21 50.79 
WW = 22) (N = 8) (N = 14) 
RC 49.83 53.90 52.00 52,11 48.46 53.64 
(N = 41) (WN = 9) W = 28) 
RCF 47.33 $0.60 55.67 48.00 45.25 51.25 
(N = 15) W= 3) (N = 12) 
RCM 50.48 55.56 50.16 54.16 49.53 55.00 
(WN = 25) (W = 6) (WV = 15) 


in the Catholic mother identification group 
are quite similar for subjects from lower class 
and upper class backgrounds. In this regard, 
it is interesting to observe (see Table 2) that 
in the three-way breakdown of religion, identi- 
fication, and social class among upper class 
subjects only the Catholic mother identifiers 
have higher love than dominance scores. 
Further analyses of the significant findings 
in relation to social class indicate that in both 
identification groups and in both religious 
groups, upper class subjects have higher domi- 
nance scores than lower class subjects (see 
Table 2), However, these social class differ- 
ences in dominance are greater within either 
religious group for father identifiers than for 
mother identifiers. This finding suggests that 
identification may affect self-conceptions in 
interaction with other socialization variables. 
An analysis of the specific checklist items 
that were selected more frequently by one 
group than the other within each of the sociali- 
zation categories is presented in Table 3. 
Catholic subjects checked more frequently 
than Jewish subjects items from the coopera- 
tive-overconventional octant of the checklist 
(fond of everyone, likes everybody, loves 
everyone, and friendly). This octant is 
weighted most heavily in the derivation of the 
love score. Three of these four items were 
selected significantly more often by Catholic 


TABLE 3 


CHECKLIST IveMs SELECTED WITH DIFFERENTIAL 
FREQUENCY BETWEEN SUBGROUPS ON EACH 
SOCIALIZATION VARIABLE 


Ż 
Identification 
wants to be led F>M 02 
Religion 
can be indifferent to others | J > RC .01 
critical of others J>RC .02 
able to doubt others J> RC .05 
able to criticize self J>RC .05 
easily embarrassed RC>J .06 
very respectful to authority | RC > J .01 
dependent RC>J 05 
will believe anyone RC>J 05 
friendly RC>J 10 
fond of everyone RC>J .01 
likes everybody RC>J 06 
loves everyone RC>J 05 
forgives anything RC>J 02 
enjoys taking care of others| RC > J 06 
Social Class 
able to give orders I-OI > IV-V | .06 
critical of others I-II > IV-V | .09 
businesslike III > IV-V | .01 
outspoken L-II > IV-V | .05 
passive and unaggressive IV-V > HIII | .10 
very respectful to authority | IV-V > I-III | .10 
lets others make decisions IV-V > HI | .10 


Note.—Probability values based upon chi square corrected 
for continuity, two-tailed test. 


mother identifiers than by Jewish mother 
identifiers, While 9 of the 10 items checked 
more frequently by Catholic subjects come 
from the cooperative-overconventional, docile- 
dependent, and responsible-hypernormal oc- 
tants, none of the items checked more fre- 
quently by Jewish subjects comes from these 
octants. Rather, the four items checked more 
frequently by Jewish subjects represent the 
competitive-narcissistic, aggressive-sadistic, re- 
bellious-distrustful, and self-effacing—-masoch- 
istic octants. These item differences between 
the religious groups suggest more critical, 
skeptical, and autonomous self-conceptions in 
Jewish subjects and more conforming, nur- 
turant, and acquiescent self-conceptions in 
Catholic subjects, especially Catholic mother 
identifiers. 

The items checked more frequently by upper 
class subjects (see Table 3) come from the 
managerial-autocratic, competitive-narcissis- 
tic, and rebellious-distrustful octants, while 
lower class subjects more frequently check 
items from the self-effacing—masochistic and 
docile-dependent octants. These self-concept 
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differences between the two class groupings 
are consistent with the findings of other 
studies. Thus, Klausner (1953) reports more 
feelings of inferiority and insecurity in ado- 
lescents from lower class backgrounds. The 
greater emphasis on submissive and passive 
self-conceptions in our lower class subjects is 
consistent with the findings of Hollingshead 
and Redlich (1958) and others that schizo- 
phrenic symptoms, which may be extreme 
forms of these submissive trends, are perhaps 
more frequent in lower than upper class per- 
sons. Similarly, Myers and Roberts (1959) 
report that family relationships and life con- 
ditions appeared to be more conducive to the 
development of isolation and withdrawal in the 
Class V neurotics and schizophrenics of their 
sample. Lane and Singer (1959) also report 
that in contrast to the middle class normal sub- 
jects, lower class normals and schizophrenics 
evidence a greater amount of submission and 
denial of aggression. 


SUMMARY 


This study focused on the relationships be- 
tween the three socialization variables of 
parental identification, religious affiliation, 
and social class to two aspects of the self-con- 
cept, dominance and love. The subjects in the 
study were 89 male Army reserve enlisted per- 
sonnel who were about equally divided by 
Catholic or Jewish religious affiliation. Parental 
identification was measured in terms of the 
greater perceived similarity to one parent in 
contrast to the other on the semantic differ- 
ential. Upper class background was measured 
by grouping Social Classes I through III on 
Hollingshead’s two-factor index, while lower 
class background consisted of Classes IV and 
V on the same index. The primary findings 
were: 

1. No significant differences in self-concept 
scores as a function of parental identification 
were observed. 

2. Catholic subjects have significantly 
higher love scores than do Jewish subjects, 
and the former have significantly higher love 
than dominance scores. 

3. Upper class subjects have significantly 
higher dominance scores than lower class sub- 


í 
jects, and upper class subjects are significantly | 
higher on dominance than on love. | 


Further analyses of the data suggest that 
future research might study systematically 
the suggestive findings that (a) love self-con: 
ceptions are most pronounced in Catholic 
mother identifiers and that (b) the greatest 
differences in dominance self-conceptions are 
found between groups of father identifiers who 
differ in terms of social class status. 
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studies on word association (1906), there 

has been a steadily increasing number of 
studies directed toward the possible relation- 
ship between emotional and cognitive varia- 
bles. In the 1920s and 1930s, a major research 
effort was to demonstrate that emotion is 
one of the determinants of memory. Appar- 
ently, this effort was part of a more general 
attempt to examine the implications of state- 
ments by Freud, Jung, and other psychoana- 
lytic writers. Rapaport (1942), Meltzer (1930), 
Zeller (1950), among others, have reviewed 
many of the studies which served either to 
support or to invalidate the above thesis, and 
these studies will not be reviewed here. It 
might be noted, however, that in recent years 
interest in the emotional determinants of 
memory has diminished, and attention has 
shifted to affective factors related to percep- 
tion. One sign of this shift was Rapaport’s 
decision in 1950 to republish “unaltered” the 
1942 edition of his book Emotions and Memory. 
One might believe that this multitude of 
investigations would have led to some reason- 
ably secure conclusions about the influence 
of emotional factors in the memory process. 
Yet this is not the case. For example, Bugel- 
ski’s (1956) text on learning rejected the idea 
that there is any evidence linking emotional 
factors to memory. Underwood, in a recent 
Statement, refused to attach importance to 
the role of affect in forgetting until further 
research establishes a more “reliable body of 
facts” (1957, p. 57). Even Rapaport (1942), 
after his rather sympathetic survey of the 
relevant literature, concluded that consider- 


S= the publication of Jung’s pioneering 


*The research reported here was done while the 
authors were at the University of Michigan. The 
study was supported, in part, by a grant (M-450) 
from the National Institute of Mental Health, United 
States Public Health Service, administered by E. S. 
Bordin. Portions of this paper were presented at the 
1958 APA Meetings in Washington, D. C. 

* The authors are indebted to R. S. Davidon, R. R. 
Bush, and H. B. Ranken for commenting on certain 
Statistical issues, 
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ably more work is needed before one can under- 
stand the nature of such a relationship. 

The present study grew out of various fail- 
ures—by the first author and by other investi- 
gators (e.g., Grummon & Butler, 1953; Merrill, 
1952)—to confirm the validity of a word 
association technique used by Keet (1948) 
for evoking the forgetting of “traumatic” 
words. Keet had employed the Jung associa- 
tion method for trying to locate that word 
in the stimulus list which resulted in the most 
“traumatic” response on the part of the sub- 
ject (S). Having located such a word, Keet 
found that Ss blocked in remembering it 
during a subsequent learning task, enabling 
Keet to introduce either of two experimental 
quasi-counseling methods for helping S to 
overcome his forgetting of the word. 

After failing with 10 Ss to replicate Keet’s 
method for producing the supposed trauma, 
the first author tried a variation of his tech- 
nique. Instead of asking Ss merely to associate 
any word upon the second presentation of 
the stimulus list—which is the traditional 
procedure (e.g., Hull & Lugoff, 1921; Jung, 
1906; Keet, 1948)—Ss were asked to remember 
their first free association (the procedure used 
by Rapaport, Gill, & Schafer, 1946). Respond- 
ing to this, Ss recalled the large majority of 
their previous associations, but forgot many 
of those which had an apparently emotional 
significance. For example, one S responded 
“bad” to the stimulus word “love,” and later 
failed to remember this response. To develop 
this procedure for the purpose of setting up a 
laboratory analogue of a counseling session 
though, it was necessary to demonstrate that 
the forgetting observed is indeed related to 
emotional factors. 

The study reported here was conducted, 
therefore, to examine the general hypothesis 
that the forgetting of word associations is 
related positively to their emotional signifi- 
cance. In other words, it was expected that the 
more emotional the reaction to a stimulus, the 
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less likely that the specific response would be 
recalled. 

This hypothesis seems relevant to other 
research on affective factors in recall. Yet the 
only other study known to the writers that 
dealt with directly pertinent data is one by 
Rapaport, Gill, and Schafer (1946). The writers 
reported that Ss forgot their associations to 
traumatic stimulus words significantly more 
often than those given to nontraumatic words. 
In that study, however, the stimulus words 
were not equated for familiarity of usage, and 
also other variables may have accounted for 
the results. Thus, Laffal (1955) found that a 
large proportion of Ss’ variance in failures to 
reproduce associations, delayed association 
time, and other “response faults” wereattribut- 
able to the interference among mutually 
competing responses—which he labeled “re- 
sponse entropy.” 


METHOD 
Subjects 

The Ss were female volunteers from an under- 
graduate course at the University of Michigan. Thirty- 
four participated in the main part of the experiment. 
They were seen singly and were administered the 
word association test and the recall test. Free asso- 
ciations, reaction times, and recall responses were 
obtained for all Ss. Galvanic skin responses were 
secured from only 20 Ss (the GSR apparatus was 
inoperative on a number of occasions). 

About 4 months after their original testing, 27 Ss? 
returned for retesting. These Ss again gave their free 
associations to the original list, and rated each of the 
stimulus words for emotionality. 


Procedure 


Each S was scheduled for a one-hour individual 
appointment. When the GSR apparatus‘ was in order, 
one experimenter (F,) attached the GSR electrodes to 
the $’s left hand and forearm, fastened her hand into a 
wooden glove, and placed her arm in a comfortable 
position on a cushion. While Æ, helped S relax, the 
other experimenter (Ez) adjusted the apparatus and 
waited for the leveling of the skin resistance. Æ, then 
gave the following instructions: 

‘The first part of this study has to do with people’s 


*The Ss were paid to come to the retest session. 
Of the seven Ss who did not return, four were no 
longer on campus and the remaining three could not 
be scheduled for an appointment. 

‘The GSR apparatus was designed and built by 
D. Rigler (cf. his doctoral dissertation, 1957). A 
constant current of 40 microamperes was passed 
through S, and changes in resistance were measured. 
Clay disk electrodes were strapped to S’s forearm and 
hand, which was held immobile in a wooden glove. 
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reactions to different words. I am going to read 
a list of words—one at a time. Each time I say 
word, I want you to tell me the first word whi 
comes to mind. So, for example, I might say 
word “table,” and you might then think of the 
word “chair.” In that case you would immediately 
say “chair.” I want you to do this as quickly as 
possible, since I am going to time you. Do you 
have any questions? 
The list of 60 words was read to S. Reaction times 
were recorded to tenths of seconds. Meanwhile, Ey 
recorded S’s GSR deflections which followed the 
presentation of each stimulus word. After each GSR 
deflection returned to the base point, Es flashed a light 
concealed from S, and Æ, could proceed to read the 
next stimulus word. 
At the end of the 60 words, Æ, asked S to recall the 
associations as follows: 
That was fine. Let’s take a short break..., 
Now let’s go on to the second part of the task, This 
time I am going to read you the same list of words. 
But this time, each time I say a word, I want you 
to remember the word you said the Zast time. So, for 
example, if I had said “table” and you “chair,” 
this time you would try to remember “chair,” and 
say it as soon as you remember it. Do you under 
stand? 
Tt may happen that you will not be able to remem- 
-ber a word. In that case, I want you to try to think 
of it and guess as close to the word as you can, If 
you do guess at a word, just tell me that you are 
guessing. Also, if you think you have forgotten @ 
word and should remember it later on, be sure to 
tell me the word immediately. h 
Following these last instructions, the stimulus list 
was read once more and recall responses were noted. 


The Word List 


The word list is reproduced in Table 1. Varieties of 
word association lists have been used widely, evef 
since Jung’s publication of his technique in 1 
Forty of the words in this list were selected from Hull 
and Lugoff’s (1921) English version of Jung’s list. 
Hull and Lugoff studied the reactions of 100 individuals 
to each of the 100 words in Jung’s list. The stimulus 
words were compared according to their tendency t0 
elicit “complex signs of emotional disturbance”—su' 
as long reaction time, inability to make any response, 
misunderstanding of the stimulus word, etc. 

The 30 neutral words in this study were selected 
from the least “disturbing” 50 words of the Hull- 
Lugoff series. Ten of the emotional words were selected 
from the top half of the series, with some modifications 1 | 
of verbal form (e.g., from “fearing” to “fear”). Since |] 
the other words on the Hull-Lugoff list appeared t00 | 
neutral, the remaining 20 stimuli were selected 4 
priori to represent objects and activities that might 
produce some emotion in S. As shown in Table 1, the | 
two sets of 30 words were approximately equal with | 
regard to their frequency of usage, as reported by 
Thorndike and Lorge (1944). 


Retest Procedure 


About 4 months after the original test, the Ss wert] 
contacted once again and offered remuneration fo | 


| 
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TABLE 1 
THE List or SrmuLUs Worps AND THE DATA 
f “ES 

Stimulus Word |Countl Recall] GSR | RT | ER | Var, | Vane 
1. head? AA | 0 | 33.6] 1.5 |s5.0| 14 | 18 
2. man AA | 3 |20.8| 1.5 | 5.81] 16 | 7 
3. frog 25 | 2 |24.7| 1.6 |219| 14 | 12 
4. conduct A} 1 | 31.0] 2.4 |540| 11 |10 
5. truth AA! 6 | 33.6] 21] 4.85| 9 | 13 
6. fur A] o | 37.4] 2.0] 3.27] 1 | 12 
7. pity A| 7 | 341] 2.8 |6.35| 20 | 13 
8. lake AA | o | 15.1] 1.7 | 3.59] 12 | 10 
9. deceive 33 | 11 | 24.9] 3.6] 5.89| 16 | 18 
10. weak A| 6 | 22.8] 1.9]s.31] 9 | 5 
11. ink 20 | 1 | 17.3] 17/219} 4] 8 
12, cold AA | 2 | 21.7] 1.9 | 3.88] 12 | 14 
13. fear AA | 18 | 35.8] 3.6 | 6.78] 21 | 23 
14, lamp A] 1 | 19.2] 20/188] 7| 6 
15. dead AA | 4 | 40.4] 2.6 | 5.98} 15 | 14 
16. power AA | 7 | 39.5] 2.1 | 5.44] 16 | 14 
17. hay 46) 1 | 14.3] 2.6] 2.35] 12 | 12 
18. mother AA | 2 | 35.7] 1.8] 6.53] u | 8 
19. village AA | 0 | 16.4] 1.8] 4.37] 10] 5 
20, wild AA | 3 | 37.7| 3.5 | 5.50] 26 | 20 
21. excel 11 | 6 | 37.1] 3.0 | 4.80] 25 | 19 
22. love AA | 9 | 34.6] 2.1 | 6.80] 12 | 11 
23. sing AA | 6 | 26.2] 2.0 | 4.93] 18 | 14 
U. friendly AA | 7 | 31.2] 2.6 | 6.34] 23 | 17 
25. strong AA | 9 | 18.6] 2.2 | 4.12] 12 | 13 
26. painting 45 | 1 | 42.3] 2.2] 5.58] 19 | 16 
27. despise 26 | o | 30.8] 2.1] 6.19} 6] 6 
28. compete 11 | 6 | 24.4] 3.0] 5.61] 16 | 16 
29. month AA | o | 24.1] 1.7] 2.58] 8| 9 
30. bad AA | 6 | 13.7| 20}5.31| 8 | 4 
31, cow A} o | 19.3] 1.8] 1.92] 9 | 45 
32, new AA | 3 | 18.3] 1.9] 3.92) 7 | u 
33, angry A | 14 | 14.9] 2.2 | 6.80] 16 | 16 
34, tree AA | 1 | 15.8] 1.8 | 2.70] 15 | 45 
35. woman AA | 0 | 14.7] 1.7]6.00] s| 6 
36. mountain AA | o |21] 1.9 |312| 15 | 14 
37. carrot 8| o |23.3| 2.2} 2.06] 19 | 14 
38. hate A | 12 | 21.4 | 2.3 |6.90| 10 | 13 
39, long AA | 4 |22.2|-2.4 | 2.66] 11 | 13 
40. ridicule 9| 1 | 30.8) 24] 6.09] 19 | 15 
41, hiss AA | 3 | 58.6) 2.7] 6.81] 16 | 8 
42. house AA | 6 | 15.4] 2.1 | 4.62] 13 | 4 
43, false A} 9 | 22,0] 2.0] 5.80] 12 | 11 
44, window AA | 4 | 29.5] 1.9 | 2.42] 13 | 16 
45. father AA | 1 | 14.6] 15 |617| 4] 4 
46, door AA | 1 | 19.4] 1.9] 2.24) 7 | 12 
47, pencil 40] 1 | 12.8] 1.4 |173| 8 | 16 
48. sin A} 9 | 45.1] 3.1] 6.53] 15 | 18 
49. quarrel A} 3 | 13.4] 1.9] 5.94] 12 | 10 
50, hunger 37 | 1 | 18.2] 1.9] 4.89] 11 | 14 
SL. marry AA | 1 | 50.0] 3.3 | 6.90] 18 | 17 
52. voyage 4 | 1 | 22.4] 1.9] 4.66) 11 | u 
53, salt AA | o | 11.8] 2.5 |} 2.08] 9 | 9 
54. damage 32| s | 14.4] 2.3 | 4.59] 22 | 17 
55. bird AA | 1 |21| 1.8] 3.12] 16 | 19 
56. good AA | 3 | 13.8] 1.6 | 5.83] 5 3 
57. book AA | 1 |19.3| 1.5 | 4.39] u | 16 
58. beating 5 | 9 | 29.9] 2.6 |605| 21 | 20 
59. old AA | 7 | 14.1] 1.8] 4.55] 14 | 12 
60. big AA | 2 | 14.7] 1.5 | 2.92] 12 | 13 
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returning for another appointment. This time Ss were 
scheduled in groups ranging in size from 2 to 14. The 
word list was administered again, with the same 
instructions as given during the first part of the previous 
procedure. Ss wrote their associations into a 60-page 
booklet, being allowed 8 seconds between the presenta- 
tion of one stimulus word and the next, 

In the second part of the retest period, Ss rated each 
of the stimulus words on several seven-point scales, 
including one which ranged from “emotional” to 
“unemotional.” Each S received a 60-page booklet 
containing the scales on every page. In giving the 
instructions, the Æ said that, even when it seemed 
difficult to rate a word on a scale, one had to make 
the best possible try and assign some rating, Each 
word was presented for 18 seconds.® 


Indices 

The following indices were constructed as measures 
of recall, of emotion, of response variability, and reac- 
tion time. 

Recall, Responses on the recall test were classified 
into three categories: Total Recall, Zero Recall, 
Partial Recall, Total Recall refers to all answers which 
were identical to the S’s original association; Zero 
Recall refers either to no answer at all or to clearly 
incorrect ones; Partial Recall refers either to incorrect 
answers similar to the original response (e.g., “cool” 
after “cold,” or “trip” after “journey”) or to correct 
answers which occurred after S had already given an 
erroneous one. On the recall test, 85.8% of the responses 
showed Total Recall, 11.1% Zero Recall, and 3.0% 
Partial Recall. All but one S forgot at least two asso- 
ciations, but none forgot more than 14." 

Emotion. Three criteria of emotion were employed 
in this study. The first was galvanic skin response. 
The GSR index consisted of changes in the S’s resist- 
ance while she gave each word association. All GSR 
deflections were converted to standard scores as ratios 
of the S’s base resistance, A R/R (see Woodworth & 
Schlosberg, 1955, p. 157). By this method, valid com- 
parisons could be made among the responses for any 
given S7 

The second index consisted of the a priori classifica- 


š Between the two retest tasks, two rather brief 
personality tests were administered in connection with 
some exploratory research. 

6 It is assumed that all failures by Ss to repeat their 
initial response, when instructed to remember it on 
the second stimulus presentation, constituted inade- 
quate retention. When these occurred, Ss tried hard 
to recall the missing word; occasionally they would 
give a delayed correct response, and other times they 
would manage partial recall. 

7™When GSR scores were averaged across Ss, an 
interesting finding emerged. For the 26 stimulus words 
which were identical in this study and Smith’s study 
(1922), there was a moderately high correlation between 


* T-L Count refers to the frequency of each word per million 
words in the English language, as reported by Thorndike and 
Lorge (1944). AA means more than 100 per million; A means more 

is 50 per million. 

Stimulus words printed in ordinary type were selected by Es 
as “neutral,” and italicized words were selected as “emotional.” 


the GSRs—tho = .56. After adding four more words, 
presented in slightly different form in the two studies 
(e.g, “beating” vs. “beat”), rho increased to .60, 
Such a correlation between results more than 25 years 
apart bolsters one’s confidence in the potential stability 
of a GSR index. 
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tion of the stimulus words by the Es as either “emo- 
tional” or “neutral.” The third index was the emotion- 
ality of the stimulus words as rated by the Ss during 
the second testing session. 

Response variability. Two indices were constructed 
pertaining to a response-competition explanation of 
forgetting. One was called Population Variability: the 
number of different responses by the 34 Ss given to 
each stimulus word on the initial association test. 
The other was Retest Variability: the number of 
different responses given by the (retested) Ss to each 
stimulus word in the two different administrations 4 
months apart. In constructing these indices, it was 
assumed that the greater the number of different 
responses for a stimulus, the greater would be the 
potential competition (or interference) among alterna- 
tive responses. 

Reaction time. This variable was measured in tenths 
of seconds by Æ, from the time he began to say the 
stimulus word to the beginning of S’s response. 


HYPOTHESES AND RESULTS 
Hypothesis 1 


Associations that are subsequently forgotten 
have higher GSR deflections than those sub- 
sequently remembered. This hypothesis was 
supported. For each of the 20 Ss for whom 
GSRs were available, the mean difference 
between scores on the Zero Recall and the 
Total Recall associations was computed. 
Although not all Ss had higher average de- 
flections during the former responses, the 
difference was significant at beyond the .05 
level (T = 51, N = 20, two-tailed p < .05, 
by Wilcoxon’s matched pair signed-rank 
test; cf. Wilcoxon, 1949). 


Hypothesis 2 


Associations to “emotional” words, on the 
stimulus list selected by the Es, show greater 
forgetting than associations to “neutral” 
words. This hypothesis received strong con- 
firmation. Comparing each S’s number of 
Zero Recall associations for the emotional 
and the neutral words, it was found that 31 
Ss forgot more of the former, 3 Ss forgot an 
equal number of each, and none forgot more 
of the latter. (By sign test, this difference is 
significant far beyond the .001 level.) Of the 
total of 228 Zero Recall associations (cf. 
Table 1), more than 78% were responses to 
stimuli initially classified as “emotional.” 
For example, 18 Ss—more than half—failed 
to recall their responses to “fear”; 14 and 9 
did not recall their associations to “angry” 
and “love,” respectively; but no one forgot 
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on such neutral words as “head,” “ 
“lake. » 


Hypothesis 3 


Associations to words high on emotionalit 
as rated by the Ss, show greater forgettin 
than those low on emotionality. This hypotl 
esis also was confirmed beyond the 0 
level of significance, by sign test. Tw 
eight Ss had more Zero Recall for associa’ 
to stimuli that were above the median € 
emotionality, two for those given to stimu 
below the median, while four Ss forgot 4 
equal number of associations to high and lo 
emotional stimuli. a 

These findings may appear to demonstra 
that the forgetting of word associations wi 
indeed related to emotional factors 
cannot accept this conclusion, however, u 
one has examined the alternative explana 
that the forgetting of such responses is di 
mined by the degree of competition 
alternative responses, Laffal’s (1955) fin 
that “response faults’ on word associati 
tests, long considered to be indicators | 
emotional disturbance, are highly correlat 
with “response entropy,” referring to th 
uncertainty that any single response is co 
nected with the stimulus, is illustrative of thi 
line of explanation. Our index of Zero Rec 
is one kind of response fault, although thi 
index has been little used in previous studi¢ 

Following this line of reasoning, we exa 
the relation between forgetting (Zero 
and the two indices of response competi 
(Population and Retest Variability). 
relations were almost as high as in Hypoth 
2 and 3, and also significant beyond the -0 
level. Twenty-six Ss forgot more associatio 


Ss showed higher Zero Recall for responses 
stimuli with high Retest Variability, and 0 
six showed lower forgetting for such response 

Thus, it was found that forgetting was í 
most equally related to the indices of emotion 
significance and those of response interfer! 
although perhaps slightly more to the form 
indices. This finding led to a further explot 
tion among the different response in 
It was asked: Can the forgetting of 
associations be accounted for in terms | 
either emotional inhibition or response com 
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FORGETTING or WORD ASSOCIATIONS 


TABLE 2 
INTERCORRELATIONS AMONG THE 
RESPONSE VARIABLES* 


Mi a 

vaste Psa Pap [eaea ee 
Zero Recall +16] .49| .55 50 38 -28 
Mean GSR - -50| .28| .43| .45 -40 
Mean Reaction Time — | -36| .37) .58] .50 
“Emotional-Neutral” - 76 16 | —.04 
Mean Emotionality = 31 +09 
Population Variability — a 

Retest Variability Ti 


^ Correlations were by Pearson r, except for the use of point 
biserials for all correlations with the dichotomous “‘emotional- 
neutral” variable, 

b A positive correlation with the E-N variable means a positive 
relationship with the emotional end. 


petition, or did each variable account for 
different parts of the outcome? 

Whereas until now the findings have per- 
tained to the 34 individuals in the sample, the 
following analysis was performed on the 
responses to the set of 60 words in the stimulus 
list. Since the distributions of most of the 
variables were rather skewed, the scores for 
each variable were transformed to a normal 
distribution (cf. Edwards, 1954, pp. 107-111). 

Correlations among the variables are shown 
in Table 2.8 It is evident that Zero Recall was 
correlated with all variables except GSR. 

To explore these relationships further, some 
partial correlations were computed between 
Zero Recall and other variables, removing the 
effects of Population Variability and Retest 
Variability. Doing this, it was found that the 
relation between Zero Recall and Mean Emo- 
tionality, for example, was little changed. 
The partial correlations were .44 and .50, 
removing Population and Retest Variability, 
respectively. 

This result demonstrated the apparent 
existence of two independent determinants of 
forgetting. Tt led to a more systematic exami- 
nation of the correlational matrix—namely, a 
principal components factor analysis (cf. 
Hotelling, 1933). In the factor analysis, two 


#The following correlational analysis does not 
assume that the 60 words were a random sample of all 
Possible words, since the words were selected to repre- 
Sent different points on an assumed continuum of 
emotionality. For this reason, the statistical significance 

the correlations cannot be assessed. However, the 
correlations do have substantive significance, since they 
show the amount of variance in Zero Recall attributable 
to each variable. 
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TABLE 3 
Factor LOADINGS OF RESPONSE VARIABLES 


Variable 


Zero Recall 

Mean GSR 

Mean Reaction Time 
“Emotional-Neutral” 
Mean Emotionality 
Population Variability 
Retest Variability 


| 
Rebeaes 


main dimensions appear. Table 3 shows the 
results for Principal Axes I and II, and for 
the Rotated Axes I’ and II’. Factor I’ seems 
to represent the emotionality dimension, while 
Factor II’ shows response variability. The 
remaining three variables are located between 
these two dimensions. GSR and Reaction Time 
fall more toward the variability factor, while 
Zero Recall is clearly biased toward emotion- 
ality. 

Only two important factors were found. In 
locating the distribution of the remaining 
variance, a third factor was found most loaded 
with GSR (—.44) and Zero Recall (+.39), 
while all other loadings remained small. 

It seems clear that the emotionality and 
the variability measures were relatively in- 
dependent, while both were related to for- 
getting. What remains unknown is the nature 
of the relationships. The factor analysis does 
not justify the conclusion that forgetting is 
determined more by emotional factors, since 
it demonstrates only statistical and not causal 
connections. Yet statistical relations are a 
necessary condition for causal ones. 

The findings lead one to conclude that the 
forgetting of word associations was a function 
of two independent factors—the emotion- 
arousing potentiality of the stimulus words 
and their tendency to evoke mutually com- 
peting responses. This conclusion seems similar 
to one reported by DeLucia and Stagner (1954) 
from a study of word recognition. They noted 
that word recognition time was clearly affected 
by “... two sets of determinants: frequency 
of usage and emotion-arousing value” (p. 309). 

Before turning to a discussion of the results, 
some findings about frequency of word usage 
are presented. The frequency of usage of the 
“emotional” and “neutral” stimulus words 
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was equal with reference to the Thorndike- 
Lorge word count, as mentioned earlier. When 
the varying frequency of these words was 
related to their tendency to evoke Zero Recall 
associations, no difference was found between 
high and low frequency stimuli (by median 
test, x? = 0.27; not significant). 

Comparing the word frequencies of the 
Total Recall associations with those of the 
Zero Recall ones, it was found that the fre- 
quencies of the former were higher than those 
of the latter. It is possible, then, that the 
relatively greater unfamiliarity of certain 
associations may have played a part in Ss’ 
forgetting. Nevertheless, the difference be- 
tween the two kinds of associations was sub- 
stantively small: e.g., while 83% of the remem- 
bered associations occurred 50 or more times 
per million words in the Thorndike-Lorge 
count, over 70% of the forgotten associations 
fell in this same category. Thus, the frequency 
in the English language of the stimulus and 
response words did not appear to have an 
important effect on the occurrence of 
forgetting. 


Discussion 


Three different indices of the emotional 
import of the stimulus words were found to be 
significantly related to Ss’ failure to recall 
their associations, Such forgetting was unre- 
lated to the word frequency of the stimuli, 
and had little relation to the word frequency 
of the responses. What is the meaning of these 
results? 

The findings have been considered in terms 
of two different explanations, which both 
focus on the strength of the association be- 
tween stimulus and response. One is based on 
an hypothesis of response variability, which 
asserts that probability of recall depends on 
the slope of the response hierarchy, as inferred 
from the frequencies with which different 
responses have been associated to the stimulus 
in the past (cf. Osgood, 1953, pp. 550-551). 
The other explanation invokes the hypothesis 
that the more emotional the stimulus, the 
weaker the connection between it and the 
specific response by S in the association test. 
The validity of this hypothesis depends on the 
presumed existence of an “emotional inhibi- 
tion” process, operating at the moment when 
S makes the association. 


GEORGE LEVINGER AND JAMES CLARK 


The hypothesis of response variability bases 
its inferences on general population parame- 
ters, which suggest the probability of the 
occurrence and re-occurence of various stimu- 
lus-response connections. The hypothesis of 
emotional inhibition does not contradict the 


validity of that approach, but focuses on | 


more specific blocking or failures to recall, 
Since specific predictions are usually more 
erroneous than general ones, it is not surpris- 
ing that little evidence for the validity of the 
emotional inhibition hypothesis has yet been 
found. 

The present evidence leads us to propose 
that the two different explanations are supple- 


mentary, and not contradictory. Let us briefly | 


review the data. 
The finding that GSRs on the first presenta- 


tion were higher for subsequently forgotten | 


than for subsequently remembered associations 
is not central to this discussion. Actually 
GSR, assumed to be an index of emotional 
activation during the associative process, bore 
a higher relation to response variability than 
to stimulus emotionality. 

On the other hand, as seen in the correla- 
tional analysis, the stimuli regarded as emo- 
tional by Es and Ss evoked greater forgetting 
of associations than did those regarded as 


neutral. Furthermore, Zero Recall was found | 


to be independently correlated with these 
indices of emotionality and with those of 
response variability—the former correlations 
being higher than the latter. Although the 


nature of the data prevents the assessment of | 


their statistical significance, this exploratory 
analysis does raise certain questions for future 
studies, 

First, to what extent is forgetting deter- 


mined by emotionality? The results of this 


study merely demonstrate certain correlations 
between these variables. Yet, in the writers 
opinion, they lend some support to the hy- 
pothesis that emotional factors can determine 
forgetting. It seems unlikely that an “emo- 


tionless” theory of association can explain | 


these findings. 

Second, what is the relation between the 
emotional significance of stimulus words and 
the variability within response hierarchies? 
The present study found relatively low, but 
positive, correlations between these sets of 
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variables. Yet it remains possible that emo- 
tional words in general tend to elicit more 
competition among alternative responses than 
do unemotional ones. And if this were so, 
why? The answer to such a question does not 
necessarily lie within the frame of a psycho- 
analytic repression model. On the other hand, 
it does not appear to be provided by more 
systematically supported theories of memory 
and forgetting. 


SUMMARY 


The study aimed to demonstrate that the 
forgetting of word associations is related posi- 
tively to their emotional significance. 

Thirty-four Ss were administered a list of 
stimulus words, containing half “emotional” 
and half “neutral” words. Ss gave free asso- 
ciations to the words, while measures of gal- 
vanic skin response and reaction time were 
obtained. Immediately thereafter, the words 
were presented again in the same order, and 
Ss tried to recall their previous associations. 
Four months later, Ss returned and again 
gave free associations to the same stimuli, 
and also rated the emotionality of each stimu- 
lus word. 

It was found that Ss showed significantly 
higher GSRs while giving associations which 
were later forgotten than while giving those 
which were later remembered. Also, stimulus 
words rated highly emotional by (a) the Es, 
and (b) the Ss, were far more likely to elicit 
associations which were later forgotten than 
were those stimuli rated low on emotionality. 

It was also found that stimulus words which 
evoked high variability in responses (a) among 
different Ss, and (6) between tests for the 
same Ss, were more likely to have their asso- 
ciations forgotten than those which had low 
variability in responses. 

In a factor analysis of the correlations among 
different variables, it was discovered that 
emotionality and response variability were 
independently related to forgetting. It was 
Proposed that the forgetting of word associa- 
tions is a function of both emotional and non- 
emotional determinants. 
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RESPONSIVENESS OF CHRONIC SCHIZOPHRENICS AND NORMALS 
TO SOCIAL AND NONSOCIAL FEEDBACK: 


WALTER J. JOHANNSEN? 
Veterans Administration Hospital, Wood, Wisconsin 


URING the last 2 decades, psychological 
research interest in schizophrenia has 
more and more frequently involved 

a study of social factors related to the syn- 
drome. Much of this interest has stemmed 
from theoretical issues raised by Cameron 
and by Sullivan. Both of these theorists viewed 
schizophrenia as a disorder arising out of 
faulty social adjustment. This point of view 
has received considerable support: Studies 
dealing with this “social dysfunction hy- 
pothesis” (e.g., Affleck, 1954; Davis & Harring- 
ton, 1957; Flavell, 1956; Whiteman, 1954) 
have shown that the functions of motivation, 
concept formation, and learning in schizo- 
phrenia are disrupted by social features of 
test stimuli or presentation. 

Despite the growing mass of data accumu- 
lated in support of the Cameron-Sullivan 
theories, a contention basic to both their 
arguments demands resolution. This funda- 
mental assertion holds that schizophrenics 
display impairment in the ability to codify 
information about their behavior when that 
information derives from an external, social 
source. This statement underlies both Sulli- 
van’s notions of consensual validation and 
Cameron’s role playing theory. Establishing a 
schizophrenic deficit in the ability to respond 
to social feedback? would be consistent with 
the Cameron-Sullivan position and would 
Strongly affirm the utility of the social dys- 
function hypothesis. 


‘This research is abridged from a portion of a 
doctoral dissertation submitted to the University of 
Wisconsin, 

*The author is indebted to his major professor, 
Richard M. Lundy, and to L. Benjamin Wyckoff, 
Samuel H. Friedman, and Carl R, Rogers for assistance 
and criticism during various stages in the completion 
of this project. 

“Feedback” is taken to mean the signaling of 
information about performance to the subject. “Social 
feedback” is such information, the immediate source of 
which is another human individual. “Nonsocial feed- 
back” is information whose source is other than 
another human individual. 


The following study was designed to test 
four hypotheses: 

1. Schizophrenics are significantly poorer 
than normals in modifying behavior on a 
learning task where solution requires reception 
and codification of social feedback. | 

2. Among schizophrenics, impairment is- 
significantly less when the source of feedback | 
is nonsocial than when the source is social, d 
This prediction follows from the contention 
of the social theorists that social interaction) 
is the disruptive element in schizophrenic 
performance. Where interaction is reduced, 
performance approaches the level established 
by normal controls. | 

3. Paranoid schizophrenics function more) 
adequately on this task than nonparanoid 
schizophrenics but not as well as normals, 
This prediction derives from Sullivan’s asser- 
tion that paranoid schizophrenics’ communi- 
cative impairment is less severe and genetically | 
later in origin than that of the other sub- 
groups. 

4. Normal controls, who do not show social 
maladjustment tendencies peculiar to schizo l 
phrenics, perform equally well under social . 
andnonsocial feedback conditions. | 


METHOD 

Subjects 

A total of 125 subjects were tested. Twenty-four 
nonparanoid schizophrenics served under social fi 
back conditions and 24 under nonsocial feedback 
conditions. The same number of normals served in 
equivalent feedback conditions, Fifteen pai ee | 
schizophrenics served in the social feedback condition, 
14 in the nonsocial condition. The number of subjects | 
chosen within each diagnostic category was established 
in order to maintain approximate proportionality of 
sample size to the size of the parent population. | 

Schizophrenic subjects were selected from among 
the patients of the Tomah Veterans Administration | 
hospital. Normal controls were chosen from the 
patients of the general medical service of the Wood | 
Veterans Administration center. P 4 

Schizophrenic subjects were classified on the basis 
of current psychiatric diagnosis. This population was 
then screened to eliminate individuals deemed unsuit- | 
able for the study. Patients were excluded for thé 
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following reasons: age under 20 or over 59, total hos- 
pitalization of less than a full year, psychosurgery, and 
insulin or electroshock therapy during the preceding 
month, Patients whose case histories indicated a 
change in diagnosis from paranoid to nonparanoid 
schizophrenia were dropped from consideration. Finally, 
in order to avoid confounding of functional psychosis 
with established organic conditions, patients bearing 
complicating neurological diagnoses were eliminated. 
A total of 271 nonparanoid schizophrenics and 143 
paranoid schizophrenics remained available. 

From this subject pool, 48 nonparanoid schizo- 
phrenics were randomly selected and assigned to one 
or the other of the two feedback conditions. Two 
paranoid schizophrenic groups were then selected, 
equated with the nonparanoid schizophrenics on age, 
years of hospitalization, and educational attainment. 
Normal subjects were selected to provide groups 
equated with the schizophrenic groups on age and 
length of education, Adequate control of the hospitali- 
zation variable proved to be impossible. To maintain 
consistency of selection, patients bearing neurological 
or psychiatric diagnoses were excluded from the 
control group. A summary of the pertinent actuarial 
characteristics is presented in Table 1. 


Task and Apparatus 


The task presented to all subjects was the 
learning of a repetitive double alternation pattern 
(RRLLRRLL), Two hundred trials were given all 
subjects. Each trial required the subject to respond 
by pressing one of two buttons. This type of task 
appeared ideal for a study of this nature, since condi- 
tions could be manipulated so that the solution required 
codification of externally manipulated feedback infor- 
mation. The subject could not evaluate the correctness 
of response without attention to the feedback signal. 

Subjects were seated at a desk with an unpainted 
6” X 10” wooden panel placed flat on the desk surface 
approximately 10” in front of them. On this response 
board were mounted two doorbell buttons, one 2” to 
the right and the other 2” to the left of center. 

In the social feedback condition, the experimenter 
Sat beside the desk and at a 90 degree angle to the 
subject. Response buttons were wired to two 25-watt 
light bulbs mounted on a panel placed inconspicuously 
on the floor next to the experimenter. These lights 
Served merely to signal to the experimenter which of 
the two buttons had been pressed. 

In the nonsocial feedback condition, the response 


board was placed as in the social feedback condition. 
A panel, 24” X 40”, was mounted vertically behind it. 
This feedback panel was so situated that the subject 
could maintain his gaze on it while responding. Centered 
near the top were two 25-watt bulbs separated verti- 
cally by 5”. To increase discriminability between the 
feedback signals, the top bulb was red and the bottom 
bulb white, 

These lights were the immediate source of feedback 
and were controlled by the experimenter, seated in an 
adjoining room. The subject’s response buttons were 
wired to two bulbs mounted on a panel situated in 
front of the experimenter; from this position he signaled 
information to the subject’s feedback panel. 


Procedure and Instructions 


Subjects were run individually. Prior to the experi- 
mental session, each subject was engaged in conversa- 
tion for about 5 minutes in order to secure rapport. 
Instructions then given were designed to avoid the 
establishment of an atmosphere related to personal 
achievement. All subjects were initially told the 
following: 

All right, we're going to play a game now. You 
see these buttons in front of you? All I want you to 
do is press them, one at a time, and in any order 
you want. Before we get into the game, why don’t 
you go ahead and press them a few dozen times, 
just to get the idea of it. Remember now—one at a 
time and in any order you want. Do it slowly. 
Subject was then encouraged to respond 40 times 

before the experimenter called a halt. No feedback was 
given during this sequence. The procedure was meant 
to accustom the subject to the task, to establish a 
slow speed of response, and to provide a means of 
noting possible stable response tendencies, This 
record of response tendencies was used in data analysis, 

After completion of the free responding sequence 
all subjects were given the following basic instructions: 

Good. You’ve got the idea. Now in this game 
we're going to play, sometimes one of these buttons is 
going to be the correct one to press and sometimes 
the other button is going to be the correct one to 
press. This is the way we're going to do it, 

At this point, subjects serving in the social feedback 
condition were told: 

From now on I will tell you after each time 
whether you pressed the right one or the 
one. If you pressed the correct button, I’ll say 
“fight”; if you pressed the wrong one I'll say 


TABLE 1 
Actuarial CHARACTERISTICS OF DIAGNOSTIC GROUPS 
italizati i bul 
en) accents mean) Lancome 
Group thpt rtada Sitecat A 
u SD Mu SD M SD M sD 
Nonparanoid Schizophrenics 38.29 6.38 9.67 3.68 10.58 2.69 25.54 5.85 
Paranoid Schizophrenics 38.03 6.87 7. 2. 10.73 1.91 24.97 7.44 
Normals 38.37 6.46 — — 9.65 2.90 25.24 7.75 
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“wrong.” Each time you press I will tell you whether 
you were right or wrong. Okay? Remember now, 
sometimes one of these buttons is going to be the 
right one to press and sometimes the other button is 
going to be the right one to press. Try to get as 
many correct as you can. Okay? Go ahead. 

Throughout the experimental series, the experi- 
menter, seated close by the subject and in his full 
view, responded either “right” or “wrong” in a level 
tone of voice. Responses were recorded by the experi- 
menter, 

In the nonsocial feedback condition, subjects were 
given these instructions: 

T’m going to leave you alone now. From now on, 
these lights will tell you after each time whether you 
pressed the right one or the wrong one. If you 
pressed the correct button, this top light, the red 
one, will go on. But if you press the wrong button, 
this light, the bottom one, will go on. Each time you 
press one of these buttons one of these lights will go 
on. It will tell you whether you were right [experi- 
menter points to top light] or whether you were 
wrong [points to bottom light]. Remember now, 
sometimes one of these buttons is going to be the 
correct one to press, and sometimes the other is 
going to be the correct one to press. Try to get as 
many correct as you can. Okay? (experimenter, 
leaving the room] Go ahead. 

Subjects who were unable to follow directions were 
immediately eliminated from the study. In addition, a 
few subjects refused to participate. A total of 16 
subjects were dropped for these reasons (untestable: 
8 nonparanoid schizophrenics, 2 paranoid schizo- 
phrenics, 0 normals; refused: 1 nonparanoid schizo- 
phrenic, 5 paranoid schizophrenics, 0 normals). It is 
unlikely that these factors had an appreciable biasing 
effect on the results obtained, since the frequencies 
involved were small. Substitutions were made on the 
basis of similarity in pertinent actuarial characteristics. 

After the completion of 200 trials, the experimenter 
returned to the subject and, to establish validity of the 
data obtained, asked subject which of the two feedback 
lights signaled a correct response. Then, to obtain a 
control for intelligence, all subjects were administered 
the vocabulary scale of the Shipley-Hartford intelli- 
gence test. Since research has frequently shown vocabu- 
lary is little affected by psychosis (Babcock, 1930; 
Kendig & Richmond, 1940; Shakow, 1946) this test 
seemed to offer an adequate estimate of premorbid 
intelligence. 


RESULTS 
Evaluation of Sources of Bias 


Prior to analysis of the experimental data, 
the contribution of extraneous factors to the 
obtained results was estimated. As an initial 
step, analyses of variance were performed to 
determine whether groups were adequately 
equated on the actuarial variables of age, 
education, and intelligence. None of the re- 
sulting F ratios achieved significance; hence, 
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it may be assumed that these three variables 
did not contribute significantly to the experi- 
mental results. 

Since it was impossible to equate normals 
and schizophrenic groups on total length of 
hospitalization, it was necessary to determine 
whether this factor contributed a bias to the 
experimental results. Correlation analysis 
demonstrated that the degree of relationship 
between length of hospitalization and task 
performance was negligible. Thus, group 
differences in length of hospitalization did not 
appear to affect significantly any differences 
in experimental performance. 

Another source of bias was the effect of 
tranquilizing medication on task score. Since 
nearly 80% of the schizophrenic subjects were 
receiving such treatment, the effect of tran- 
quilizers on performance required evaluation, 
Those subjects not receiving drugs were 
matched with subjects within the same experi- 
mental cell who were receiving medication 
and whose actuarial characteristics were most 
similar to theirs. Within each cell, these tran- 
quilizer and nontranquilizer groups were 
dichotomized into frequencies scoring above 
and below the mean task score for that cell. 
If medication affected task performance, differ- 
ences would be expected between the tran- 
quilizer and nontranquilizer groups in respect 
to distribution about the cell mean. Frequen- 
cies were accumulated from all six cells, and 
the resulting 2 X 2 contingency table com: 
pared by means of the Yates-corrected chi 
square test. The obtained value of 1.02 falls 
far short of significance. It, thus, appears 
that presence of tranquilizing medication did 
not act significantly to bias the results of the 
experiment. 

One other source of bias required evaluation. © 
It was thought that some subjects would 
approach the task with stable response patter 
sets. A bias toward an RRLL pattern would © 
tend to produce a higher score than an RLR 
bias. Inspection of the free responding series — 
data indicated, however, a virtual identity of 
group means in regard to the number of | 
alternations. Preliminary response sets con- i 


tributed little bias to the experimental results: 4 


Analysis of Experimental Results | 
The mean number correct responses was 
calculated for each experimental condition, — 


| 


| 
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and the results are presented graphically in 
Figure 1. These data reveal a striking interac- 
tion. The normal controls show a markedly 
better mean performance under social feed- 
back conditions (161.25) than under nonsocial 
feedback conditions (129.00). This pattern is 
reversed for the nonparanoid schizophrenics 
who perform more adequately under nonsocial 
feedback conditions (127.54) than under 
social feedback conditions (108.00). Paranoid 
schizophrenics show an equivalence of per- 
formance under social (130.47) and nonsocial 
(123.64) conditions. It is noteworthy that 
performance of the three diagnostic groups is 
almost identical under nonsocial feedback 
conditions. 

Extreme skewedness of cell distributions 
made obligatory the use of different, relatively 
independent techniques for evaluating the 
Statistical significance of these results. Since 
Norton (1953) demonstrated that the F test 
is highly insensitive to deviations from normal 
distribution, the analysis of variance was 
used as the basic test of the data, with signifi- 
cance level adjusted to guard against unwar- 
ranted rejection of the null hypothesis. In 
this analysis, one score was randomly elimi- 
nated from the paranoid schizophrenic social 
feedback condition in order to achieve pro- 
portionality of cell frequencies. The summary 


(40 PARANOID 
SCHIZOPHRENICS 

130 

120 NONPARANOID 


SCHIZOPHRENICS 


NUMBER OF CORRECT RESPONSES 


NONSOCIAL 


SOCIAL 


FEEDBACK CONDITIONS 


Fic. 1. Mean number correct responses for diag- 
nostic groups as a function of feedback condition. 
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of the analysis of variance is contained in 
Table 2. 

The results indicate that significant differ- 
ences exist among diagnostic groups with 
respect to their responsiveness to feedback, 
with normals most adequate, nonparanoid 
schizophrenics least adequate, and paranoid 
schizophrenics performing in an intermediate 
position. This result is in accord with the 
hypothesis. 

The significant group X feedback interac- 
tion indicates that the three diagnostic groups 
perform in a different fashion with respect to 
the nature of the feedback source. This result 
generally accords with the hypothesis, except 
that the differentially better performance of 
normals under social feedback conditions was 
unanticipated. 

Two chi square analyses were then per- 
formed to supplement the analysis of variance. 
The results are summarized in Table 3. In the 
first of these analyses an overall mean (number 
of correct responses) was obtained, pooling 
the data obtained from all subjects. Each cell 
was then dichotomized with respect to this 
mean. The frequencies of subjects falling 
above and below this value were obtained. 
These data were then subjected to various 
comparisons. 

It is apparent from this analysis that con- 
clusions derived from the analysis of variance 
are substantiated. Specifically, the three 
diagnostic groups show significant differences 
in performance under social feedback condi- 
tions but not under nonsocial feedback con- 
ditions. Nonparanoid schizophrenics perform 
significantly better under nonsocial than 
under social feedback conditions; however, 
the reverse is true for normals. Paranoid 
schizophrenics perform equivalently under 


TABLE 2 
SUMMARY OF ANALYsIs OF VARIANCE FOR NUMBER 
OF CORRECT RESPONSES AS A FUNCTION OF 
Group AND FEEDBACK CONDITION 


Source df MS F 
Diagnostic Groups 2 9912.52 6.52* 
Feedback 1 1329.24 — 
Groups X Feedback 2 7300.03 4.79* 
Error 118 1522.90 

Total 123 


* Significant beyond the ,025 level. 
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TABLE 3 
Summary OF Cat SQUARE ANALYSES COMPARING 
F. 


Overatt MEAN (Column A) AND FREQUENCIES 
oF SoLvERs AND Nonsotvers (Column B) 


Comparison ieme 
(A) @) 

Overall oa 22.33%*  25.43** 
Nonparanoid Schizophrenics 

Seis whe eet 3.99* 5.73* 
Parano; i enics 

Social ie elena .01 .01 
Normals 

Social vs. Nonsocial 6.84**  10.10** 
Social Condition 

All groups 19.22%% = 21.08% 
Nonsocial Condition 

All groups -18 -80 


* Significant beyond the .05 level. 
** Significant beyond the .01 level. 


either condition. Thus, all hypotheses were 
again borne out with the single exception of 
that asserting a similar mode of responding 
for normals irrespective of feedback source. 

A second set of chi square comparisons was 
then performed. Results are summarized in 
Column B of Table 3. This analysis was de- 
signed to determine whether the number of 
subjects achieving solution differed from cell 
to cell. The criterion of correct solution was 
defined as errorless performance during the 
terminal 40 trials. Inspection of results shows 
that this analysis is entirely consistent with 
the previous analyses. Conclusions based on 
analysis of the response data apply with equal 
force here. 

One final matter requires brief considera- 
tion. An inspection of protocols revealed a 
large number of consecutive RL or LR alterna- 
tions among nonparanoid schizophrenic sub- 
jects. This form of stereotypy was relatively 
less frequent among paranoid schizophrenics 
and even less in evidence among normals. 
The mean frequencies of RL or LR alterna- 
tions within each condition are reported in 
Table 4. x 

While errorless performance on this task 
entails 100 such alternations, scores much in 
excess of this value may be considered indica- 
tive of inefficient perseverative behavior. An 
analysis of variance was performed on alterna- 
tion scores to determine whether significant 
differences existed among groups. As shown 
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TABLE 4 


MEAN NUMBER SINGLE ALTERNATIONS FOR 
EacH CONDITION 


Condition No. Alternations 

Nonparanoid Schizophrenics 

Social 151.96 

Nonsocial 115.96 
Paranoid Schizophrenics 

Social 119.35 

Nonsocial 127.07 
Normals 

Social 106.00 

Nonsocial 112.75 

TABLE 5 


ANALYSIS OF VARIANCE FOR THE NUMBER OF 
ALTERNATIONS IN 200 TRIALS 


Source a MS PF 
efi ic Grou 2 7266.11 5.48" 
Feedback d 1 5597.96 4.22 
Groups X Feedback 2 5476.69 4.13* 
Error 118 1325.47 

Total 123 


* Significant beyond the .05 level. 


in Table 5, the three diagnostic groups differed 
significantly in the mean number of alterna- 
tions, with nonparanoid schizophrenics show- 
ing most and normals least perseveration. The 
significant interaction term indicates that the 
three groups responded differently to the 
feedback conditions. Nonparanoid schizo- 
phrenics gave more alternations under social ; 
feedback conditions than under nonsocial 
feedback conditions; paranoid schizophrenic 
and normals alternated more frequently undet | 
| 
| 


nonsocial than social feedback conditions. 


Discussion 
The hypotheses put forth in this study wer 


in general supported. In a learning task wi 
adequate performance dependent on codifica- 
tion of feedback derived from an outside | 
source, schizophrenics achieved significantly 
poorer scores than normals. Paranoid schizo- 
phrenics proved to be less impaired than not- 
paranoid schizophrenics. As predicted, schizo- 
phrenics performed as well as normals under 
nonsocial feedback conditions, but were signif- 
cantly poorer under social feedback conditions: | 
It was also demonstrated, in accordance wit? — 
the hypothesis, that paranoid schizophrenl@ | 
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achieve a level of performance intermediate 
between that of nonparanoid schizophrenics 
and normals. The single hypothesis which was 
not borne out dealt with an anticipated equiv- 
alence of response level of normals under 
social and nonsocial feedback conditions. 
Normals actually showed significantly better 
performance under social than nonsocial 
feedback conditions. 

These results have implications for several 
areas of research. On the broadest level of 
generality, the finding that the schizophrenic 
is able to function on a normal level in the 
presence of nonsocial feedback bears directly 
on the problem of schizophrenic intellectual 
deficit. The present data lend considerable 
support to those theorists who contend that 
this deficit is a matter of ability rather than 
capacity and that, as a consequence, it is 
reversible (Hunt & Cofer, 1944; Kendig & 
Richmond, 1940; Shakow, 1946). Conversely, 
there is no support for the position that schizo- 
phrenic deficit is similar to brain damage, 
entailing loss of capacity to learn or conceptu- 
alize. 

From the viewpoint of intellectual function, 
it is noteworthy that schizophrenic deficit in 
the present study was most severe under 
social feedback conditions. It is well known 
that attitudinal and motivational factors are 
sensitive to interpersonal influence, and clinical 
judgment suggests that schizophrenics are 
particularly attuned to nuances of social 
interaction. As Wegrocki (1940) says, “The 
test situation need not represent, then, for the 
schizophrene, simply a trial ground of his 
intellectual prowess, but may be a situation 
fraught with affective innuendoes of which the 
experimenter is totally unaware” (p. 63). 

It is tempting to speculate whether the 
traditional schizophrenic intellectual dysfunc- 
tion is not merely attributable to disruption 
deriving from the presence of an examiner in 
the testing room. Would the performance of 
schizophrenics on circular mazes or serial or 
motor learning tasks show the usual decrement 
if testing were conducted with the experimenter 
absent from the room? In support of this 
conjecture it should be noted that where 
explanations of deficit are attempted, these 
attempts are usually couched in motivational 
terms (Malmo, Shagass, & Smith, 1951; 
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Rickers-Ovsiankina, 1937; Ricksher, 1909; 
Wegrocki, 1940). Moreover, where intellectual 
decrement is most severe, the tasks used are 
most often those requiring prolonged attention 
and interest (Shakow, 1946). 

The nature of schizophrenic functioning 
under social feedback conditions introduces a 
further, related question: What aspect of the 
social feedback situation led to the extreme 
degree of disruption noted? One fruitful avenue 
of speculation is suggested by Garmezy’s 
study (1952). He noted that the use of nega- 
tive evaluation after incorrect response in a 
discrimination task led to increased adequacy 
of performance in normals but no change in 
response among schizophrenics. He argued 
that a fear of censure was operating differen- 
tially to inhibit schizophrenic performance 
relative to normals, This interpretation is 
consistent with the Cameron position and 
observations made during the administration 
of the present study. Cameron states (1947, 
p. 486) that schizophrenia develops out of a 
reaction sensitivity to instances of failure.‘ 

Mednick’s recent formulation of the develop- 
ment of schizophrenia (1958) offers a conven- 
ient explanation for the observed phenomena 
when a few additional assumptions are made. 
Mednick contends that schizophrenia is a 
response to anxiety. Anxiety is conceptualized 
as a drive state and incipient schizophrenics 
as individuals continually functioning under 
conditions of anxiety. High drive acts to 
increase the response strength of any habit 
that may be aroused in a given situation as well 
as producing heightened stimulus generaliza- 
tion (a concept similar to Cameron’s “over- 
inclusion”). A process is then initiated in 
which anxiety and generalization mutually 
augment each other. The pattern eventuates 
in responses which are highly irrelevant and 
inappropriate. The ensuing process of chronic- 
ity which takes place is described as one which 


4 Mention must be made of an alternative explana- 
tion deriving from an artifact of the experimental 
situation, namely, the confounding of feedback with 
sense modality, That is, social feedback was presented 
in an auditory fashion and nonsocial feedback visually. 
Differential sensitivity of schizophrenics to one or the 
other mode of presentation may be sufficient to explain 
the results. However, since the literature does not 
demonstrate differential sensory sensitivity in schizo- 
phrenics, this explanation is not supported. 
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involves increasing use of irrelevancy as a 
method of escaping anxiety. 

In explaining the current results, it is as- 
sumed that social feedback arouses anticipa- 
tion of censure and consequently increased 
drive level among schizophrenics. Nonsocial 
feedback conditions, however, involve no 
such change in drive level. It is also assumed 
that normals respond to increased drive in 
problem solving situations in a manner defined 
as adaptive, but that schizophrenics respond 
in a manner that involves retreat into irrele- 
vancies. That is, confronted with a fear of 
censure, the normal responds with an attempt 
to achieve an adequate solution by concentrat- 
ing, focusing attention, formulating hypoth- 
eses, etc. The schizophrenic, especially the 
nonparanoid schizophrenic, responds by with- 
drawing from the requirements of the task, 
i.e., random responding or stereotypy. 

The equivalence of paranoid schizophrenic 
performance under the two feedback condi- 
tions is in some ways puzzling. One possible 
explanation is that paranoid schizophrenics 
share in the basic interpersonal difficulty 
of schizophrenics in general, operative under 
social feedback conditions, but that their 
ability to profit from feedback is less impaired 
than is the case among other schizophrenics. 
While this explanation is plausible, there are 
alternatives. For instance, it may be that the 
paranoid schizophrenic relationship to others 
is one that involves a dehumanization of 
others; thus, perception of a social feedback 
source is not different for them from their 
perception of any other feedback source. 
Another possibility is that the interpersonal 
disorder of the paranoid schizophrenic is of 
the same nature and severity as that of other 
schizophrenics, but that a degree of attentive- 
ness to social stimuli is maintained which is 
hot matched by other schizophrenics. The 
present data do not allow a choice among these 
alternatives; further research is needed in this 
area to clarify the point raised. 

The nonparanoid schizophrenic predilection 
for the single alternation tendency deserves 
consideration. Statistical analysis which dem- 
onstrated differences among diagnostic 
groups cannot indicate whether the prevalence 
of single alternations prevents the achievement 
of solution, or whether inability to arrive at 


WALTER J. JOHANNSEN 


solution results in prevalence of single alterna- 
tions. However, the subject who cannot arrive 
at a solution has few alternatives available to 
him. He may withdraw entirely from the task 
by refusing to respond or insisting on the mean- 
inglessness of it. If, however, he does respond, 
he may continually modify his preliminary 
response disposition in accordance with feed- 
back provided him in order to achieve a solu- 
tion. On the other hand, he may respond ina 
random pattern or in a stereotyped manner. 
Several studies (Fey, 1951; Kerner, 1956; 
Peters, 
sterotypy among schizophrenics under condi- 
tions of stress. These findings are consistent 


with the interpretation that nonparanoid | 
schizophrenics, functioning in an atmosphere | 


involving fear of censure, tend to respond ina 
stereotyped fashion. This mode of response 
interferes with the development of more 
adaptive and successful patterns of response. 


SUMMARY 


This study was designed to test the hy- 
pothesis that schizophrenics’ ability to learn 
under conditions of social feedback is impaired. | 
Subjects were given the task of learning a re- 
peated double alternation pattern under either 
social feedback (experimenter present in the ex- 
perimental room, signaling feedback verbally) _ 
or nonsocial feedback (experimenter absent 
from the room; feedback signaled by lights). 


A total of 125 subjects (48 nonparanoid schizo- } 


phrenics, 29 paranoid schizophrenics, 48 
normals) were tested. The groups were equat 
for age, intelligence, and education. 

Results were generally as predicted. Nor- 
mals showed the highest level of attainment, 
nonparanoid schizophrenics the lowest, an! 
paranoid schizophrenics an intermediate de- 
gree. The most noteworthy finding was that 
normals performed significantly better undet 


social feedback conditions than under nor — 


social feedback conditions while nonparanoid _ 
schizophrenics were significantly poorer unde | 
social than under nonsocial feedback condi 
tions. All three diagnostic groups performed 
equally well under nonsocial feedback con- — 
ditions. 

Discussion centered about the question of 
nonparanoid schizophrenic performance undef 
the two feedback conditions and its differen 


| 


1953; Wulfeck, 1941) have noted | 


| 
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from the normals’ mode of responding. An 
explanation was based on an hypothesis deal- 
ing with differential response to a fear of 
censure. 
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INTERPERSONAL DECISION MAKING: 
THE RESOLUTION OF A DYADIC CONFLICT 


JOHN M. ATTHOWE, JR. 
University of Alabama 


NTERPERSONAL behavior is regarded as an 
important determinant of personality as 
well as of social institutions. Interaction 

is assumed to result in the socialization of the 
child (Sears, 1951), the development of both 
neurotic and normal behavior (Fromm, 1941; 
Horney, 1945; Murray, 1951; Sullivan, 1947), 
success or failure in psychotherapy (Fromm- 
Reichmann, 1949; Rogers, 1951), and emergent 
social systems (Parsons & Shils, 1951). If 
interpersonal influence can lead to both posi- 
tive and negative consequences, we must be 
able to specify the conditions that give rise to 
the one or the other. Do these conditions arise 
within the interaction sequence, or are they 
logically prior to it? 

Sears (1951) has suggested an operational 
basis for describing interpersonal influence 
both as an ongoing process and as an in- 
stigating condition. In the present investiga- 
tion, Sear’s dyadic paradigm provides the 
context for the study of the reciprocal inter- 
action of two subjects faced with a decision 
conflict. 

Conflict resolution was studied in a co- 
operative interpersonal setting. Two questions 
were posed: How do individuals arrive at 
joint decisions when they are faced with 
conflicting courses of action, and what condi- 
tions lead to more efficient or to less efficient 
interpersonal decisions? The time required to 
make a decision and its correctness, or ration- 
ality, were used as measures of efficiency. 

Subjects who had previously served in an 
individual decision making experiment were 
retested in pairs. The dyadic task was identical 
with the individual task. Subjects were 
instructed to select the “better” of two 
alternatives; the only difference in the two 
experiments was the requirement that the 
dyadic choice be a joint (i.e., mutual) decision. 

Three classes of antecedent conditions were 


+The author wishes to thank P. S. Siegel, P. R. 
Farnsworth, R. R. Sears, K. Little, and L. Festinger 
for reading the original draft and for their criticisms, 
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related to the efficiency of dyadic decisions, 
These were: the type of choice situation or 
conflict, differences in the way the partners 
individually resolved decision conflicts, and | 
the nature and amount of interaction. 

Three types of conflict were defined; each 
consisted of an approach (ap) and an avoidance 
(av) dominant subtype. A conflict type was | 
designated by the following ratio? described 
more fully elsewhere (Atthowe, 1960): 


ap — av 
ap + av 


Ap and av represent the value of the expected | 
outcomes, ap representing the possibility of 
gaining money and ay that of losing. As the 
values of the ap and av outcomes near equality, | 
the ratio approaches zero, a limit defined as 
the point of maximum risk and approximated 
by the ambivalent conflict. When either the 
ap or av outcomes dominate, the ratio nears 
the limit +1.00. Both of these limits designate 
minimal degrees of risk represented by the 
univalent conflict. A moderate degree of risk 
was represented by the disvalent conflict, 
which entails an additional feature: subjects ’ 
had to reconcile the possibility of gambling 
(Alternative A, Table 1) with not gambling 
(Alternative B) in making their choice. These 
six situations of conflict are specified in Ta- 
ble 1. 

The second class of antecedents conceris 
individual differences. The way subjects 
resolved similar decision conflicts on & 
individual basis was determined from a prio! 
experiment (Atthowe, 1960). Subjects wi 
different decision times and different degrees 
of nonrational choices were paired. The results 
were dyads representing different degrees of 
discrepancies in the way partners resolvi 
decisions. 

The third class of antecedent conditions 
might be better labeled emerging conditions: 

ae oe of 

? This ratio is a m of di ating types 
conflict along a Giese dicen Athma Io | 
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TABLE 1 
TYPES or CONFLICT 


(Ap subtypes are presented to the left and av 
subtypes to the right) 


A B A B 


Type I. Univalence, minimal risk 


a win 35¢ | win 30¢ | a lose 34¢ | lose 30¢ 
b win 26¢ | win x¢ | b lose 25¢ | lose xé 
Type II. Disvalence 
a win 35¢ | win 5¢ | a win 26¢ | lose 5¢ 
blose x¢ | win 5¢ | blose x¢ | lose 5¢ 


Type III. Ambivalence, maximal risk 


win 17¢ 
lose x¢ 


a win 25¢ 


a win 34¢ 
b lose 34¢ 


win 26¢ 
b lose 25¢ 


lose x¢ 


Note.—A and B are alternative choices; a and b represent 
possible outcomes. The probability of occurrence of any a or b 
was equal to one-half. x represents the unique value producing 
indifference of choice between A and B. 


As the partners interact, differing interpersonal 
relationships emerge. These relations, three of 
which were defined prior to the experiment, 
were assumed to be logically independent of 
the Measures of efficiency: (a) the type of 
influence relationship (i.e., leader-follower); 
(6) the orientation, or cohesiveness, of the 
dyad; and (c) disagreement between partners. 

The influence relationship was defined by 
X+Y 
tively, represent the absolute difference 
between the individual and dyadic choices of 
each partner. If the joint decisions were more 
like the individual choices of one of the 
partners (e.g., X), then X was considered the 
leader. A ratio greater than .50 (i.e, a large 
change for one partner and a small change for 
the other) designated a strong leader-follower 
telationship. A ratio between .21 and .50 
designated a weak relationship, and .20 or less 
designated a compromise relationship. Differ- 
ences were calculated at the point of conflict 
in each of the six choice situations and summed. 
Conflict was defined as occurring when the 
alternatives were reacted to as equal in value. 

The cohesiveness or orientation of the dyad 
was defined by the number of “we” and “I” 
remarks. If both partners verbalized “we” 
more often than “I,” the dyad was designated 
group oriented. If they both verbalized “I” 


the following ratio: . X and Y, respec- 
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more often, the dyad was termed self-oriented. 
If the partners differed in their dominant 
remarks, the dyad was designated imbalanced 
(cf. Heider, 1946). The number of orientation 
remarks and disagreements were also recorded, 


HYPOTHESES 


It has been shown that different types of 
conflict yield different decision times and 
degrees of nonrationality in the individual 
choice situation (Atthowe, 1960). Would the 
same relations be found in the dyadic choice 
situation? Which of the conflict types yield 
more efficient and which yield less efficien' 
dyadic decisions? ; 

Individual differences undoubtedly play a 
role in determining the efficiency of joint 
decisions. Would the differences between 
partners in their typical decision speeds and 
their expressed nonrationality effect their 
joint decisions? If so, what discrepancies yield 
more efficient and which yield less efficient 
dyadic decisions? 

When pairs of individuals reciprocally 
interact, different interpersonal relations are 
bound to emerge. Furthermore, different 
interpersonal relations would be expected to 
be associated with different joint decisions. 
How, then, would the efficiency of joint 
decisions be affected by different interpersonal 
relationships? What forms of interaction are 
found with more efficient decisions and which 
are found with less efficient decisions? 


METHOD 


The Task 


The experimental task required subjects to choose 
the better of two alternative wagers. Each wager 
contained two possible monetary outcomes; the prob- 
ability of each outcome was .5 and was assumed to be 
dependent upon the roll of an “unbiased” die. To 
reduce the immediate effects of reinforcement, no 
money was dispensed during the game. Furthermore, 
subjects were never told whether their choices were 
correct or not. The instructions emphasized the 
reasoning nature of the task, The wagers were said to 
be problems taken from the mathematical reasoning 
section of an advanced intelligence test and arranged 
as wagers. Partners were required to reach agreement 
before the dyadic decision could be specified. 


Subjects 
Fourteen pairs or dyads of male subjects were used. 
Each dyad was comprised of at least one subject whose 
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previous choices were nonrational. As a control measure, 
two additional dyads were tested; each was composed 
of two completely rational subjects. 


Procedure 


The wagers, printed on 3 X 5 cards, were presented 
as a collection of reasoning problems. The point of 
indifference, or conflict, was determined in a random 
fashion by the method of constant stimuli. All of the 
outcomes were held constant except one. This outcome 
(x in Table 1) was varied until the two wagers were 
seen as indifferent. The indifference point was defined 
as one-half the interval comprising the upper and 
lower limens. 

The absolute values and the relative differences of 
the outcomes generally were held constant. The actual 
monetary values in the dyadic experiment differed from 
those in the individual setting by one to two cents. 
Except for the univalent conflict, the expected values 
of the different choice situations were equated. Conse- 
quently, the point of indifference was assumed to be 
“functionally comparable” for all subjects and for each 
of the conflict types in both the individual and dyadic 
settings. 

Decision time and nonrationality were determined 
at the point of conflict. Decision time was measured in 
tenths of a second. Subject’s individual decision time 
was defined as his median decision time for the six 
choice situations. When the mathematically expected 
values of the competing alternatives equaled each 
other, no preference could be logically indicated. This 
point of indifference defined a rational course of action. 
Any course of action corresponding to an indifference 
point which deviated from a rational choice was termed 
nonrational. 

Because of the small N and the observed skewness, 
nonparametric statistics were employed. All tests of 
statistical significance were two-tailed. 


RESULTS 

Types of Conflict 

Nonrationality was significantly related to 
the type of choice situation (p < .01). Dyadic 
decision times exhibited the same overall 
pattern, but the latter trend was not significant 
($ < .30) by Friedman’s two-way analysis of 
variance. The disvalent conflict clearly insti- 
gated the least efficient dyadic decisions, and 
the univalent conflict exhibited maximal 
efficiency. The difference between the univalent 
and disvalent conflicts both in nonrationality 
and latency was significant at the .01 level. 

Differences between subtypes were smaller 
than among types; nevertheless, av subtypes 
produced. significantly more nonrationality 
than ap subtypes (6 < .02). Decision times 
also were longer in the av situations, though 
not significantly so, by the Wilcoxon signed- 
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TABLE 2 
Decision MAKING EFFICIENCY as A FUNCTION oF 
Tyre or CONFLICT 
(Degree is measured in cents as a deviation away from 
a rational solution. Time is measured in seconds.) 


(N = 14) l 
Nonrationality Decision Time 
Type of Conflict ———— 9 ——_________ 
Freq. Degree Avg. Rank 
Univalence 
Ap 8 25.7 S 
Av 6 16 28.1 29) 
Disvalence 
Ap 10 89 38.7 «2m 
Av 10 126 46.2 14 
Ambivalence 
P 6 14 32.4 26 
Av 6 16 35. 2.3 


Note.—Degree represents the total amount of nonrationality 
manifested. Rank is the average of the rankings for a given choice 
situation, A rank of 6.0 would represent the shortest decision time, 


ranks test. Conflicts characterized by the 
possibility of losing exhibited less efficient 
decisions than those characterized by the 
possibility of winning. 

Dyadic conflict resolution differed somewhat | 
from individual resolution. In both settings | 
the univalent conflict yielded the most eff- 
ciency; but, in the individual setting, ambiv- | 
alence produced the largest latencies. The 
same overall pattern was shown in the expres- 
sion of nonrationality in both settings, except | 
individual reactions to the possibility of 
losing resulted in slightly more rationality. 
In the individual setting, decision time was { 
the more sensitive measure of conflict. In the 
dyadic setting nonrationality was mort 
sensitive. 

Individual Differences 

Individual differences in making decisions 
were found to be curvilinearly related to the | 
efficiency of dyadic decisions. When the 
differences between partners were categorized 
into large, moderate, and small discrepanci® 
groups, the greatest amount of nonrationality 
and longest decision times were found in the 
moderately discrepant group (see Figures 
and 2). Larger or smaller differences betwee? 
partners resulted in greater efficiency. SM 
differences showed slightly more efficiency tha? 
large differences. The same pattern W | 
observed in all comparisons. When differenc® 
in individual decision times and in 00% 
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rationality were related to both measures of 
dyadic efficiency, the relations approached 
but fell short of significance by a Kruskal- 
Wallis one-way analysis of variance (p < 
.15, .25, .10, .20, respectively). 


Interpersonal Relationships 


The efficiency of the dyadic decisions was 
also a function of the emerging relationships 
between partners. A weak leader-follower 
relationship was associated with a greater 
expression of nonrationality and a strong 
leader-follower relationship with more ration- 
ality (p < .028 by Mann-Whitney U test). 
Neither a strong nor a weak influence relation- 
ship was associated with the time required to 
reach a decision. Only one dyad exhibited 
compromise in their dyadic choices. 

The orientation, or cohesiveness, of the 
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TABLE 3 
Decision MAKING EFFICIENCY AS A FUNCTION OF 
TYPE or INFLUENCE RELATIONSHIP 
(Nonrationality is measured in cents and 
time in seconds) 


Nonrationality | Decision Time 


Influence Relationship} N 
Avg. |Median 


Rank | Value Rodi Value 
Strong 6 | 10.4} 1.0] 8.3 | 21.2 
Weak 7 5.2 | 22.0 | 7.6 | 20.9 
Compromise 1 6.0 | 20.0 | 2.0 | 66.7 


Note.—The smaller the rank the greater the nonrationality 
and indecision. 


dyad was not related to the efficiency of the 
dyadic decisions, although when the inter- 
action sequence was divided into three phases, 
an early emphasis upon orientation remarks 
resulted in shorter decision times and a late 
emphasis in longer latencies (p < .011 by the 
Kruskal-Wallis test). 

The total number of orientation remarks, 
irrespective of their content, correlated 
significantly with the degree of dyadic non- 
rationality (rho, corrected for ties, = 0.74, 
p < .01). The correlation between frequency 
of orientation remarks and median decision 
times was positive but not significant (rho = 
0.34). A tendency also was shown for the 
number of disagreements to be related to the 
length of the dyadic decision times (rho = 
0.49, p < .10). Disagreement and nonration- 
ality were not related. 

Additional Findings 

The typical strategy employed by the dyads 
in making decisions was one of minimizing 
their maximum possible loss. Dyadic decisions 
were significantly more conservative than 
extravagant (p < .05, Wilcoxon test). A 
nonmonotonic function, with the longest 
decision time coinciding with the point of 
conflict, was repeatedly observed as the 
partnerships made their decisions. As the 
point of indifference was approached, joint 
decision times lengthen (p < .01, sign test). 

The two “rational” partnerships, the 
controls, clearly showed fewer orientation 
remarks (an average of 0.5 as compared to 
16.5) and faster decision times (an average 
latency of 7.5 seconds as compared to 32.2 
seconds) with much less variability. 
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Discussion 


The representativeness and generality of 
the results must be considered within the 
limits imposed by the task and instructions. 
A reasoning and achievement set seemed to 
have been established, with extensive motiva- 
tional involvement as evidenced by numerous 
signs of sympathetic activity (e.g., flushing). 
One of the products of the interpersonal as 
compared to the individual setting was this 
greater involvement. The tangibleness of the 
numerical calculations also limited the vari- 
ability of the findings, as did the cooperative 
nature of the task. Cooperativeness, in this 
sense, has been considered synonymous with 
cohesiveness (Deutsch, 1954). A cooperative 
dyad would not be expected to show the vari- 
ability that a competitive one would. Con- 
sidering these restrictions considerable vari- 
ability and generality was shown. 

The type of choice situation was an impor- 
tant determinant of the efficiency of dyadic 
decisions. Nonrationality was especially in- 
fluenced by the choice situation. The possi- 
bility of choosing either a “sure thing” or a 
“Jong shot” in the disvalent situation insti- 
gated considerably more nonrationality and 
greater indecision. The conflict type char- 
acterized by minimal risk (the univalent 
situation) provided the greatest efficiency, 
especially the double ap-ap conflict. 

The comparatively longer indecision found 
in the disvalent situation in the dyadic setting 
suggests the importance of this type of conflict 
for interpersonal behavior. Subjects had to 
reconcile their conservative and extravagant 
tendencies before they could reach an agree- 
ment on a choice. The instructions in the 
Present study necessitated reciprocal action; 
consequently, the partners’ instrumental 
actions were the reinforcing stimuli for further 
interaction. The probability that a given act 
of influence would yield a specifiable outcome 
is less certain in the interpersonal situation. 
When the choice situation also involves 
uncertainty, as in the disvalent conflict, 
interaction increases, resulting in longer 
decision times and more variability of choice. 

An increase in environmental uncertainty 
and risk led to greater inefficiency in making 
decisions. These results support one of the 
more basic postulates of interpersonal behavior 


Joun M. ATTHOWE, Jr. 


(e.g., Allport & Postman, 1945; Cantril, 1958; 
Killian, 1952; Sherif & Sherif, 1956). The 
greater inefficiency found in av as opposed to 
ap situations attests to the greater vigor of 
abient behavior. 

A general similarity between individual and 
dyadic conflicts was also shown. The lengthen- 
ing of dyadic decision time as the point of 
conflict (i.e., indifference) was approached 
points to a similar pattern of hesitancy and 
indecision in both settings. Furthermore, 
dyadic conflict resolution, like the resolution 
of individual conflict, proceeded conserva- 
tively. The dominant dyadic strategy was to 
select the alternative with the lesser possibili 
of losing, although this conservative trend | 
may be specific to the university population 
studied (cf. Mosteller & Nogee, 1951). 

If orientation remarks can be considered @ 
representative measure of verbal interaction, 
interaction was inversely related to efficiency 
in the present study. As verbal interaction | 
decreased, efficiency increased. Rationality 
was associated with infrequent interaction and 
stronger, or more dominating, influence. In 
two dyads, one of the partners was an “expert” 
(ie., behaved in a rational manner); the 
expert not only completely influenced the 
choices of his partner, but he did so with 8 
minimum of verbal interaction. Dyads con- 
taining rational partners or partnerships 
showed considerably less verbal interaction 
and disagreement as well as much greatél | 
efficiency. E 

Discrepancies in the way partners indi- 
vidually resolved conflicts were nonmono- 
tonically related to the efficiency of their 
dyadic decisions. Although these findings did 
not reach significance, the similarity of the 
four relations indicates a reliable trend. Larg? 
and small discrepancies resulted in mor 
efficient decisions. Moderate, or intermediate, 
differences resulted in maximal inefficiency: 
Too large a difference might be seen as t0? 
much to overcome; whereas very small dik 
ferences could seem inconsequential. Either of 
these anticipations would lessen interaction: 
In contrast, moderate differences suggest 
uncertainty and risk, leading to more intel 
action to reduce it; consequently, decisions | 
would take longer and result in more var 
ability and greater nonrationality. 
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SUMMARY 


The cooperative resolution of a dyadic 
conflict was studied as a special case of decision 
making. Two subjects were required to select 
a joint decision. The instructions and the 
task were designed to induce a reasoning set 
and achievement orientation. The efficiency of 
the joint decisions was related to the type of 
choice situation (conflict), the magnitude of 
individual differences, and the nature and 
amount of interpersonal influence (interaction). 
Latency and rationality of choice were the 
measures of efficiency. Maximal inefficiency 
was associated with the possibility of gambling, 
outcomes of loss rather than of gain, moderate 
individual differences, increasing degrees of 
verbal interaction, and a weak leader-follower 
relationship. The dyadic resolution of a 
decision conflict was conservative in strategy. 
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COGNITIVE CONTROLS OF ATTENTION DEPLOYMENT AS 


DETERMINANTS OF 


VISUAL ILLUSIONS: 


RILEY W. GARDNER? 
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HE past 10 years or so have witnessed a 

remarkable upsurge of interest in 

individual consistencies in cognitive be- 
havior, including perception. A recent study 
(Gardner, Holzman, Klein, Linton, & Spence, 
in press) dealt with relationships between 
cognitive control principles—dimensions of 
individual consistency presumed to represent 
structural aspects of ego organization. Results 
of this study suggest that at least two of the 
control principles apparent in subjects’ per- 
formances in laboratory and clinical tests may 
have particular relevance to attention and 
attention deployment in adaptive behavior. 
The first of these controls is a Field Articula- 
tion principle that is apparent in responses to 
situations in which subjects are confronted 
with perceived incongruity in the form of com- 
pelling cues relevant and irrelevant to the 
adaptive requirement. In the study by Gardner 
et al. (in press) this principle seemed to find 
expression in tests said to represent Field 
Dependence-Independence (Witkin, Lewis, 
Hertzman, Machover, Meissner, & Wapner, 
1954) and tests of the Constricted-Flexible 
Control principle (Klein, 1954). The second 
control is a Scanning principle that is apparent 
in individual consistencies in the extensiveness 
with which subjects characteristically sample 
stimuli. Both principles are relevant to situa- 
tions in which the subject must attend (a) to 
stimuli outside himself or (b) to elements of 
memory schemata within himself. 

The conception that such control principles 
Tepresent important aspects of ego organization 
primarily relevant to adaptive behaviors is 
discussed in several publications (e.g., Gardner, 
1959; Gardner et al., in press; Holzman & 
Klein, 1956; Klein, 1954, 1958). This view of 
individual consistencies in cognitive behavior 
is directly in line with recent emphasis in 


1 The work reported here was supported by research 
grants from the National Institutes of Health, United 
States Public Health Service, Study I was supported 
by Grant M-1182; Study II, by Grant M-2454, 

*The author is indebted to Gardner Murphy and 
John Santos for helpful readings of the manuscript. 
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psychoanalytic theory formation upon the 
Structures that govern adaptation (see, e.g., 
Hartmann, 1958; Rapaport, 1957), 

In the present study, as in that of Gardner 
et al. (in press), attention deployment is 
considered both in the terms of concentration 
upon certain aspects of stimulus field in 
individual “looks” at these fields and in terms 
of orienting movements of the visual apparatus 
in the course of examining these fields. Either 
of these related means of controlling attention 
deployment may be apparent in the operation 
of the Field Articulation control principle. 

Results of the study by Gardner et al. (in 
press) suggested that the dimension of indi- 
vidual consistencies described by Witkin 
(Witkin, 1959; Witkin, Karp, & Goodenough, 
1959; Witkin et al., 1954) is not limited to 
situations requiring the subject to extract an 
item from a dominant field organization (as in 
the Embedded Figures Test). It was also 
apparent when the subject was required to 
ignore an item and limit response to a charac- 
teristic of the surround in a test in which the 
subject judged the sizes of disks bearing small 
pictures that tend to reduce the apparent size 
of the disks. (This test was originally used by 
Klein, 1954, to provide a measure of the Con- 
stricted-Flexible Control principle.) Subjects 
low in Field Articulation underestimated these 
disks much more than subjects high in Field 
Articulation. The latter could presumably 
concentrate attention upon the relevant disk 
properties and inhibit the deployment of at- 
tention to the irrelevant pictures. The capacity 
to deploy attention selectively was also appar- 
ent in a Free Association Test that required 
subject to select associations from his internal 
memory schemata. The performances of sub- 
jects adept at Field Articulation reflected 
fluent selectivity in the deployment of atten- 
tion to (hypercathexis of) ideas related to the 
stimulus words. In contrast, protocols of 
subjects low in Field Articulation contained 
relatively few associations and frequent 
“blocking.” 

In considering the attentional behaviors 
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governed by this Field Articulation principle, 
we are not suggesting that the entire range of 
individual differences observed by Witkin et al. 
(e.g., the generalized “passivity” and “‘depend- 
ence” they reported for subjects low in Field 
Articulation) is referable to attentional 
phenomena. It may well be that the individual 
variation in differential attention deployment 
apparent in laboratory and clinical tests is one 
facet of a much broader pattern of interrela- 
tionships that includes such general features of 
cognitive organization as the clarity and 
differentiation of self from nonself. If Piaget’s 
(e.g., 1947) descriptions of the developmental 
emergence of schemata of attention deploy- 
ment are adequate (above and beyond his 
specific hypothesis concerning attention de- 
ployment as a determinant of apparent relative 
magnitude), we should expect facility in the 
selective deployment of attention to be linked 
to a larger segment of personality organization. 

The Scanning principle, relevant to exten- 
siveness of sampling stimulus fields, seems to 
be independent of Field Articulation and to 
control a distinct aspect of attention deploy- 
ment (see Gardner, 1959; Gardner et al., in 
press; Klein, 1958). The individual consisten- 
cies leading to formulation of this principle 
have been observed in size estimation tests 
and tests of incidental noticing, as well as in 
the “scanning” of memory schemata in the 
Free Association Test referred to above. 

A conception of some of the processes con- 
trolled by the Scanning principle was suggested 
by Piaget’s theory (see Piaget, 1947; Piaget, 
Vinh-Bang, & Matalon, 1958) of the effects of 
centration and decentration upon apparent 
relative size. Following Piaget, Gardner et al. 
(in press) hypothesized that extensive, 
balanced scanning (decentration) leads to 
minimal overestimation of standard disks in 
certain size estimation tests. Limited scan- 
ning—relative anchoring of attention upon 
standard (constant) stimuli in the process of 
comparison—was presumed to lead to larger 
errors. (Gardner, 1959; Gardner & Long, 
1960a, 1960b; Wiirsten, 1947, have studied 
such “errors of the standard” in some detail.) 
The study by Gardner et al. (in press) showed 
that subjects characterized by unusual promi- 
nence of the defense of isolation may be among 
the most extensive scanners. It seemed natural 
that these obsessive subjects should sample 
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widely, whether such sampling is adaptively 
relevant or not, and thus have small errors of 
the standard. 

The hypothesis tested here is that the Field 
Articulation and Scanning principles differ- 
entially affect subjects’ response to two 
classical types of illusion-producing figures. 
It was hypothesized that individual differences 
in Type I illusions would be predictable from 
core measures of Field Articulation, whereas 
differences in a Type II illusion would be 
predictable from core measures of the Scanning 
principle. 

Type I. Tests employing the Miiller-Lyer 
and Squares illusions were selected to repre- 
sent Field Articulation because they were 
thought to sample individual differences in the 
capacity to deploy attention selectively to 
relevant parts of stimulus fields that also con- 
tain misleading irrelevant cues. In contrast to 
the Müller-Lyer, the Squares illusion is based 
on a “contrast” effect. These two figures thus 
seem to provide two different kinds of situations 
in which veridical judgment requires the 
subject to concentrate attention upon relevant 
lines and inhibit attention to irrelevant 
portions of the figures. 

Benussi’s (1904, 1912) ingenious studies of 
the effects of whole-perceiving and part- 
isolating attitudes upon the Miiller-Lyer 
illusion influenced the choice of this figure for 
the present study. The much larger illusion 
effects observed by Benussi in groups of 
subjects instructed to attend to the entire 
figure, as contrasted to groups of subjects 
instructed to withhold attention from the 
angled lines, seems to provide direct evidence 
that attention deployment—presumably in 
single looks at the figure, as well as in free 
scanning—is a key factor determining the 
amount of illusion. Schiller (1942) has also 
distinguished between global and analytic 
types of response to the Miiller-Lyer. In the 
present study it was presumed that the sub- 
jects’ characteristic dispositions toward Field 
Articulation would produce predictable differ- 
ences in response without the kind of special 
instruction Benussi gave to his subjects. 

This view of attention deployment as a 
determinant of the Miiller-Lyer illusion is 
much simpler than that offered by Piaget (see 
Piaget & von Albertini, 1950). He described 
the illusion as the final product of an elaborate 
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series of comparisons of lines and spaces within 
the figure. Both approaches, however, rest 
upon the assumption that attention deploy- 
ment is a key variable determining the amount 
of illusion experienced. 

At should be noted here that gestalt psy- 
chologists have spoken of the Miiller-Lyer 
illusion in very different terms from those 
used by Piaget (see, e.g., Kohler & Wallach, 
1944). Kohler and Fishback (1950) and Selkin 
and Wertheimer (1957) have employed “satia- 
tion” theory to explain the reduction of the 
Miiller-Lyer illusion over many trials (an 
interpretation challenged by Eysenck & 
Slater, 1958). Spitz and Blackman (1958) 
have provided direct evidence that Proneness 
to “satiation” effects may be determinant of 
the initial illusion. Therefore, a kinesthetic 
aftereffects test was used to provide a measure 
of satiation-proneness for the 80 subjects of 
Study I. 

Type IT. A second type of illusion—the 
Inverted T—was selected because of its 
apparent relevance to the Scanning principle. 
In contrast to the Miiller-Lyer and Squares 
illusions, the Inverted T does not confront the 
subject with irrelevant lines that must be 
actively ignored if he is to equate parts of the 
figure accurately. A tendency toward extensive 
scanning could lead (a) to a small illusion effect 
(see Piaget et al., 1958), or (b) to a small error 
(overestimation) of the standard (also follow- 
ing the observations of Piaget and his co- 
workers). Although it was not Possible to 
detide in advance upon the likelihood of these 
two possibilities, it was anticipated that the 
Scanning principle would have demonstrable 
effects upon response to this figure, and not 
upon response to the Miiller-Lyer and Squares 
figures, in which the adaptive requirement is 
different. 
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used was mean log 
was correlated .94 
with the mean-time-in-seconds score used by Witkin. 
reliability coefficient (N = 56) 


this test, the subject is asked 
rod surrounded by a luminous 
left or right) to the vertical eight 


was .95. 

Rod and Frame: In 
to adjust a luminous 
frame (tilted 28° 
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times in each of three conditions: body left (28°), ) 
body right (28°), and body erect. Although the entire 
test was administered as it was by Witkin et al. (1954), 
average error for only the body-erect trials was F 
Results of previous studies (e.g., Gardner et al., in 
press; Witkin et al., 1954) suggest that average error in 
the body-erect condition may be correlated somewhat 
more highly with related scores than is average error, 
total test. The corrected odd-even reliability coefficient 
for this measure was .92, 

Scanning. Size Estimation Test A: The subject 
must adjust a circular 


subject judges each 
disk four times in ADDA order from matched ascending 
and descending starting points. This test is similar to 
Size Estimation Test I used by Gardner et al. (in 
press). The measure for this test was subject’s con- 
stant error for all three disks (12 trials). The corrected 
odd-even reliability coefficient was .98, 

Size Estimation Test B: This test is similar to one 
of Klein’s (1954) and to one of the size estimation pro- _ 
(in press), 


his head freely 
Ascending and 


in eight 
odd-even 


Illusions 


General Procedure. Each of the illusion figures de- 
scribed below was presented to the subject in a room 
completely dark except for a roughly circular light patch 
approximately 25 inches in diameter that illumi- - 
nated the figure being viewed. The light patch was of 
indistinct contour and appeared to fade gradually 
into the surrounding darkness. The figures consisted of — 
strips of plastic tape, 1¢inch wide. Except for the 
horizontal line of the Miiller-Lyer figure, which was 
gray, they were covered with white tempera paint. 
All backgrounds were of black cardboard. Movable 
portions of the figures were affixed to black cardboard 
sliders. No subject could distinguish the contours of 
these sliders. The reduction of background cues un- 
doubtedly enhanced the magnitude of the illusion 
effects. The subject sat in front of a table to view the 
figures at a distance of 10 feet. She was placed so that 
her vision centered at approximately the midpoint of 
each of the figures when accurately adjusted and was 
instructed not to move her head. She was allowed to 
move her eyes freely while making judgments. Her 
eyes were closed at all other times. The subject used a 
single control knob on the table in front of her to adjust 
movable portions of all four figures. This knob was 
connected to a selsyn apparatus on the back of each 


Fa 


COGNITIVE CONTROLS OF ATTENTION 


Following all experimental judgments, the subject 
made four control judgments of the lengths of the 
constant lines in the Müller-Lyer and Squares figures 
with the illusion-producing lines concealed. A short 
vertical white line served to divide the horizontal line 
of the Müller-Lyer figure for these judgments. 

Miiller-Lyer Ilusion. The design of the Müller-Lyer 
figure was suggested by findings of several earlier ex- 
perimenters (e.g., Benussi, 1904; Heymans, 1896; Judd, 
1905; Szewezuk, 1938). The arrow- and featherheads 
were covered with white tempera paint and were 
angled 10° from the horizontal line, which was painted 
gray. In contrast to the usual Müllers-Lyer figure, the 
arrow- and featherheads were only 2 inches long. 
The light patch was just bright enough so that all 
subjects could see all the lines clearly. The left (con- 
stant) side of the horizontal line was 8 inches long. 
Turning the control knob moved the right featherhead 
horizontally. The subject made four judgments of the 
length of the left portion of the horizontal line in ADDA 
order from matched ascending and descending starting 
points. Mean adjusted sizes in the control judgments 
were correlated .24, p < .05, with the traditional 
mean “‘llusion-effect” values. The illusion effect score 
used consisted of the difference between the predicted 
size (based upon the regression of mean adjusted 
size, experimental judgments, on mean adjusted size, 
control judgments) and the mean size to which the 
subject actually adjusted the right-hand half of the 
horizontal line, The corrected odd-even reliability 
coefficient was .95. 

Squares Illusion. This figure consisted of two pairs 
of squares connected by horizontal lines that extended 
to the centers of the inner sides. The smaller pair (1.5 
inches on a side) appeared above and to the right of 
the larger pair (4 inches on a side). The smaller pair 
was connected by a 6-inch line. Turning the control 
knob moved the left-hand large square horizontally, 
changing the length of the connecting horizontal 
line. The subject made four judgments of the length of 
the horizontal line connecting the smaller squares in 
ADDA order from matched ascending and descending 
starting points. The z between control and experi- 
mental judgments was .37, p < .01. The score repre- 
senting the illusion effect consisted of the difference 
between the mean size to which subject adjusted the 
variable line and the predicted size (based upon the 
regression of mean adjusted size, experimental judg- 
ments, on mean adjusted size, control judgments). 
The corrected odd-even reliability coefficient was .90. 

Inverted T, Form A (Horizontal Standard). The 
standard (constant) horizontal line was 200 mm. long. 
The subject adjusted the length of the vertical line 
four times from matched ascending and descending 
starting points. Since no control judgments were 
thought possible, the traditional “illusion effect” 
measure for Study I consisted of the difference between 
the horizontal and vertical lines (e.g., 200 — 170 = 30 
mm. illusion effect), The corrected odd-even reliability 
coefficient was .89 for the 80 subjects of Study I. 

Inverted T, Form B (Vertical Standard). This figure 
was identical to the one described above, except that 
the vertical line remained constant at 200 mm., while 
the horizontal line was variable. Turning the control 
knob moved covering cardboard strips horizontally 
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inward or outward over both ends of the horizontal 
line. The subject adjusted the horizontal line four 
times from matched ascending and descending starting 
points. The traditional “illusion effect” measure con- 
sisted of the difference between the mean adjusted size 
and 200 mm. The corrected odd-even reliability co- 
efficient was .94 for the 68 subjects of Study II. 

Gardner and Long (19602) have shown that subjects 
have much greater apparent illusion when the vertical 
line is constant than when the horizontal line is the 
standard, because of the opposite effects of errors of 
the standard on the traditional “illusion effect” meas- 
ures for these two forms. In Study II each subject’s 
errors with the Inverted T figures were thus divided 
into two components: total error of the standard 
(adjusted size — 200 mm., Form B)-(200 mm. — ad- 
justed size, Form A) and total illusion effect (adjusted 
size, Form B — adjusted size, Form A). Corrected odd- 
even reliability coefficients were: total error of the 
standard, .90; total illusion effect, .96. 


A Test of Satiability 


Kinesthetic Aftereffects Test. The test of satiation 
effects is modeled after those of Köhler and Dinner- 
stein (1947) and Wertheimer (1955). Four wooden 
stimulus bars are mounted on saw horses at a con- 
venient height for a standing subject. At the front of 
the parallel saw horses, between which the subject 
stands, are a standard bar 1.5 inches wide (left) and a 
comparison bar with 30 34-inch gradations, the ninth 
gradation 1.5 inches wide (right). At the back of the 
saw horses are bars .5-inch wide (left) and 3 inches 
wide (right). All bars are approximately 60 inches long. 
The subject, blindfolded, first makes four control 
judgments of the 1.5-inch bar from ascending and 
descending starting points, in ADDA order. He then 
rubs the two interpolated bars (.5 left, 3.0 right) si- 
multaneously, three times per second for 30 seconds 
in time with a metronome. Following this, he is re- 
quired to judge the 1.5-inch bar again. After a 30- 
second pause, he again rubs the interpolated bars and 
judges the 1.5-inch test bar. This sequence is continued 
until four judgments of the 1.5-inch bar have been 
made following interpolation. The starting points 
for these judgments are the same as for the control 
judgments. 

Residual scores are obviously needed to represent 
individual differences in satiation effects in this type 
of test; difference scores are not adequate. The measure 
used was the difference between judged size for Trial 1 
following interpolation and the predicted size (based 
on the regression of adjusted size, Trial 1, experimental 
judgments, on adjusted size, Trial 1, control judg- 
ments). The r between these control and experimental 
judgments was .35, p < .01. The first trials were used 
because of (a) obvious differences in the patterning of 
the subjects’ judgments over the four trials, and (b) 
higher test-retest reliability (o = .54) for the first 
trials than for the groups of four trials. 


Sropy I 


To evaluate the hypothesis that Field 
Articulation determines response to the Type 


124 Rey W. 
I figures, whereas Scanning controls response 
to the Type II figure, 80 sorority members at a 
small midwestern university were tested. 
Their ages ranged from 17 to 22. Except for 24 
subjects who did not take the Embedded 
Figures Test, all subjects participated in the 
first eight procedures described above. 


RESULTS 


Intercorrelations among the eight measures 
were obtained and factor analyzed by the 
centroid method. Six iterations were done to 
improve the estimation of communalities. 
Four factors were extracted. The three factors 
accounting for most of the common variance 
were rotated orthogonally by the quartimax 
method (Neuhaus & Wrigley, 1954).* Table 1 
contains the rotated (orthogonal) factor 
loadings. 


Factor I 


This is clearly the predicted Field Articula- 
tion factor. Like the factor obtained by 
Gardner et al. (in press), it seems to represent 
individual consistencies in the capacity to 
achieve differential response to relevant vs. 
irrelevant cues (in this case, the horizontal line 
vs. the illusion-producing angled lines). Low 
factor scores can be conceived of as represent- 
ing facility in concentration in the face of com- 
pelling distractions. In the Embedded Figures 
Test, the subject must “find” simpler figures 
in contexts containing many misleading cues. 
In the Rod and Frame Test, he must concen- 
trate upon the gravity cues and the rod and 
actively withhold attention (and response) 
from the frame. In the Miiller-Lyer test, 
effective equation of the two segments of the 
horizontal line requires restriction of attention 
to the horizontal line—the same type of re- 
striction that Benussi induced his “part- 
isolating” subjects to employ. In the Squares 
illusion, veridical judgment requires that the 
subject limit attention to the horizontal lines 
and actively avoid attending to the illusion- 
producing squares. Although loadings for the 
illusion effect measures are not high, the role 
of Field Articulation in solving this kind of 
adaptive problem seems clearly demonstrated. 

* Rotation by the varimax method (Kaiser, 1958) 
produced almost exactly the same results. Steven G. 


Vandenberg, of the University of Michigan, provided 
IBM 650 programs for these solutions. 
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The fact that Factor I seems to combine 
elements of Thurstone’s (1944) Factors A and 
B may be interpreted as follows: (a) Thur- 
stone’s Factor B (which included response to 
the Miiller-Lyer figure) had a defining loading 
on the more difficult portion of the Gott- 
schaldt test he used. (6) Factors A and B were 
correlated in Thurstone’s study. Witkin’s 
Embedded Figures Test, more difficult than 
either of Thurstone’s, may be still more effec- 
tive for observing Field Articulation. 


Factor IT 


This appears to be a Scanning factor. At 
first glance, small errors of the standard 
(constant errors) in the two size estimation 
tests appear to be associated with large illusion 
effects with Form A (horizontal standard) of 
the Inverted T. Since errors of the standard 
and illusion effects work against each other in 
Form A, however, apparenily large illusion 
effects may represent small errors of the 
standard. The ambiguity of this result led to 
Study II, which was designed to test the 
assumption that Factor II represents the 
Scanning principle, as represented in errors of 
the standard in all three tests. : 


Factor IIT 


Although the communality of the after- 
effects test is rather low, this factor may 
represent individual differences in proneness to 
satiation effects (see Köhler & Wallach, 1944): 
the greater a subject’s satiability, the greater 
the initial illusion with the Miiller-Lyer and 
Squares figures. This interpretation of the 
loading for the Miiller-Lyer judgments is in 
line with the findings of Spitz and Blackman 
(1958). 


Srupy II 


In order to verify the interpretation of 
Factor II, which rested upon the hypothesis 
that extensive scanners show small errors of the 
standard, rather than large illusion effects, 
in response to the Inverted T with horizontal 
standard, a new group of 68 subjects (female 
business college students, nursing students, 
and secretaries, aged 17 to 30) was tested with 
Size Estimation Tests A and B and fwo forms 
of the Inverted T: Form A, with horizontal 
standard, and Form B, with vertical standard. 
Half the subjects were given the two forms in 
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TABLE 1 
Roratep (Orthogonal) FACTOR Loaprincs: Quartmax SOLUTION 


Measure 


Factor 


1.) Embedded Figures: Mean Log Time 

2. Rod and Frame: Average Error, Body Erect 
3. Size Estimation A: Constant Error 

4.| Size Estimation B: Constant Error 

5.| Miiller-Lyer, Illusion Effect 

6. Squares, Illusion Effect 

7. Inverted T, Form A, “Tllusion Effect” 

8. Kinesthetic Aftereffect, Trial 1 


Percentage of Variance Accounted for: 


each order. Starting points were changed to 
approximately five sigmas above and below 
the anticipated mean adjusted sizes, based 
upon adjustment of Form A by the subjects of 
Study I. 

If the interpretation of Factor II is correct, 


: constant error (error of the standard) in Size 


Estimation Tests A and B should be signifi- 
cantly correlated with total error of the 
standard, as derived from adjustments of two 
forms of the Inverted T. It could also be pre- 
dicted that errors of the standard in the size 
estimation tests are negatively correlated with 
the traditional “‘illusion-effect” measure for 
Form A, but positively with the traditional 
measure for Form B (adjusted size — 200 mm.). 
The confirmation of these hypotheses, like 
the group results concerning errors of the 
standard reported by Gardner and Long 
(1960a, 1960b) would establish even more 
firmly the misleading nature of traditional 
measures for single forms of the Inverted T 


and related figures. 


RESULTS 


Table 2 contains intercorrelations of meas- 
ures derived from the four tests. The hypothe- 
sis leading to the experiment and underlying 


the interpretation of Factor II as a Scanning 


factor is confirmed. Subjects who show large 
errors of the standard in the size estimation 
tests also show large errors of the standard in 
the Inverted T. These errors of the standard 
are of sufficient magnitude with the relatively 
large illusion figures employed to produce the 
predicted correlations in opposite directions 
between the size estimation scores and the mis- 
leading traditional “illusion effect” measures 
for single forms of the Inverted T. When 


I pi ul 
+9424 —.1006 — .0604 -9019 
6374 —.1108 1021 -4290 
+2384 —.7180 -1470 5940 
-0708 —.8220 — .0754 +6863 
4369 —.0576 — .4586 4045 
3591 -0747 — .4858 3706 
-2305 3427 .1482 1925 
— .0047 -0510 — .4298 1874 
21.62 16.78 8.68 
TABLE 2 
PEARSON CORRELATIONS 
(N = 68) 
Size 
Inverted T Illusion Estima- 
tion A 
Traditional 
‘peat Total Overes- 
Effect” Total | OF 
Measures | Eror | Tu- | tima- 
Stand- | peep | Stand- 
Overestimation of | Form |Form| ®d ardd 
Standard A |B 
Size Estimation A | —.20° | .24° | .36°* | .07 
Size Estimation B | —.16 | .20°| .30** | 05 | .70*** 


* p < .05, one-tailed test. 
** p < .01, one-tailed test. 
*** p < 001, one-tailed test. 


illusion effect is separated from error of the 
standard, 1’s between the size estimation 
measures and illusion effect measures are 
nonsignificant. 


CONCLUSIONS 


We have seen how two independent cogni- 
tive control principles—Field Articulation 
and Scanning—differentially guide response to 
two kinds of illusion figures. Although the 
response processes involved in the operation of 
these principles require further exploration, 
these results seem to confirm the interpreta- 
tion of these controls offered by Gardner et al. 
(in press). Field Articulation is evident when 
differential response to part of a complex field 
of stimulation is required. Effective observa- 
tion of Field Articulation also depends upon 
the difficulty of achieving selective attention to 
part of the field. For example, the difficult form 
of the Müller-Lyer illusion used in the present 
study (short, impressive angled lines close to 
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the horizontal line) elicits Field Articulation. 

Response to easier forms of the illusion may, 

however, be determined primarily by factors 

other than Field Articulation. Scanning is 

most evident when the adaptive situation al- 

lows expression of response dispositions con- 

cerning the extensiveness of attention de- 

ployment, 

In the study by Gardner et al. (in press), a 
Field Articulation factor appeared most 
clearly for female subjects, a Scanning factor 
only for men. The present findings thus seem 
to confirm the anticipation that both principles 
can be demonstrated for one sex if a larger 
group of subjects and more effective tests of 
Scanning are employed. 

The minimal scanner shows the kind of 
global and unarticulated attention deployment 
that characterizes children and tends to de- 
crease with age (see Piaget, 1947, Ch. 3; 
Würsten, 1947). The extensive scanner ap- 
parently deploys his attention in such a way 
as to overcome the enhancement of magnitude 
Piaget assumes to be contingent upon repeated 
centrations on anchoring objects in the field 
(in this case, the standard stimuli). As Witkin’s 
(1959) developmental data show, subjects low 
in Field Articulation are also similar to 
children, in their inability to achieve differen- 
tial response to parts of stimulus fields con- 
taining conflicting information. These subjects 
can be conceived of as relatively inefficient 
in attending to relevant cues and inhibit- 
ing attention to misleading irrelevant cues. 
Understanding of other control principles and 
additional aspects of personality organization 
is necessary, however, to evaluation of the 
adequacy of an individual’s cognitive controls. 
Extreme Field Articulation could as easily be 
employed in maladaptive as in adaptive ways. 
Extensive scanning may be an adaptive virtue 
in some situations and an unnecessary or con- 
fusing exertion in others. Thus the analogy 
between limited scanning, for example, and 
the attention deployment of children is not 
meant to imply a value judgment. 

From the psychoanalytic point of view, the 
two experiments provide a further glimpse of 
two aspects of ego organization that are obvi- 
ously relevant to broad classes of adaptive 
situations. Problems of cathexis and hyper- 
cathexis have been an important concern in 
psychoanalytic theory formation from the be- 
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ginning. Freud (1954), for example, began to 
outline a view of attention deployment as early 
as 1895 (in the Project for a Scientific Psy- 
chology). He conceived of the shift fro 
cathexis of the memory image of the n 
satisfying object to cathexis of percepts rep 
senting external objects as a key process in the 
beginning differentiation of the self from the 
nonself. As development progresses, derai 
mechanisms—and we would add, cognitive 
controls—serve to guide attention and its 
deployment in enduring, individually-different 
ways. As Piaget has pointed out, the general 
emergence of schemata of attention deploy- 
ment is a crucial facet of adaptive coordina- 
tion of the self to the external world. The 
present results suggest that at least two dis- 
tinct aspects of ego organization, in the form 
of cognitive controls, can be shown to repre- 
sent enduring strategies of attention deploy- 
ment. As in the study by Gardner et al. (in 
press), the apparent independence of such 
principles is worthy of note. In the present 
study, a subject adept at Field Articulation 
may be either a limited scanner or an extreme 
scanner. And these principles of cognitive con- 
trol are but two of a larger number of principles 
that comprise the person’s overall “cognitive 
style.” i 
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EFFECTS IN GROUP THERAPY! 
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recent conference on research in 

psychotherapy (Rubinstein & Par- 

loff, 1959) pointed up the difficulties 
of validating the effectiveness of psycho- 
therapy by the current methods of assessing 
change in behavior. The number of relevant 
variables in the therapy situation is such that 
even random assignment of cases to treatment 
or no treatment groups (Fairweather, Simon, 
Gebhard, Weingarten, Holland, Sanders, 
Stone, & Reahl, 1960; Rogers & Dymond, 
1954) does not guarantee that differences ob- 
tained may be reasonably associated with the 
form of treatment as distinct from either the 
practitioner or the unique interaction among 
the variables in the situation. Rather than 
attempting to assess the value of the total 
therapy situation, it seems more efficient first 
to isolate the behaviors of the therapist and 
the patient which are characteristic of the 
psychotherapy situation. Then these charac- 
teristic behaviors may be systematically 
manipulated in order to test hypotheses 
about the psychotherapeutic process. Among 
these hypotheses are: (a) that one person can 
influence the behavior of another in a predict- 
able direction; (6) that this change has a 
desirable effect on behavior in a second, 
criterion situation; and (c) that as far as 
possible, the changed behavior in the criterion 
situation is associated with a specific aspect of 
the already circumscribed behavior of the 
experimenter. 

The first point, that one person can influ- 
ence another person’s verbal behavior in a 
specifiable manner, may be accepted as reason- 
ably well established by the studies of operant 
verbal conditioning reviewed by Krasner 
(1958) and Salzinger (1959). It is primarily to 
the second point, the change of subjects’ 
behavior in a situation other than the experi- 
mental one, that this article is addressed. 


1From the Psychology Research Laboratory, 
Veterans Administration Hospital, Palo Alto, Cali- 
fornia. This investigation was supported in part by 
Research Grant M-2458, National Institute of Mental 
Health, United States Public Health Service. 
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Relevant studies in which behavior has been 
manipulated in one situation and the effect of 
this manipulation has been measured in a 
second may be roughly categorized into three 
types: motor operant conditioning, verbal 
operant conditioning, and “psychotherapy.” 

Motor operant conditioning techniques as a 
means of deliberate “therapeutic” manipula- 
tion of patient behavior has been employed in 
a series of recent investigations (King, Mer- 
rell, Lovinger, & Denny, 1957; Lindsley & 
Skinner, 1954; Peters, 1952; Peters & Jenkins, 
1954; Tilton, 1957). In these studies, motor 
operant conditioning was purposefully used as 
a therapeutic technique to bring patients into 
more active contact with reality. 

In the verbal conditioning literature, reports 
on generalization effects are infrequent. The 
usual procedure has been to present a series of 
test-like tasks prior to and immediately follow- 
ing a verbal conditioning session, and then to 
ascertain changes in these tests. For example, 
Williams (1959) used as his pre- and post- 
measures an “interpersonal check-list” and the 
Osgood Semantic Differential Scale. In one of 
his experimental groups, “feeling statements” 
were reinforced; in a second experimental 
group, “nonfeeling statements” were rein- 
forced. The experimental situation itself was 
presented to the subjects as “psychotherapy.” 
Although use of the reinforced verbal classes 
was increased during the experimental ses- 
sions, there were no significant differences 
between groups on the pre- and posttest 
measures. Using a similar design, Rogers (1960) 
also reported significant effects of conditioning 
but failure to obtain generalization as meas- 
ured by a battery of four personality tests. 
However, Sarason (1957) did report significant 
generalization effects as measured by a “free 
word naming” situation before and after a 
verbal conditioning situation. 

The “psychotherapy” studies generally in- 
volve the use of a series of psychological 
measures before and after a stated “treatment” 
period (Rogers & Dymond, 1954). Some 
studies (Grossman, 1952; Keet, 1948) make 


VERBAL CONDITIONING AND Group THERAPY 


use of a specified number of experimental 
sessions to test the effectiveness of a given 
therapeutic technique. There have been 
several discussions (Berg, 1952; Leary & 
Harvey, 1956; Miller, 1954) of the rationale of 
the use of such tests as projective techniques, 
rating scales, and Q sorts as measures of thera- 
peutic effectiveness. Using a different criterion 
and one closer to the present study, Brown 
(1957), working with “chronic ward patients,” 

« used as his dependent measure responsiveness 
to an avoidance conditioning situation (paired 
tone and light stimuli alternately reinforced by 
a painful faradic shock). After the pre-experi- 
mental measures, half the patients were sub- 
jected to various psychotherapeutic manipula- 
tions, principally group therapy. The group 
that had received therapy was significantly 
more responsive to conditioning after therapy 
than before, whereas the control (no therapy) 
group showed no significant increase after the 
same period of time. 

Where Brown used responsiveness to con- 
ditioning as a dependent variable, the present 
study used verbal operant conditioning ses- 
sions as the independent variable and ratings 
of behavior in group therapy as the dependent 
variable. The purpose of this design was the 
attempt to test two hypotheses basic to 
psychotherapy: that interpersonal interaction 
can lead to change in a desirable manner in a 
second, nonexperimental situation, and that 
the observed change may be associated with a 
specific aspect of the experimenter’s behavior. 


METHOD 


Three groups, each consisting of 10 male “con- 
tinued treatment” patients at a VA neuropsychiatric 
hospital, served as subjects. At the start of the experi- 
ment the median age of the subjects was 35 (range: 
21-47), the median education was 12 years (range: 
7-16 years), and the median length of current hospitali- 
zation was 5 months (range: 1-133 months). On ad- 
mission, 28 of the subjects had been diagnosed as 
schizophrenic reaction, one had been diagnosed as 
manic depressive, and one as depressive reaction. Each 
group was given the same experimental task, the ex- 
perimenter’s response being varied. All subjects were 
rated by group therapists before and after the experi- 
mental sessions. 

Experimental task. Subjects were seen for two 
sessions per week for 2 weeks. The situation was 
structured to the patient as “a study into how people 
make up stories about pictures shown to them.” In 
each session, the patients were shown four pictures and 
asked to make up stories to last for 5 minutes each. 
If responses lagged, the usual TAT instructions and 
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encouragement were used by the experimenter. The 
picture stimuli used were line drawings of situations 
common to the experiences of patients in hospitals and 
were scenes such as buying a tie, mowing a lawn, 
fishing, buying a movie ticket, ordering a meal in a 
restaurant. The situations were selected to be common- 
place and not emotionally arousing in themselves. 
This procedure minimized differential “card-pull” 
(Gurel & Ullmann, 1958; Ullmann, 1957a) and reduced 
the possibility that emotionally toned stimuli would 
alter verbal production in a schizophrenic sample 
(Ullmann, 1958). The use of less direct stimuli is in 
line with Krasner and Ullmann’s (1958) suggestion 
that minimal cues (e.g., “mmm-hmm”) about which 
the schizophrenic subject is less aware may be less 
threatening and therefore more effective in increasing 
and guiding his verbal behavior than overt examiner 
responses (e.g., “good-bad,” “right-wrong,” “true- 
false”) which the schizophrenic may perceive as evalu- 
ative of his behavior. 

Experimental groups. Each subject was assigned, in 
the manner described below, to one of three conditions. 
Once in a condition, the same examiner behavior was 
maintained throughout the four story-telling sessions, 
each of which involved four stories. That is, the pattern 
of examiner reinforcement started on the first trial of 
the first session. In the first condition, the positive- 
personal reinforcement group, the examiner responded 
to the verbal class to be reinforced by a head nod and 
the approving sound “mmm-hmm.” In the second 
group, the impersonal-unstructured reinforcement 
group, the examiner responded by pushing a button 
attached to an electric counter which emitted a loud 
click. The dial of the counter faced the patient, and 
he saw an increase in the number on the dial. At the 
end of each story, the counter was reset to zero. Ques- 
tions about the meaning of the counter were answered 
with the phrase that it was one of the experimental 
measures. In the third group, the no reinforcement 
group, the experimenter made no responses to the 
stories told by the subjects. 

Verbal class reinforced. The verbal class reinforced 
was “emotional words” as defined by Ullmann and 
McFarland (1957). There were two reasons for the 
selection of this group of words. The first was the rele- 
vance of such words to group therapy ratings. Ullmann 
(1957b) found that ability to express feeling resulting 
from social premises presented in a projective test was 
significantly correlated with group therapy scale rat- 
ings. Similarly, Ullmann and McFarland (1957) found 
that the number of emotional words used in response 
to TAT cards was highly correlated with the group 
therapy ratings. In terms of relevance, then, emotional 
words represent the sort of material that is produced 
in therapy and is of particular interest to psycho- 
therapists. An increase in the use of emotional words in 
the experimental situation may be therapeutic in 
itself, or the practice of these words during experi- 
mental sessions may lead to an increase in their use 
in the group therapy situation. If either or both of 
these possibilities were the case, it could be hypothe- 
sized that an improvement in group therapy ratings 
would result. 

A second reason for the use of emotional words was 
the high rater reliability previously obtained in the 
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use of this class of verbal behavior. In the original 
work with TAT protocols (Ullmann & McFarland, 
1957), rater reliability was .92. Using a single “hearing” 
of 16 nonreinforced 4-minute tapes of college students, 
three raters (including the examiner in this study) 
obtained a coefficient of concordance significant be- 
yond the .001 level (Weiss, Krasner, & Ullmann, 
1960). 

Dependent criterion measure. The dependent measure 
used was the Palo Alto Group Therapy Scale (GTS) 
(Finney, 1954). The GTS is an 88-item scale, checked 
by therapists for the patient’s behavior during the 
most recent four therapy sessions. Finney (1954) re- 
ported that for 18 groups in a neuropsychiatric hospital 
a median rank order correlation of .84 was obtained 
between scale scores and global rankings of patients 
by group leaders. In the same study, a rank order 
correlation of .80 was obtained between the average 
ratings by 10 ward personnel of adequacy of inter- 
personal relationships throughout the hospital and the 
patients’ GTS scores. 

In the present study, five group therapists? on two 
wards completed GTS forms about their patients. The 
therapists knew that a number of conditions were in- 
volved in the experiment but not to which one the spe- 
cific patient would be assigned. One of the authors 
scored the ratings and then assigned patients to the 
three experimental groups. A restriction on absolute 
randomness of patient assignment was that the means 
and standard deviations of the preratings for the three 
groups were kept as equal as possible. Another of the 
authors (BJC) was the examiner in all cases and did 
not know the subjects’ GTS scores, Immediately after 
the last experimental session, the group therapist 
rerated the patient.* In all cases, the same therapist 
made both the pre- and post-GTS ratings. 


RESULTS 


The major results of the experiment are 
preted ia Table 1. The means and standard 
eviations of the pre-experimental groups were 
not significantly different, The ate gain 
(% — 32) for the Positive-personal reinforce- 
ment group was 5.70 GTS points; for the 
impersonal-unstructured reinforcement group, 
it was 1.00 GTS points, and for the no rein- 
forcement group, it was 0.50 GTS points. 
Finally, the own-control change was at the .05 
level (one-tailed test) for the positive-personal 
reinforcement group (f = 1.83), whereas it 
was not significant (£ = 0.34 and 0.15) for 
the other two groups. 
_ Clinically, these results may be presented 
in the following manner: Five of the 10 subjects 


? The authors wish to express their thanks to Geo 
pecan on Finney, Ralph Forsyth, Sidney 
and, and Donald Lim fi i iteri 
etait for making the criterion 

* The time between pre- and postratings averaged 
14.80 days with a standard deviation of 3.31 days. 
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in the positive-personal reinforcement group 
gained 10 or more GTS points; whereas 2 of the 
10 subjects in the impersonal-unstructured 
reinforcement group gained this much, and 
only 1 of the 10 subjects in the no reinforce- | 
ment group gained 10 or more GTS points. | 


Discussion 


The results of this study will be discussed in 
terms of hypotheses that are basic to inter- 
preting psychotherapeutic interactions. One of | 
these hypotheses is that the effect of such an | 
interaction is measurable in an independent | 
criterion situation. The frequency of use in 
projective situations of the verbal class con- | 
ditioned in this study, emotional words, had | 
previously been demonstrated to be signifi- l 
cantly correlated with GTS scores (Ullmann & | 
McFarland, 1957). The relevance of the use of | 
emotional words to group therapy ratings may 
explain why positive results were obtained in | 
this study. Further, the evidence for the | 
validity of the criterion used in this study 
(Finney, 1954) and the fact that the ratings © 
depended on actual interpersonal behavior add | 
to the pertinence of the criterion as a measure 
of the effectiveness of interpersonal inter- 
actions. 

In terms of amount of change, it is interest- | 
ing to note the relatively brief period of time 
(80 minutes) involved in the experiment 
itself. In addition, these results were obtained l 
within an ongoing hospital treatment program | 
which may well have increased within-group | 
variance.‘ Despite this evidence for the thera- | 
peutic effectiveness of verbal conditioning, 
however, there are considerations that limit 
the generalizations to be drawn. On the one 


‘An idea of the uncontrolled variation present 
within an ongoing treatment program may be obtained 
by considering why subjects on whom pre-experimental 
ratings had been madedid not complete the experiment: 
Eight of the men either failed to come to appoint- 
ments, or when they came, were not cooperative in 
story-telling; four moved to different wards; three 
left the hospital; two men underwent extensive and 
painful dental work that interfered with story-telling; 
two were lost to the study when the group therapist 
moved to a different ward, and procedures were not 
completed with one man because the examiner broke 
her leg. In terms of the criterion pre-experimental 
ratings, these 20 men were not significantly different 
from the 30 who did complete the experiment. The 
average GTS rating of the 20 men who did not complete 


eant was 61.4 with a standard deviation of 
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TABLE 1 
MEANS AND STANDARD DEVIATIONS or Group THERAPY RATINGS BEFORE AND AFTER STORY-TELLING UNDER 
THREE DIFFERENT REINFORCEMENT CONDITIONS 


P 3 TE 
Type of Reinforcement yi E EEEN NE ' 
ži a i o 
positive-personal 10 58.2 20.6 63.9 25.5 .94 5.70 1.83 
impersonal-unstructured 10 56.8 18.4 57.8 21.1 .91 1.00 0.43 
no reinforcement 10 61.0 20.0 61.5 21.1 -89 0.50 0.15 


hand, the technique is not pertinent to re- 
gressed patients who cannot tell stories. Such 
patients are ones who might have been suc- 
cessfully approached by motor operant condi- 
tioning techniques (King et al., 1957; Linds- 
ley & Skinner, 1954). With highly verbal or 
more “normal” subjects—college students, for 
example—it is possible to hypothesize that 
the practice of speech in general and emotional 
words in particular would be less effective 
because these subjects have less need for 
additional practice in emotional expressive- 
ness. It is therefore hypothesized that the 
effectiveness of verbal conditioning as a 
“therapeutic” technique may well be useful 
only with people in a particular segment of 
the continuum of mental health, 

The results of this study also lend support to 
the proposition that the changes in criterion 
measures ascribed to psychotherapy may be 
associated with specifiable therapist behaviors. 
Evidence for this concept stems from the 
three experimental groups used. In both the 
no reinforcement and the impersonal-unstruc- 
tured reinforcement groups, the amount of 
additional personal contact with the patients 
matched that of the positive-personal rein- 
forcement group. The use of the clicker- 
counter in the impersonal-unstructured rein- 
forcement group matched the additional 
attention called to emotional words. Such 
additional cuing, by itself, did not yield a sig- 
nificant gain in group therapy scale ratings. 
Therefore, within the limits of the present 
design, there is evidence that positive-personal 
reinforcing behavior of the examiner was an 
important factor associated with the signifi- 
cant change in group therapy scores. 

It is possible to hypothesize that the amount 
and direction of change measured by the cri- 
terion should be positively correlated with 
change in the experimental situation in the use 
of the reinforced verbal class. The design of 
the present study did not permit a test of this 


hypothesis. All experimental trials were rein- 
forced, that is, there were no nonreinforced 
operant trials. This was done to avoid the 
extinction effect on schizophrenics of unrein- 
forced trials which might have decreased the 
chance of obtaining change on the dependent 
measure. Other reasons for the decision made 
in this study were the failure of Rogers (1960) 
and Williams (1959) to obtain generalization 
effects after having obtained significant con- 
ditioning, and the attempt to design an ana- 
logue of psychotherapy in which the therapist 
is likely to give immediate reinforcement to 
the patient’s first emission of the desired 
verbal behavior. Finally, the hypothesis of the 
positive correlation between change in use of 
the verbal class during experimental sessions 
and change on the criterion may be over- 
simplified. It may well be that the important 
thing learned was the permissibility of spon- 
taneously using emotional words in an inter- 
personal situation rather than practice in the 
use of the words per se. 

Beyond the evidence for specific hypotheses 
which underlie psychotherapy, the present 
paper illustrates the use of verbal conditioning 
as a miniature situation for the study of inter- 
personal behavior (Krasner, 1955). Verbal 
conditioning may be used both as a dependent 
variable to measure the effects of various 
experimental conditions on responsiveness to 
reinforcement, or, as in this study, as the 
independent variable. In either case, the verbal 
conditioning situation permits the reduction 
and isolation of interpersonal variables with a 
resultant increase in the opportunity for their 
systematic study. 


SUMMARY 


This study used a verbal conditioning 
situation to investigate hypotheses relevant 
to psychotherapeutic interactions. Neuro- 
psychiatric patients who were receiving group 
therapy participated in four story-telling 
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sessions during which emotional words were 
reinforced in one of the following three ways: 
a positive-personal manner, an impersonal- 
unstructured manner, and not reinforced at 
all. Ratings made by group therapists before 
and after the experimental story-telling ses- 
sions indicated a significant gain in adequacy 
of interpersonal relationships in group therapy 
for the group receiving positive-personal rein- 
forcement. There was no significant gain for 
the other two groups on this criterion measure. 
The results support the hypotheses that one 
person can influence another person in a posi- 
tive way and that this change may be measur- 
able by an independent criterion situation. 
Further, this change in the subject’s behavior 
may be demonstrated to be associated with 
specific behavior on the part of the experi- 
menter. 
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THE EFFECTS OF A COMMUNICATION ON ATTITUDE CHANGE 
AND SCALE JUDGMENTS! 


WALTER WEISS 
Boston University 


EIss (1959) has recently demon- 

W strated that a person’s expressed 

attitude can be affected by a change 
in his estimation of the relative favorableness- 
unfavorableness of a set of opinion statements. 
The change required was postulated as a 
reduction in discrimination among opinion 
items in the neighborhood of the range of 
acceptance. Subjects judging the scale values 
of 20 opinion items in relation to a presented 
standard for the middle category of judgment 
exhibited a striking central tendency effect, 
with a consequent reduction in discrimination 
among the items in the middle region of the 
scale. These subjects, in comparison with other 
groups given explicit end standards or no 
standards, extended their range of acceptance 
toward the punitive end of the scale. Their 
aggregate attitude on the issue was found to 
be more punitive than the positions of the 
other groups. 

In view of this finding, it is of interest to 
determine if a change of attitude resulting 
from exposure to a persuasive communication 
is also attended by a change in scale judg- 
ments. Patently, a sequential effect of attitude 
change anteceding a change in judgment can 
not be demonstrated. But, at the least, evi- 
dence for the presence or absence of a relation- 
ship would be suggestive for theory. 

A subsidiary purpose of the research was to 
examine the relevance to communication 
effectiveness of the subjects’ evaluations of the 
persuasive communication. In some studies, 
differences in evaluation were found to be 
related to posttest attitude (Hovland, Lums- 
daine, & Sheffield, 1949; Weiss & Fine, 1955, 
1956); in others, they were not (Hovland & 
Mandell, 1952; Weiss, 1958). But the inter- 
pretations of such data are inevitably made 
uncertain by the lack of precise control groups 
formed by the division of the control subjects 
into those who would have and those who 


1The research reported in this article was com- 
pleted under Grant M-1351 from the United States 
National Institute of Mental Health. 
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would not have given favorable evaluations 
of the communication. An attempt was made 
here to provide such judgmental control 


groups. 
METHOD 


A total of 141 students in summer session courses at 
Boston University comprise the sample. The subjects 
were tested during one of their regular class periods. 
They were told that the purpose of the study was to 
examine people’s judgments of the opinion statements 
of others on a particular issue. 

Booklets containing the required treatment combina- 
tions were randomly arranged by consulting a table of 
random numbers and were handed out individually to 
the students. Each booklet consisted of five sections. 
(The term “section” is being used only for descriptive 
expediency; the booklets contained no such designa- 
tion.) The order for the experimental subjects was: 
(1) graphic rating of initial position, (2) communica- 
tion followed by questions asking for evaluations of the 
communication, (3) or (4) judgments of the scale 
values of a set of opinion statements on the issue of the 
communication, (4) or (3) questionnaire assessing at- 
titude on the issue, (5) semantic differential. As 
indicated, the order of Sections 3 and 4 were coun- 
terbalanced. For the control subjects, Section 2, con- 
taining the communication and questions evaluating it, 
was placed at the end of the booklet; all other sections 
were moved forward and the counterbalancing provi- 
sion was retained. Hence, these subjects read the com- 
munication and gave their evaluations of it only after 
they had responded to the attitude questionnaire and 
judged the scale values of the opinion statements. All 
subjects were admonished not to return to a page after 
they had turned it; this restraint was enforced by close 
surveillance. 

The first page of each booklet contained a nine-point 
graphic rating scale, which each subject marked in 
accord with his general position on the issue of capital 
punishment. Scale Position 1 was designated “strongly 
opposed,” Position 3 “moderately opposed,” Position 5 
“in doubt,” Position 7 “moderately favorable,” and 
Position 9 “strongly favorable.” The intervening cate- 
gories were left unlabeled. 

The communication was a two-page article by an 
anonymous author who opposed strongly and vigor- 
ously the use of capital punishment for any crime. The 
article was followed by a number of multiple-choice 
questions asking for the subject’s evaluation of the 
fairness, truth, cogency, etc. of the communication. 

The attitude questionnaire consisted of the 20 items 
of Form A of the Peterson-Thurstone scale of attitude 
toward capital punishment. The subjects responded to 
the questionnaire by marking one of five response 
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alternatives for each opinion statement: strongly agree, 
agree, uncertain, disagree, strongly disagree. 

The subjects judged the scale values of the 36 items 
comprising the combined Forms A and B of the Peter- 
son-Thurstone scales? by selecting a rating of 1-11 for 
each item depending on its judged favorableness-un- 
favorableness towards capital punishment. Scale Value 
1 was labeled “very strong approval,” 6 “neither 
approval nor disapproval,” and 11 “very strong dis- 
approval” of capital punishment. The subjects were 
told that the other numbered, but unlabeled categories 
represented equal steps between the midpoint category 
and the end ones. They were also informed that not 
every rating need be used, for they may feel that none 
of the statements is a reasonable example of a particular 
position on the scale, It is worth noting that the desig- 
nations of the ends of the judgment scale are opposite 
in significance to those used for the comparable ends 
of the self-rating scale. Also, 20 of the items scaled 
comprised the attitude questionnaire and, hence, were 
responded to in two ways by the subjects. 

The semantic differential consisted of 10 pairs of 
evaluative attributes drawn from the factor analytic 
study of Osgood and Suci (1955). A 1-7 rating scale was 
associated with each pair of polar adjectives in the 
usual manner. For half of the scales, the high value 
attribute designated the left end of the scale; and for 
half, the right end. 


RESULTS AND DISCUSSION 
Scale Judgments 


The scale values? of the 36 items were deter- 
mined separately for each group by the pro- 
cedure recommended by Attneave and Cham- 
blis (1957). Their method takes account of 
individual differences in subjective rating scales 
or in the utilization of the ratings for making 
the judgments. For each group, a U-shaped 
relationship was found between median scale 
judgments and heterogeneity of ratings for an 
item, as given by the reciprocal of the inter- 
quartile range, a finding that conforms with 
other judgment data revealing higher vari- 
ability of judgment for stimuli in the middle 
region of a scale in comparison with those 
distant from the central position (Attneave & 


* Although each form of the attitude scale contains 
20 items, the conjoint set consists of 36 different 
statements, since the four neutral ones of each form 
are identical. 

In the presented analyses no distinction is made 
between subjects responding to the attitude scale 
first and then judging the scale positions of the items 
and subjects responding to these sections in the Te- 
verse order. As regards initial position, posttest at- 
titude, scale judgments, and evaluation of the com- 
munication, there were no differences even verging 
on reliability among the experimental or control subjects 
differentiated by this procedural variation. 
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Chamblis, 1957; Edwards, 1946; Volkmann, 
1951). A product-moment correlation of .99! 
was obtained between the scale values based 
on the judgments of the experimental subjects 
and of the control subjects. A comparabl 
correlation was obtained for the subset of 
items comprising the attitude questionnai 
An examination of the individual scale value 


approval or disapproval of capital punishment, 
Furthermore, the two groups were comparable 
in the total number of different rating cate 
gories used in the judging and in the number of 
items placed in any single category or group 
of categories. From the data at hand and on 
the basis of the analyses performed,’ there are 
no discernible differences between the experi 
mental and control groups in their scale judg- 
ments of the opinion stimuli. Hence, if atti“ 
tudinal differences are found as a result of 
exposure to the communication, they cannot 
be attributed to nor do they contribute to 
correlated judgmental effects. 


Attitude Change 


The mean initial position for the experi- 
mental subjects, based on the self-ratings, 
was 5.28; and for the control subjects, it was 
5.61. The difference between the groups does 
not approach significance (p > .50).° 

Since the groups were in no way different in 
their judgments of the opinion statements, 
the scale values of the items were computed $ 
from the responses of the entire aggregate of 
subjects.’ The median scale value of the items 
accepted (i.e., agreed or strongly agreed with) 
was taken as the subject’s attitude position’ 


‘The original Peterson-Thurstone scale values 
correlated —.97 with the values obtained using the 
Attneave-Chamblis procedure. The sign is negative 
because in the original scaling the higher the number, 
the greater was the degree of judged approval, while 
the reverse obtained under the current procedure. 

ë Scaling of the items by the successive intervals 
Procedure recommended by Edwards (1952) yielded 
the same results. The groups were completely com- 
parable in the widths of the determinable scale intervals 
and in the measured scale values of the stimuli. 

© All p values are two-tailed. 

7 None of the attitude results is changed if the scale 
values of the items are based on the responses of the 
control subjects alone. 

*The attitudinal results are unchanged if such 
other means of determining a person’s attitude are 


ATTITUDE CHANGE AND SCALE JUDGMENTS 


Consequently, the higher the median score, 
the less favorable is the respondent to capital 
punishment. Table 1 contains the mean atti- 
tude for each group. By ż test, the difference 
between the groups is significant at the .02 
level, with the experimental subjects exhibiting 
a less favorable position. The difference be- 
tween the experimental and control groups 
was found to be significant for those whose 
initial positions were in the Interval 5-7 (in 
doubt to moderately favorable to capital 
punishment) on the self-rating scale (mean 
2.43 vs. mean 1.79; p < .02), not quite signifi- 
cant for those who were initially even more 
favorable to capital punishment, Interval 
8-9 (mean 1.61 vs. mean 1.31; p < .10), and 
not different for those initially opposed to 
capital punishment, Interval 1-4 (mean 3.42 
vs. mean 3.39).° Hence, on the basis of these 
analyses, exposure to the communication did 
lead to attitude change in the direction of the 
communicator’s position. As in other studies, 
(Carlson, 1956; Hovland, Harvey, & Sherif, 
1957; Lazarsfeld, Berelson, & Gaudet, 1948), 
the effect of the communication was primarily 
exhibited by persons whose initial positions 
were near the midpoint of the attitude con- 
tinuum. These are prople who usually feel less 
strongly or are in conflict about their positions 
on an issue and are likely to be more responsive 
to the influence of persuasive communications 
or other change-inducing experiences. 

The difference between the groups in after- 
attitude may be due to an increase in accept- 
ance of statements with higher scale values 
(i.e. less favorable to capital punishment), 
a decrease in acceptance of statements with 
lower scale values, or both. Of the 20 items of 
Form A, 8 expressed sentiments favorable 


utilized as: applying the original Peterson-Thurstone 
scale values; assigning an arbitrary score of 0-4 to each 
opinion response, depending on which of the five al- 
ternatives was selected, and then cumulating the 
scores; or assigning a score of 0-2 to each response, 
with no distinction made between the two “agree” 
categories and the two “disagree” ones, and cumu- 
lating these scores. All methods of scoring were highly 
intercorrelated. 

® Bartlett’s test revealed a significant heterogeneity 
of variances among the distributions of attitude scores 
of these six subgroups. The 5-7 subgroups were ob- 
served to have larger variances than the others. How- 
ever, there was no reliable difference in variances 
between the experimental and control subjects within 
any subdivision based on initial position. 
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TABLE 1 
POSTTEST ATTITUDE SCORES 
Experimental Control 
M 2.59 2.18 
SD 0.96 1,10 
N 75 66 


to capital punishment, 8 were unfavorable, 
and 4 were neutral. These categorizations 
are based on the location of an item, in terms 
of its median judgment, to the left or right of 
the midpoint category or within it. The allo- 
cation of the items to these three evaluative 
categories accords perfectly with the compar- 
able significance of the original Peterson- 
Thurstone scale values. The mean numbers of 
pro, neutral, and con items accepted by each 
group of subjects are presented in Table 2. By 
t tests, the experimental group was found to 
have accepted significantly fewer items favor- 
ing capital punishment but more items oppos- 
ing it than the control group. However, the 
groups were comparable in the number of 
neutral items accepted and in the total number 
accepted. Hence, for the groups as a whole, 
the observed difference in attitude score fol- 
lowing exposure to the communication seems 
to be describable as a shift of both ends of the 
range of acceptance toward the unfavorable 
end of the scale. Furthermore, there is no 
apparent change in the extent of the range of 
acceptance. This observation can be given 
crude quantitative support by obtaining for 
each subject the range of the scale values of 
the items he accepts. The difference between 
the groups in the means of these ranges does 
not approach significance. 

To determine if the preceding outcomes 
based on the responses of individuals accord 
with group patterns, the percentage of each 
group accepting each item of Form A was 
determined. The relevant data are presented 
in Table 3.” The groups differed significantly 
in their acceptance of only 5 of the 20 items 
(p values < .05). On 2 of these items, the 
experimental group has a significantly smaller 
percentage of agreement in comparison with 


19 No distinction was made between the responses of 
“strongly agree” and “agree” in determining the per- 
centages accepting each item, Though not included, 
“uncertain” responses ranged from 1% to 28%. 
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TABLE 2 


NumBers oF Pro, Con, AND NEUTRAL ITEMS ACCEPTED 
By Eaca Group 


Pro Neutral Con 

Experimental 

“M 1.68 1.37 2.63 

SD 1.86 0.94 2.26 

N 75 75 75 
Control 

M 2.33 1.11 1.88 

SD 1.80 0.91 2.14 

N 66 66 66 

d <.05 >.10 f 
ecn v oo Rl a A 
TABLE 3 


PERCENTAGE OF EACH GROUP AGREEING 
WITH Eacu ITEM 
oo 
Item Scale Value Experimental Control 


4 0.00 0.00 1.52 
1 oz 18.67 an 

uo os 21.33 36.36 

“0.67 26.67 33.33 

waroenble 2 ot 33.33 42.42 
9% 1.82 29.33 46.97" 

16 1.66 24.00 25.76 
ENESE 26.67 48.48° 

2 2.03 29.33 4.24 

9 22 77.33 71.21 

Neutral 1 233 6.67 0.00 
6 (24 25.33 15.15 

1% a8 5.33 9.09 

8 3.01 29.33 18.18 
103.44 60.00 39.39" 

3 > 38s s467 36.36° 
eer 3.80 68.00 48.48° 
2 40 28.00 2.73 

134.26 30.67 28.79 

Tei 9.4.50 28.00 21.21 

W = 75) (N = 66) 
*p < os. 


the control group; and on 3, a higher percent- 
age of agreement. The former 2 statements 
express sentiments favorable to capital 
punishment and the latter 3 are opposed to 
capital punishment, Of more importance is the 
fact that the 3 unfavorable items occupy 
consecutive rank positions as determined by 
their scale values and the 2 favorable items 
are separated by only one rank. Hence, the 
items on which the groups are significantly 
different in acceptance form almost unbroken 
segments of the attitudinal scale. This evi- 
dence is clearly in accord with the outcomes of 
the preceding analyses. 
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Semantic Differential 

An aggregate score on the semantic diffe 
ential was obtained for each subject by de 
mining the mean of his 10 ratings. The rati 
of half the scales were transformed so that 
all scales the higher the number, the le 
favorable was the evaluation of capi 
punishment. The median of the agg 
distribution of scores is 5.0; and 58.1% of 
experimental subjects but only 40.9% of 
control subjects were above the median. 
p value of the difference is .06. Hence, th 
data support the results presented in Table 
based on the attitude questionnaire, 


General Relationship between Attitude a 
Judgment 


Since attitude was affected by the persuasi 
communication but scale judgments were noi 
it is important to determine if position on 
issue of capital punishment bore any relation- 
ship to judgments of the opinion items. For, 
if no interconnection occurred, the j 
mental comparability of the experimental and 
control group has diminished systematic 
import. Hence, comparisons were made be- 
tween the judgments of subgroups of subjects 
distinguished as before in terms of their 
initial self-ratings on the issue. No differen 
that even approached reliability could be found 
between experimental and control subjects 
within any of these subgroups. Furthermore, 
the obtained pattern of judgmental differences 
among the opinion subgroups was not affected, 
whether only experimental or control subjects 
were considered. Consequently, the distinction 
between such subjects was not maintained in 
the following analyses. 

In Table 4 are presented the mean number of 
items allocated to the neutral category (6), to 
the segment of the scale designating at least 
Some approval of capital punishment (1-5) | 
and the segment designating disapproval 
(7-11). By analyses of variance, the between 


H The results are in no way affected if after-attitude 
is employed instead of initial position. The correlation 
between initial position and posttest attitude is —.86 | 
for the control subjects and —.78 for the experimental 
subjects. Also, the data patterns are not changed if 
only the 20 statements are examined that are common 
to the attitude scale and the 36-item set judged by the 
subjects. 


ATTITUDE CHANGE AND SCALE JUDGMENTS 


groups F is significant for the neutral (p < .05) 
and approval (p < .01) categories. The 5-7 
subgroup (initially in doubt to moderately 
favorable) assigns significantly fewer items to 
the approval category than do each of the 
others (p at least < .05) and significantly 
more items to the neutral category than do the 
initially opposed (1-4) subjects (6 < .05) but 
not quite significantly more than the strongly 
favorable (8-9) subjects (.200 > p > .10). 

When the median judgments of the opinion 
items were examined, the 5-7 subgroup was 
observed to have assigned higher values to at 
least 13 of the 16 items that expressed some 
degree of approval of capital punishment and 
lower values to at least 12 of the 16 items 
expressing some degree of disapproval to 
capital punishment than did the other two 
subgroups. These and the preceding results 
suggest a central tendency effect in the judg- 
ments of the 5-7 subgroup, an outcome that 
accords with Weiss’ (1959) supposition con- 
cerning the effects of attitude on scale judg- 
ments of opinion statements. However, despite 
differences in judgments, the groups ordered 
the items in almost precisely the same way. 
All rank order correlations were .99. In gen- 
eral, the data bespeak a relationship between 
position on the issue and judgments of related 
opinion statements. This finding is compatible 
with the outcomes of some previous research 
(Hovland & Sherif, 1952; Sherif & Hovland, 
1953; Weiss, 1959). 

Taken as a whole, the results clearly reveal 
that a persuasive communication can influ- 
ence attitude on an issue without the necessity 
of an attendant change in scale judgments. 
The preceding study by Weiss (1959) indi- 
cated that if a certain kind of perceptual change 
does occur, even without exposure to a com- 
munication, a change in expressed attitude is 
obtained. The two studies therefore establish 
that perceptual change of the kind examined 
is at best only a sufficient condition for attitude 
change. Of course, if the persuasive communi- 
cation produced a much greater change in 
attitude, an associated effect on judgment 
might have been observed. And other com- 
munication variables or experimental pro- 
cedures or ways of measuring change may 
reveal a more intertwined relationship between 
attitude change and perceptual change. But 
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TABLE 4 
NUMBERS or OPINION STATEMENTS ALLOCATED TO 
CERTAIN SEGMENTS OF THE SCALE BY SUBJECTS 
DIFFERING IN INITIAL POSITION 


Initial position 
Scale Segments 
8-9 5-7 14 
Favorable M 16.14 14.53 17.10 
SD 3.54 3.58 3:01 
Neutral M 3.38 4.212 a2 
SD 2.68 3.22 2.74 
Unfavorable M 16.55 17.26 16.26 
SD 3.51 3.23 3.04 


N 42 47 50 


any expectation of a general, unqualified rela- 
tion of necessity between the two is impugned 
by the current data. 


Attitude Change and Judgments of the Com- 
munication 

Justification for using the control subjects’ 
evaluations of the communication rests on the 
following supposition: the judgments given 
by these subjects are the ones they would have 
made had they read the communication at the 
same time as the experimental subjects. 
Although this supposition cannot be con- 
firmed directly, discrepancy between the 
judgments of the control and experimental 
subjects would invalidate this basal require- 
ment and congruence would be consistent 
with it? When the subjects’ evaluations were 
examined, the groups were not found to be 
significantly different on any judgment. The 
relevant data based on all the judgment ques- 
tions are presented in Table 5. Hence, the 
method employed appears to have yielded the 
necessary control groups for examining the 
relationship between judgment and attitude 
change. 

On only one of the five questions was initial 

2 These implications are valid if the two groups of 
subjects are comparable to each other in characteristics 
that would affect their evaluation of the communica- 
tion. The use of random assignment and the relative 
equivalence of the groups in initial position support this 
assumption. 

13 Tt is only for presentational convenience that each 
set of responses is dichotomized; for the analysis of each 
complete response by group table yielded a nonsig- 
nificant difference. 
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TABLE 5 
PERCENTAGE OF Each Group MAKING CERTAIN 
JUDGMENTS OF THE COMMUNICATION AND 
OF THE ComMMUNICATOR’s POSITION 


Experi Control 
Mostly or Completely One- 55.4 66.1 
Sided 
Mostly or Entirely True 66.2 75.4 
Moderately or Very Strong Case 83.8 83.1 
Quite or Extremely Emotional 36.5 47.7 
Presentation 
Author Strongly Opposed to 73.0 78.5 
Capital Punishment 
N 74 65 


TABLE 6 
RELATIONSHIP BETWEEN INITIAL POSITION AND 
JUDGMENT OF COMMUNICATOR as STRONGLY 
ED TO CAPITAL PUNISHMENT 


Initial Position 

8-9 5-17 14 
Coua 87.0 60.0 4 
ercentage ` B 86. 

N 23 20 22 

Experimental 

ercentage 75.0 55.6 88.9 

N 20 27 27 


position found to be related to judgment of 
the communication. A smaller proportion of 
those whose initial positions were within the 
interval “in doubt to moderately favorable” 
(5-7) adjudged the communicator strongly 
opposed to capital punishment (the extreme 
alternative) than did the other subjects. The 
relevant data are in Table 6. For the experi- 
mental group alone, the $ value of x? is <.05 
and for the control group, .06. For the two 
groups combined, p is <.005. This finding is 
consistent with data of Hovland, Harvey, and 
Sherif (1957) who offer an interpretation based 
on the effects of assimilation and contrast in 
judgment. 

Since a number of the cells of the frequency 
table on which Table 6 is based have Ns that 
are quite small, initial position could not be 
utilized in the general analysis of the relation 
between this judgment and communication 
effectiveness. Table 7 presents the mean 
posttest attitude for each judgment subgroup. 
By analysis of variance, only the difference 
between the experimental and control groups 
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is significant. By 7 tests, only the differeng 
between the “other” subgroups was signifi 
cant (p < .05). Hence, for the aggregate gf 
subjects this judgment was unrelated 
Posttest attitude. 

Although the interaction term was no 
significant, it was thought to be sufficient] 
interesting to warrant further examination 
Since most of the subjects giving the “other 
judgment had initial positions of 5-7 and sing 
attitude change was exhibited primarily by 
these subjects, their attitudes and those of th 
corresponding ones making the more extreme 
judgments were computed. The data are in 
Table 8. Despite the small Ns involved, 
analysis of variance revealed significant M 
for the interaction term (p < .01) as well as 
the two main effects. Patently the data do nol 
permit specifying a directional mediating 
relationship between judgment of the come 
municator’s position and attitude change 
Nevertheless, greater attitude change was 
exhibited by those subjects who did not pet 
ceive the communicator as strongly opposed) 
to capital punishment than by those who did. 


TABLE 7 


RELATIONSHIP BETWEEN POSTTEST ATTITUDE AND) 
JUDGMENT OF CommunicaTor’s POSITION 


Strongly Opposed Other 
erimental 
Ey 2.49 2.78 | 
SD 0.97 0.94 
N 54 20 
Control 
M 2.21 1.9% 
SD 1.15 0.81 
N 51 14 
Maa- 
TABLE 8 


RELATIONSHIP BETWEEN Jupcment or ComMUNI 
CATOR’S POSITION AND POSTTEST ATTITUDE AMONG 
SUBJECTS INITIALLY UNCERTAIN OR f 
MODERATELY FAVORABLE TO 
CAPITAL PUNISHMENT 


Strongly Opposed 
Experimental 
1.97 ` 

SD 0.64 . 

N 15 1 
Control 

M 1.87 

SD 0.71 

N 12 
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This result does not accord with the outcomes 
of some previous research (Ewing, 1942; 
Goldberg, 1954; Hovland & Pritzker, 1957; 
Weiss, 1958). 

As regards the other evaluative questions, 
posttest attitude of the experimental subjects 
was in no instance found related to judgment; 
and only on the question of the truth of the 
communication was it found associated with 
the attitudes of the control subjects. To 
exemplify the common attitude-judgment 
patterns exhibited by the subjects on these 
questions, data are presented in Table 9 
relating to the perceived truth of the com- 
munication and attitude position. By analysis 
of variance, neither the main effect of judg- 
ment nor the interaction term were signifi- 
cant;™ of course, the experimental group was 
significantly different from the control group. 
By ¢ test, the attitude difference between the 
experimental and control subjects expressing a 
favorable judgment had a p value of .15, 
while the difference between the subjects 
expressing a less favorable judgment yielded a 
p value of <.05.° Although as previously 
noted the second-order difference was not 
significant, we can still remark that the effect 
of the communication on those considering it 
relatively untruthful was not any less than on 
those evaluating it as truthful. These data 
and the general lack of relationship between 
judgment and attitude change cast doubt on 
any assumed mediating significance of differ- 
ing evaluations of a communication. 

Except for one instance involving a specially 
selected subsample of subjects, attitude 
change and judgment of the communication 
were not interrelated. Although these data 
merely add to the empirical potpourri result- 
ing from the outcomes of previous research, 
they are based on a more precise methodology. 
From a general assumption that the manner of 


M Even when initial position is considered, judgment 
is unrelated to attitude change. 

14 In the absence of control subgroups and in view 
of the lack of a relationship between initial position 
and this judgment, one might have considered testing 
each experimental subgroup against the total control 
group. The result would be a p value < .05 for the 
comparison involving the experimental subjects giving 
the more favorable judgment and one > .15 for the 
other subgroup. This is a reversal of the results obtained 
when the proper control groups are available. 
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TABLE 9 


RELATIONSHIP BETWEEN POSTTEST ÅTTITUDE AND 
Jopoment or TRUTH or COMMUNICATION 


Bateti True Other 
tal 
2.61 2.30 
SD 0.98 1.07 
N 49 49 
Control 
M 2.48 1.73 
SD 0.96 1.03 
N 25 16 


perceiving certain qualities of a communication 
modulates the effectiveness of persuasion, we 
might expect a relationship between perception 
and attitude change. It may be that a certain 
fundamental quality not yet examined will be 
found to have a general and systematic con- 
nection with communicative effectiveness, Or 
further research may continue to exhibit 
variegated results, But since judgments of the 
communication and attitude change are both 
response outcomes that are methodologically 
intertwined, we cannot expect to determine a 
sequential arrangement of effects, And in the 
absence of precisely formulated theoretical 
expectations, obtained or unobtained relation- 
ships are unlikely to have particular conceptual 
significance by themselves. 


SUMMARY 


The purposes of this study were to assess 
the effects of a persuasive communication on 
attitude change and on judgments of the scale 
values of opinion items and to examine the 
relationship between attitude change and 
evaluations of the communication, An experi- 
mental group of subjects was exposed to a 
communication advocating abolition of capital 
punishment, after which they evaluated the 
communication, judged the scale values of 36 
opinion statements, and revealed their own 
attitudes by responding to 20 opinion items. 
All of the opinion statements were taken from 
Forms A and B of the Peterson-Thurstone 
scale of attitude toward capital punishment. 
A control group of subjects provided compar- 
able data; but they read the communication 
and evaluated it only after having responded 
to the attitude scale and having judged the 
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scale values of the 36 items. The subjects were 
assigned at random to the two groups. 

The major results are: 

1. The communication changed attitudes in 
the direction advocated. But the groups were 
alike in their judgments of the scale values of 
the related opinion statements. Hence, a 
change in scale judgments is not a necessary 
condition for attitude change. 

2. In general, evaluations of the communica- 
tion were not related to attitude change. 
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CRITIQUE AND NOTES 
TIME ORIENTATION AND SOCIAL CLASS: 
A CORRECTION 


JOEL E. GREENE anp ALAN H. ROBERTS 
New Mexico Highlands University 


‘HE purpose of the present note is to evaluate 

the findings of a commonly cited study on 
time orientation (Le Shan, 1952). Le Shan hypoth- 
esized that lower class children are less future- 
oriented than middle class children. As a test of 
this hypothesis he instructed subjects representing 
the two social classes to “Tell me a story.” The 
time elapsed in the story was estimated. The 
data as reported are shown in Table 1. Le Shan 
stated that the chi square value for these data was 
highly significant (p = .001, 6 df [p. 592]). The 
writers were unable to duplicate this level of 
significance. Rather, the value of chi square ob- 
tained from his data was 9.65 which, with 6 df, 
is not significant. In addition, it should be noted 
that this calculation of chi square was based upon 
a 2 X 7 contingency table that contained nine 
cells having expected frequencies less than 10; 
A these, six had expected frequencies less than 
Le Shan’s apparent errors notwithstanding, 
the usual method of correcting for small expected 
frequencies is to collapse adjacent categories 
until acceptably large expected frequencies are 


TABLE 1 
DATA FROM LE SHAN (1952, p. 592) 

PME ilis SEME 
EEEa a [aa] s Laai is 
i=) _ = = -_ ~ ~“ 

Lower class 32 |24| 4] 6/4] 3 1 

children 

(N = 74) 

Middle class 12 | 10| 4 | 10 4 1 2 

children 

(N = 43) 
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obtained. There is some danger that if alternative 
ways of collapsing are available, scrutiny of the 
data may result in exploiting chance differences, 
Le Shan’s data can be used to ilustrate this 
point; the writers tried several combinations of 
categories and arrived at values ranging from p < 
.02 to not significant. Lewis and Burke (1949) 
emphasize that chi squares should be based upon 
categories determined by criteria independent of 
the data to be analyzed. 

When such contradictions appear in the analy- 
sis of data, equivocal interpretation of findings can 
be avoided only by relying upon the rationale 
underlying the concept of time perspective and its 
operational definition, In the present instance, 
the rationale should suggest criteria for deciding 
which adjacent categories should be combined. 
Unfortunately no solution to the problem seems to 
be readily available. 

It appears, then, that Le Shan’s findings are 
equivocal and should not be interpreted as clearly 
showing a difference in time perspective between 
middle and lower class children. While primary 
responsibility for the accurate interpretation of 
data remains with the researcher, it is no less the 
responsibility of later investigators who are de- 
pendent upon the results of a particular study for 
theoretical or methodological support to examine 
it critically. Critical examination has apparently 
been lacking in this case. 
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NEED ACHIEVEMENT, FEAR OF FAILURE, AND SELECTIVE RECALL! 


WALTER R. REITMAN 
Carnegie Institute of Technology 


ai study attempts to clarify the conceptual 
status of the need achievement measure 
developed by McClelland, Atkinson, and their 
colleagues by means of a further investigation of 
its behavioral correlates. Two experiments on 
selective recall of meaningful material as a func- 
tion of low, moderate, or high need for achieve- 
ment are described, together with a discussion of 
their significance in relation to the need achieve- 
ment construct. 


MertHop 


Subjects for the first experiment were 108 male 
volunteers from the elementary psychology course. 
The 68 male subjects who participated in the second 
experiment were drawn the following year from the 
same course under an experimental participation re- 
quirement instituted that year.* The n Achievement 
distributions for the two parent populations from 
which subjects came were divided separately into equal 
thirds. Each subject then was classified as low, moder- 
ate, or high in n Achievement by referring his score to 
the breaking points for his parent population. 

Several weeks before recall measures were adminis- 
tered, subjects wrote thematic apperceptive stories 
under Neutral Conditions. These were scored for n 
Achievement by experienced scorers whose scores corre- 
lated at least .90 in reliability checks with an author’ 
of the system. 

Achievement-Orienting instructions, describing the 
second session as a test of abilities in which it was 
important to do well, preceded recall materials for all 
subjects in the first experiment and half of those in the 
second. The other subjects in the second experiment 
received neutral instructions, which described the ses- 
sion as a routine demonstration intended to familiarize 
students with the types of instruments psychologists 
use in studying behavior. Next, subjects read instruc- 
tions for the recall measure embedded in phrases similar 
to those of the initial instructions, so as to maintain the 
appropriate initial set. In these instructions, subjects 
were told they would have one minute to read the story 


1 Based on a paper presented September 1959 at the 
APA convention. I am indebted to John W. 
Atkinson for his aid and encouragement, and to the 
National Science Foundation and the Social Science 
Research Council for fellowships held during the course 
of these studies. 

2 Both of these experiments were run in conjunction 
with unrelated investigations of n Achievement reported 
elsewhere. Full details of the common motive and per- 
formance assessment conditions are omitted here to 
save space. See Atkinson and Reitman (1956) and 
Reitman (1960). 

3 John W. Atkinson. 
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to be recalled and would be asked to recall it about on 
half-hour later. Throughout the second experiment, 
instructions and materials were in written form, t 
minimize interaction between the experimenter an 
subjects. 

The stories to be recalled were paired, each 
ranging from 126 to 129 words. The first two stori 
describe successful achievement. The central fi 
are working on difficult problems with determinat 
and confidence. They succeed and feel elated. The ney 
two stories deal with attempts to achieve that end 
failure. The central figures are grim, worried, hesitant 
The projects involved are based on questionable id 
or are too difficult. The attempts fail; the protag 
give up in despair or discouragement, The next ti 
stories are unrelated to achievement. In one, a troub 
young man tries to reconcile feelings for his pare 
with his desire to marry. In the other, two annoy 
grocers are repairing a display upset by a customer 
dog so they can go home to eat and rest. The first fi 
stories are slight modifications from ones used if 
previous studies (deCharms, Morrison, Reitman, & 
McClelland, 1955; Reitman, 1954). 

All subjects were given one story only. In the firs 
experiment, six subjects each with low, moderate, an 
high n Achievement scores read and recalled each o 
of the six stories. All subjects in the second exp 
were given the first of the two failure stories. 

The general recall coding system is discussed el 
where by Reitman (1954); the recall score depends 
on accuracy of detail and on general thematic conte 
The failure story that follows is reproduced as adn 
tered, except that the detail units have been 
thesized. The thematic content score depends on 
of the task and personal goals of the hero, of his i 
mental activity, and of the outcome and the h 
affective reaction to it. 

This chap is (grim), doing some (heavy) (medita 

ing). He is a (sophomore) and has reached an (i 
tellectual) (crisis). He (cannot make up his mind 

He is (troubled), (worried). N 

He is (trying) to (reconcile) the (philosophies) 

(Descartes) and Thomas (Aquinas)—at his (ti 

age) of (eighteen). He has (read) several (books o 

philosophy) and (feels) the (weight of the world 

(on his shoulders). 

He (wants) to (present) a (clear-cut) (synthesis) 6 

these two (conflicting) (philosophies), to (sai 

his own (ego) and to gain (academic) (recogni 

(from his professor). 

He will (mess himself up) (royally). Too (inex 

enced) and (uninformed), he has (tackled) 

great) a (problem). He will (give up) in (despair) 

(go down) to the (Goodyear) and (drown) b 

(sorrows) in a (bucket) of (Piel’s). 
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RESULTS 


Means and SDs of recall scores for 108 subjects 
participating in the first experiment are presented 
in Table 1 as a function of need for achievement 
and type of story (success, failure, or unrelated). 
There are no significant main effects, but the 
interaction is significant (6 = .025). The subjects 
with moderate n Achievement scores—previously 
found to be associated with evidence of “fear of 
failure” (e.g., McClelland, Atkinson, Clark, & 
Lowell, 1953), go from the highest recall scores 
on success and unrelated stories to the lowest 
scores on the failure stories. Still more interesting 
is the contrast between this relative inability to 
recall stories dealing with failure to achieve under 
Achievement-Orientation and the superior scores 
comparable subjects also moderate in n Achieve- 
ment make when recalling such material under 
the neutral conditions of the previous studies 
(deCharms et al., 1955). 

This result had not been predicted. This fact 
and the existence of procedural differences be- 
tween this experiment and the previous research 
made a second experiment desirable. The second 
study directly compares relative recall of achieve- 
ment failure material by subjects with moderate 
as opposed to high and low achievement motive 
scores under Achievement-Oriented and under 
Neutral Conditions. 

As Table 2 indicates, subjects with moderate 
n Achievement scores once again obtain both the 
highest failure recall scores under Neutral Con- 
ditions and the lowest failure recall scores under 
Achievement-Oriented Conditions, relative to other 
subjects. The difference is in the expected direc- 
tion, and significant at the p = .025 point. 


Discussion 


This replicated relationship between n Achieve- 

ment score and recall of failure material under 
Achievement-Oriented vs. Neutral Conditions is 
of interest in three connections. 
_ First as Atkinson (1958, p. vii) points out, there 
1S as yet no generally accepted construct or system 
of constructs to account for differences in achieve- 
ment-related imagery on the TAT, or for the many 
Kinds of motive-performance relations obtained 
With thematic apperceptive measures of the 
achievement motive. Studies of nonideational 
behavior, such as performance on arithmetic tasks, 
frequently yield roughly linear motive-perform- 
ance associations, and lead to a conception of the 
n Achievement score as some linear function of a 
need to do well. 

The present results do not bear at all on this 
empirical utility of the n Achievement measure 
as a linear predictor of nonideational performance. 
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TABLE 1 
SELECTIVE RECALL As A FUNCTION OF Low, MODERATE, 
OR Hiag NEED z NT pe Success, 
'AILURE, OR UNRELATED STORY 
(N = 12 in each cell) 


Need Achievement 
Story Content Low Moderate High 
M SD M SD M SD 
Success 16.83 | 4.62 | 19.92 | 5.19 | 18.50 | 4.27 
Failure 22.08 | 5.68 | 16.92 | 6.81 | 18.08 | 4.17 
Unrelated 16.75 | 5.51 | 21.50 | 5.38 | 20,83 | 5.96 


TABLE 2 
SELECTIVE RECALL OF FAILURE STORY AS A FUNCTION 
or NEED ACHIEVEMENT AND ACHIEVEMENT- 
ORIENTED vs. NEUTRAL CONDITIONS 


Need Achievement 
h Moderate Mean 
Low and Hig! lera et 
ence in 
N| M | SD|N| M | SD} recall 
24 | 17.62) 6.52 | 10 | 20.80) 7.19 | —3.18 
Ach.-Oriented 23 | 19,48) 6.01 | 11 | 15.73) 5.17 3.75 
Difference Between Mean Differences 6.93 
SE of Difference 3.39 
t 2.05 
> (one-tailed) 025 


They do, however, clearly support the theoretical 
position taken by McClelland et al. (1953, p. 273), 
Moulton (1958), and others who suggested that 
the n Achievement index may be a complex re- 
sultant of several distinct variables, or else may 
yield several distinct score categories, each of 
which is characterized by “emergent” qualities 
(e.g., “fear of failure”) not linearly associated 
with the underlying need achievement scores, 
These experiments, secondly, are relevant to 
research on vigilance and defense phenomena, 
While the present design of itself does not permit a 
choice between primarily perceptual and primarily 
mnemonic interpretations of the results, it does 
suggest an experimental paradigm with two basic 
advantages. The use of printed orienting instruc- 
tions and objective indices of motive and per- 
formance makes possible exact specification of 
four basic predictions in such studies, namely, who 
will distort, what, in which directions, and under 
which conditions. Furthermore, while the motive 
and recall measures yield objective scores, they 
also contain surplus information. Analysis of the 
kinds of stories written by experimentally specified 


144 


subjects, the ways in which they respond to con- 
trolled variations in orienting instructions, and 
the emphases and forms of organization evident 
in their recall protocols should provide valuable 
information on the defense mechanisms involved 
in perceptual and mnemonic distortion. 

Finally, experiments on selective recall of 
meaningful material seem likely to provide addi- 
tional information about the complex motivational 
structures tapped by the n Achievement index. 
In so doing, they promise not only to clarify the 
theoretical issues involved, but also to result in 
still more useful motivational measures. 


SUMMARY 


Under Achievement-Orientation, moderate need 
achievement scores—previously found associated 
with evidence of “fear of failure’—appear re- 
lated to relatively high recall of success and irrele- 
vant stories and low recall of failure stories. A 
second study shows moderate n Achievement, as 
predicted, again associated with relatively low 
recall of failure stories under Achievement-Orien- 
tation, but with relatively high recall of the same 
stories under Neutral Orientation. From these 
results, need for achievement appears a complex 
variable yielding several distinct score categories, 
each of which may be characterized by “emergent” 
qualities (e.g., “fear of failure”) not linearly asso- 
ciated with n Achievement scores. The paper 
discusses the relevance of the results to the theory 
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and use of n Achievement measures and considers 
the relation of these experiments to research on 
vigilance and defense. 
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VALUE, MAGNITUDE, AND ACCENTUATION! 


SAMUEL E. STAYTON anp MORTON WIENER 
Clark University 


IVEN a relationship between size estimation 

and value, the finding still requires theo- 
retical explanation. Tajfel (1957) and Pastore 
(1949) noted that despite differences in results 
among the various studies concerned with the 
problem, larger coins—in the range from penny 
to half dollar—are overestimated more than 
smaller coins. After further analysis of the ex- 
perimental evidence, Tajfel concluded that in a 
series of stimuli “where value changes concur- 
rently with the dimension subjected to investiga- 
tion, the differences between the stimuli of the 
series will be perceived as larger than the objec- 
tively equivalent differences between the stimuli 
of a neutral series, where no association exists 
between value and magnitude” (p. 193). 

In all of the experiments on record the rela- 
tionship between size estimation and value has 
always implied an overestimation of the valued 
object. In every instance higher value has always 
covaried with “greater than” or “more than” in 
the magnitude series, because of the particular 
stimuli—e.g., coins and candy—that were for- 
tuitously selected for study. If a physical at- 
tribute dimension covaries with the value of the 
objects in a series, and this covariance results in 
an accentuation of differences in the physical 
attribute dimension, as suggested by Tajfel, 
there is no a priori reason why estimation of size 
and value must be positively correlated, i.e., 
greater value need not always result in overesti- 
mation. When the value of stimuli in a series is 
correlated with a decrease in the magnitude of a 
physical attribute, accentuation of differences in 
the physical attribute should result in an under- 
estimation of the size of the valued object, rather 
than overestimation. A finding of underestimation 
in such an instance would lend support to the 
theoretical model suggested, i.e., that accentuation 
of a physical attribute dimension underlies the 
relationship of size estimation and value. 

The new popularity of smaller cars permitted a 
test of this hypothesis, 


METHOD 


Twenty-five adults in an evening college psychology 
class served in a group as subjects. Subjects were first 


1 This report is based in part on a thesis submitted 
by the first author in partial fulfillment of his MA at 
a University done under the direction of the second 
author, 


given two sheets of paper. On the first sheet were the 
following instructions (also read aloud by the experi- 
menter); 

Below are six lines, three marked X-Y and three 
unlabeled. The lines, measured by the distance from 
X to Y, represent the length, width, and height of a 
*58 Chevrolet Bel-Aire, drawn to scale. The task is 
to mark off on the lines below the X-Y lines the 
length, width, and height of the ’58 Volkswagen in 
relative proportion to the length, width, and height 
of the Chevrolet. Mark off the Volkswagen dimen- 
sions by placing a short, vertical mark to the right 
of the vertical line already shown, the same way as 
the distances of the Chevrolet dimensions were 
marked off. 

On the second sheet subjects marked off the dimen- 
sions of the Royal portable typewriter in proportion 
to those of the Royal standard typewriter. This task 
was used in order to obscure the purpose of the experi- 
ment. The sheets were then collected. 

Another set of four sheets was then distributed. The 
first task was a modified version of Osgood’s (1957) 
semantic differential for the Volkswagen, consisting of 
five 20-point scales: positive-negative, desirable-unde- 
sirable, pleasant-unpleasant, nice-awful, and good-bad. 
The second task was a semantic differential like the 
first, except this time subjects scaled the attribute 
“smallness of the Volkswagen.” 

The third task, the “desire to own scale,” had the 
following instructions: 

Below is a list of four objects. Indicate by a check- 

mark on the dotted line how much you would like 

to own each object at the present time. If you already 
own one or more of the objects, then indicate your 
desire to own it as if you no longer Possessed the 
object(s). There is something else that is important. 

Assume that the cost of the objects or their trade-in 

value is not important. Pretend that you are so rich 

that the cost of the objects does not make any differ- 

ence; the only thing that matters is that some objects 

are more desirable to own than others. 
The four objects were the Royal standard and portable 
typewriters, the Volkswagen, and the Chevrolet Bel- 
Aire. The 15-point scale was anchored by the state- 
ments on the left: “I would not like to own one at all”; 
and on the right by: “I would like to own one very 
much.” ji 

The fourth task required the subjects to write a 
25-word opinion of the Volkswagen. The purpose of 
this last task was to try to get a sample of the hand- 
writing from which some estimate could be derived of 
the way the subject used space in general. Klein (1954) 
has suggested that unless characteristic styles of space 
utilization are considered, some relationships between 
size judgments and value may be obscured. 

Careful consideration of the problem of order effects 
led to the judgment that a fixed order of presentation 
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of tasks would probably result in less possible con- 
founding—particularly between the desire to own and 
size judgments—than using either a random or a 
counterbalanced order. 


REsULTS 


Five scores were established for each subject. 
The desire to own score (the value of the Volks- 
wagen to the subject) was determined by com- 
puting the ratio of the score marked for the Volks- 
wagen to the average score of the other three 
objects. The Size Estimation score was the prod- 
uct of the length, width, and height judgments for 
the Volkswagen. The Size of Handwriting score 
was the number of inches occupied by the first 
20 words in the description of the Volkswagen. 
The Value of Smallness score was derived from 
the five evaluative scales of the semantic differen- 
tial for “smallness of the Volkswagen.” A semantic 
differential value score for the Volkswagen was 
computed from the five evaluative scale responses 
to the car. 

Twenty-four of the 25 subjects underestimated 
the size of the Volkswagen. The results would 
therefore be the same whether the size estimations 
scores are expressed as the subjects size judg- 
ments or as a deviation from the veridical size of 
the Volkswagen or from the relative size of the 
Chevrolet. Because no assumptions could be made 
about the underlying distribution of the scores, 
nonparametric statistical analyses were performed. 

Kendall rank order correlations (tau) were com- 
puted between the various relevant scores. A tau 
of —.28 (p < .05, two-tailed test) was found be- 
tween the desire to own the Volkswagen and the 
Size Estimation score; also a negative, although 
nonsignificant, correlation was found between 
the semantic differential evaluative score and 
size estimation (tau = —.16, > .05, two- 
tailed test).? The greater the value of the Volks- 


2 The low positive, nonsignificant correlation between 
the desire to own score and the semantic differential 
score (tau = +.16, p > .05) suggests that the two 
Scores may be measuring different types of responses to 
the Volkswagen and/or that one may be a more sensi- 
tive measure of value for each subject than the other. 
Our prior decision to use the desire to own score as 
the measure of value of the Volkswagen is supported 
by the following considerations: (a) Twenty-two of the 
25 subjects scored on the positive half of the evaluative 
continua, and 5 of these checked the highest point on 
all five of the evaluative scales used. In contrast, 12 sub- 
jects were above and 13 subjects below the value of one 
on the desire to own ratio index of value; i.e., for 12 of 
the 25 subjects the desire to own score for the Volks- 
wagen was higher than the average of the scores for the 
other three objects and for 13 subjects it was lower. 
(6) The desire to own score assessed each subject’s 
value of the Volkswagen in relation to judged values of 
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wagen to the subject, the greater the tendency 
to underestimate the size of the car. Since the size 
estimation was based on a unit relative to the 
given size of the Chevrolet, one could hypothesize 
that the underestimation found could also be a 
function of the greater value of the Chevrolet, 
i.e., the greater the desire to own the Chevrolet, 
the smaller the size estimation of the Volkswagen 
and thus overestimation of the valued object, the ` 
Chevrolet, This interpretation would require a 
negative correlation between the desire to own the 
Chevrolet and the size estimation of the Volks- 
wagen. On the contrary, a positive (but non- 
significant) correlation was obtained between 
the desire to own the Chevrolet and estimated size 
of the Volkswagen (tau = +.12, p > .05). There 
is also a negative correlation between desire to 
own the Volkswagen and desire to own the Chevro- 
let (tau = —.45, p < .01). 

A positive and significant relationship (tau = 
+.37, p < .05) was found between size of hand- 
writing and the size estimation of the Volkswagen. 
Partialing out handwriting size to control for 
characteristic usage of space results in a slight 
but nonsignificant increase (tau = —.31)3 in the 
relationship between size estimation and desire 
to own a Volkswagen. A tau of +.277 (tau.os = 
.28, two-tailed test) was found between desire to 
own the Volkswagen and the Value of Smallness 
score of the Volkswagen derived from the seman- 
tic differential. The tau between the Value of 
Smallness and Size was —.26 (.06 < p < .07, 
two-tailed test). 


DISCUSSION 


The results are consistent with the hypothesis 
that if increasing value in a series of objects is 
correlated with decrease in the magnitude of a 
physical attribute, underestimation of the size 
of the valued object results, a special case of the 
more general hypothesis that accentuation of a 
correlated physical attribute is the underlying 
construct in accounting for judgments of a value 
series. 

There is some evidence that the object chosen 
for this study met the minimal requirements to 


three other objects. This relative judging minimized 
each subject’s scale response bias which the semantic 
differential score may fail to control. (c) Evaluation 
via the semantic differential does not necessarily 
imply value or relevance to the person himself. On the 
other hand, the desire to own score in itself implies 
personal value and relevance. 

*The value of tau is not comparable to that of a 
tho or Pearson r. The variance accounted for by the 
tau cannot be estimated as it is for the Pearson product- 
moment correlation (see Siegel, 1956). 
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test the general hypothesis. The positive relation- 
ship found between value of the object (desire to 
own) and the value of the physical attribute (small- 
ness), and the negative correlation between the 
value of smallness and the size estimation, both 
suggest that for subjects in general, the value of the 
Volkswagen is related to its smallness. However, 
for some persons, smallness may be the least de- 
sirable attribute, and the Volkswagen may be 
desired instead for economy of operation or status 
or some other reason, That such is the case is sug- 
gested by the tau of .277 between desire to own 
and the value of smallness, which is smaller than 
would be expected if smallness were unequivocally 
positive for all subjects. 

The evidence is not consistent with alternative 
hypotheses that the amount of underestimation 
of Volkswagen size is a function of greater desire 
to own the Chevrolet, nor that the greater the 
desire to own a Volkswagen the more subject 
should know about the Volkswagen, i.e., the more 
veridical should be his estimate. On the contrary. 
the greater the desire to own the Volkswagen the 
greater the deviation is the size estimate from the 
veridical size of the Volkswagen, and there is a 
Positive (although nonsignificant) rather than a 
negative relationship between desire to own a 
Chevrolet and estimated size of Volkswagen. The 
perceptual judgment measured in this experiment 
is mediated by memory. Previous research (Carter 
& Schooler, 1949) has indicated that judgments 
from memory appear to accentuate the relation- 
ship between value and size estimation. Further 
investigation would be required to determine 
whether similar results would be found in a judg- 
mental task with the object present. 

Although size and weight have been the most 
common attribute dimensions investigated, ac- 
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centuation should also result in under- or over- 
estimation of any attribute dimension (bright- 
ness, shape of nose, etc.) correlated with value. 
Thus, in regard to the attribute of skin color 
(Marks, 1940; Secord, Bevan, & Katz, 1956) 
Negroes perceive their own skin color as less dark 
(in a culture that values “whiteness”). But sun 
bathers proud of their sun tan would probably 
accentuate the darkness of their skins, American 
women may accentuate the thinness of their 
waistlines, but in cultures where plumpness is 
valued, accentuation in the opposite direction 
would be expected. The earlier finding that valued 
objects are overestimated would then appear to 
be only a limited case of accentuation as a process 
hypothesized to account for a wide area of social 
perception and judgments. 
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DOGMATISM AND LEARNING! 


HOWARD J. EHRLICH 
Columbus Psychiatric Institute and Hospital, Ohio State University 


pee is an exploratory study of one aspect of 
Personality and its functional significance 
in a learning situation. Specifically its purpose is 
twofold: to investigate the validity of certain 
aspects of the construct of dogmatism, and to 
demonstrate the role of personality factors in a 
learning situation. 

The construct of dogmatism has been intro- 
duced and defined by Rokeach (1954) as “(a) a 
relatively closed cognitive organization of beliefs 
and disbeliefs about reality, (b) organized around 
a central set of beliefs about absolute authority 
which, in turn, (c) provides a framework for 
patterns of intolerance and qualified tolerance 
toward others.” The present research is concerned 
with the first defining characteristic of dogmatism, 
its closed cognitive structure—and its logical 
concomitant, resistance to change. Through a 
quasi-experimental technique an attempt was 
made to demonstrate the relationship of dog- 
matism to the degree of learning resulting from a 
given classroom exposure. If the postulated 
“closed cognitive structure” of dogmatic persons 
is empirically correct, then persons high in dog- 
matism should exhibit less learning in a classroom 
situation than those low in dogmatism, 

Two interrelated hypotheses have therefore 
been tested: Dogmatism is inversely related to the 
degree of learning in a classroom situation, and 
the predicted relationship between dogmatism 
and learning is independent of academic aptitude. 


METHOD 


Subjects. The original sample consisted of 100 
students enrolled in four introductory sociology 
sections who were present for both the first and second 
administration of the test battery, and for whom 
Ohio State Psychological Examination (OSPE) scores 
were available from the university. A third adminis- 
tration, by mail, reduced the sample size to 57. 

Procedure. During the first week of the academic 
quarter subjects were presented with the Rokeach 


1The materials presented here represent a re- 
analysis and follow-up study on the author’s master’s 
thesis submitted to the Graduate School of the Ohio 
State University in 1955, and supersedes the original 
report. I am grateful to John Evans, Robert Stuckert, 
and Mark Lefton for their assistance in the collection 
of data. Finally I should like to express my gratitude 
to Melvin Seeman and Milton Rokeach for their 
critical reading and assistance with the original report. 


Dogmatism Scale (1956) and a 40-item true-false test of 


sociological knowledge. Half the test items were 
statements of empirical generalizations and the other 
half, statements of definition. The sociology test was 
readministered during the last week of the quarter, an 
approximate time lag of 10 weeks. Five months later 
the Dogmatism scale and the sociology test were 
mailed to the original subjects. Of 62 returns, 57 were 
complete, Split-half reliability for the Dogmatism 
scale is .75; and the correlation between the two 
administrations of the scale, separated by 5-6 months, 
is .73. 


RESULTS 


The intercorrelations of the sociology test 
scores appear in Table 1. The absolute changes 


TABLE 1 


INTERCORRELATIONS BETWEEN Docmatism, OSPE, 
AND SOCIOLOGY TEST SCORES 


Sociology Test 


OSPE 
Time 1 


Time2 | Time3 


Dogmatism 
OSPE - 
Sociology Test (n) 
Sociology Test (ts) 


* p <.05, 
-p <0. 
*** p <.0001. 


in the test scores for the three administrations 
were all significantly different from each other, 
using ¢ tests for correlated means, at p < .001. 


Mail Returns vs. Nonreturns 


A comparison of the 57 complete returns with 
the 38 nonreturns indicated a uniform pattern of 
mean differences. Nonreturns tended to be more 
dogmatic and to score lower on the OSPE, though 
these differences were not significant at the .05 
level. They performed significantly less well on 
the second sociology test (¢ = 3.45; p < .01). The 
bias of the return sample in the direction of 
eliminating those scoring higher on dogmatism 
and lower on the sociology test operated against 
the hypothesized inverse relationship between 
the two by constricting their respective distri- 
butions. The data thus provided a more rigorous 
test of the hypotheses, 
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TABLE 2 


PARTIAL CORRELATIONS BETWEEN DOGMATISM, 
OSPE, AnD Sociotocy TEST SCORES 


Partial r with Partial z with 


Be = OSE es 
Constant ` Constant 
Sociology Test (t) —.24 20 
Sociology Test (t) —.48** -21 
Sociology Test (és) — .49** -26 
“p <01, 
TABLE 3 


SECOND ORDER PARTIAL CORRELATIONS BETWEEN 
Docmatism, OSPE, AnD SocioLocy TEST SCORES 


Partial with Partial r with 
OSPE 


Dı tism 
OSPE and tism 
seal Test ad Initial 
res Scores 
Constant t 
Sociology Test (t) —.43** -12 
Sociology Test (t) —.45** +18 


* p <01 


The Relationship between Dogmatism, OSPE, and 
Learning 

The zero-order correlations between the three 
variables appear in Table 1. As expected, dog- 
matism yields an inverse relationship, and OSPE 
a direct relationship, to learning in all three time 
periods. In Table 2, first-order partials, controlling 
alternately for OSPE and dogmatism, indicate, 
as predicted, that (a) dogmatism is significantly 
and inversely related to learning, and (6) can 
account for a greater proportion of the variance 
in the sociology test scores than OSPE. 

An alternative procedure for exploring the 
relationship between dogmatism and learning, 
that of testing the differences on sociology test 
Scores for high and low dogmatic subjects, divided 
at the median, yielded ¢ values for each of the 
three administrations that were significant beyond 
the .005 level. Thus subjects low in dogmatism 
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entered the sociology classroom with a higher 
level of learning, learned more as a result of 
classroom exposure, and retained this information 
to a significantly greater degree than the more 
dogmatic subjects. 

Since the subject’s initial level of competence in 
the subject matter of the course may itself be 
regarded as a determinant of further learning in 
the classroom, and since the initial scores on the 
Sociology test manifest only a slight relationship 
to academic aptitude, it is necessary to control for 
this as well. Second-order partial correlations 
controlling for initial test scores and alternately 
for OSPE and dogmatism are presented in Table 
3. While the relationship between dogmatism and 
learning remains relatively unaffected, the par- 
tialing out of initial test scores decreases the 
correlations between OSPE and learning con- 
siderably. 


Discussion 


The basic hypotheses of this research that 
dogmatism would be inversely related to the 
degree of learning in a classroom situation, and 
that such a relationship would be independent of 
academic aptitude have been confirmed. The 
study materials add further evidence to the 
validity of Rokeach’s theoretical formulations on 
the nature of the dogmatic mind. Further study 
into the generality of the dogmatism-learning 
relationship seems warranted. 

Finally, an incidental finding of the research 
that the mail returns in a student population 
were biased in the direction of the “better” and 
less dogmatic subjects suggests the need for 
caution in the use of mail questionnaires in such a 
population. 


REFERENCES 

Roxeacu, M. The nature and meaning of dogmatism. 
Psychol. Rev., 1954, 61, 194-204. 

Roxeacu, M. Political and religious dogmatism; An 
alternative to the authoritarian personality, 
Psychol. Monogr., 1956, 70(18, Whole No. 425). 

(Received September 24, 1959) 


Journal of Abnormal and Social Psychology 
1961, Vol. 62, No. 1, 150-152 


SOME CONTRAST EFFECTS 
DOUGLAS S. HOLMES anp 


University of Wisconsin 


TUDENTS of social perception seem increasingly 
S to take the position that the factors affecting 
a person’s judgments or perceptions of other 
people have little (that is important, at least) in 
common with factors affecting his perception of 
inanimate objects (Tagiuri & Petrullo, 1958). 
Nevertheless, it can be shown that principles that 
affect evaluations of inanimate objects also hold 
for some classes of social judgments. Conditions 
analogous to those producing contrast and as- 
similation in judgments of weights also bring 
about these effects in judgments of another’s 
attitudes (Hovland, Harvey, & Sherif, 1957; 
Peak, 1958). In these studies the subjects com- 
pared the evaluated object, whether it was alive 
or not, with some standard. Contrast occurred, 
to take an example, when the psychological 
distance between the object and the standard was 
relatively great; the subjects judged the object as 
being further from the standard than it actually 
was. On the other hand, assimilation, judging the 
object as being nearer to the standard than it 
was, took place when the standard and object 
were relatively close. 

Generalizing from these findings, we would 
expect to obtain contrast effects in the perception 
of people whenever a person is confronted se- 
quentially with two strangers (so that prior learning 
does not affect his evaluations), if the stimulus 
people are strikingly different from each other on 
some judgmental dimension, The major hypothesis 
of this paper is that an anchorage or standard is 
formed on the basis of the experience with the first 
person and that subsequently contrast occurs in the 
perception of the second stranger when there is a 
great perceived difference between the two. 

In testing this view the subjects in the present 
experiment were required to listen to an interview 
between a psychologist and a female college 
student. This psychologist, who was either very 
belligerent or friendly, probably served as the 
standard for the subjects’ subsequent series of 
evaluations where they rated the Personalities 
both of another, more neutral, psychologist and 
of a male student from brief recordings of their 
voices. Since the male student is decidedly different 
from the “belligerent psychologist” (ie., he is 

1 This research was supported by Research Grant 
M1540 (C2) from the National Institute of Mental 
Health, United States Public Health Service. This 


study is part of a larger program of research on factors 
affecting aggressive behavior. 
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not an older, aggressive, high status professional), 
the immediate contrast between the two is rela. 
tively great, and the student will be evaluated asa 
pleasant person. The contrast is not as sharp 
when the “friendly psychologist” serves as the 
standard; he and the student are more similar 
behaviorally. Comparing these two conditions, we 
expect the student preceded by the “belligerent 
psychologist” to be regarded as a friendlier person 
than the student following the “friendly psy- 
chologist.”” 

Experiences intervening between the presenta- 
tion of the intended standard and the reception of 
the information to be judged also can affect the 
magnitude of the contrast effect. If the first 
psychologist is followed immediately by the 
second, more neutral psychologist, the standard 
category, “psychologist” becomes less extreme 


| 


(Helson, 1948), so that there is a smaller difference | 
between this anchorage and the male student, and 
the contrast effect is less. In general, then, the 
male student should receive a more favorable 
evaluation when (a) he follows the “belligerent — 


psychologist,” and (b) before rather than after 
judgments are made of the second, neutral psy- 
chologist. 


f 


MeErtHop | 
Subjects 
Approximately 200 subjects were recruited from an 
introductory psychology class, and were assembled in — 
groups ranging in size from 1-24 persons, depending 
upon their schedules, with the entire group receiving 
a given experimental treatment. In order to achieve 
Proportional cell frequencies across experimental 
conditions for the analyses of variance, subjects wer | 
randomly discarded until there were 36 in each of the 
| 
| 
l 


four main conditions, for a total of 144 subjects. There 
were two females to every male in each condition, but 
the sex variable does not have any significant effects 
by analysis of variance. 


Procedure and Experimental Design | 


When the subjects arrived in the experimental room 
they were told the purpose of the study was to discover 
Some of the personality factors affecting an individual's 
“social perceptiveness.” All had previously taken & 
brief personality inventory. The scores on this in- 
ventory, they were informed, would be related to the 
accuracy with which they assessed the personalities of 
others on the basis of tape recordings of their voices, 

The experimental conditions, organized in a factorial 1 
design, were created by means of these tape recordings: 
The main conditions are as follows: 
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Anchorage: Belligerent or Friendly Psychologist. 
The first recording presented to the subjects was that 
of a supposed actual interview between a psychologist 
in the college administration and a female student 
applying for a loan or scholarship. The same actors 
took the two parts in both versions of the interview. 
For half of the subjects the recording portrayed the 
psychologist as nonsupportive and fairly belligerent, 
while he acted in a much more supporting and 
friendly manner in the recording played to the other 
subjects. Since this psychologist was the predomi- 
nant figure in the skit, both by virtue of the length 
of time he spoke and the preliminary instructions 
that the subject concentrate on his personality, we 
assume that the judgmental anchorage was largely 
influenced by his behavior. Immediately after the 
interview half of the subjects in each condition 
evaluated the psychologist’s personality on a 
questionnaire and all subjects described their per- 
ceptions of him on a 28-item adjective check list. 
No significant differences were found on any of the 
dependent variables between the subjects com- 
pleting the questionnaire and those not responding 
to this form, and this distinction is ignored in the 
present report. 

Stimulus Objects: Psychologist and Student. Fol- 
lowing these evaluations, all subjects listened to two 
one-minute-long recordings in which a second male 
psychologist and a male college student talked about 
themselves. The order of presentation of these 
voices was systematically varied with half of the 
subjects hearing the psychologist first, and the 
others the student first. 


Dependent Variables 


After each presentation, subjects attempted to 
describe the second psychologist and the student on a 
56-item adjective check list. In addition, after they 
had heard both voices the subjects were given a brief, 
7-item questionnaire on which they simultaneously 
tated both the second psychologist and the student. 
Two of these questions assessed the favorableness of 
subjects’ evaluations. They are: “Would you admit 
him into your circle of close friends?” and “Would you 
admit him into the social organization (lodge, fra- 
ternity, club, sorority, etc.) of most importance to 
yourself?” The subjects answered by placing a check 
mark at the appropriate point on a line 644 inches 
long, with the ends labeled as “Definitely yes” and 

Definitely no.” The subject’s evaluation score was the 
distance, in one-sixteenth of an inch units, away from 
the favorable end, summed over both items. 

a — 
k Pe RESULTS 

A Analyses of variance of both the adjective check 
list and questionnaire evaluations of the inter- 
Viewing psychologist indicated that the pre- 

inary skit was highly effective in creating the 
desired perceptions of this first psychologist. For 
both measures the “belligerent psychologist” was 
evaluated in a significantly more unfavorable 
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TABLE 1 


EVALUATIONS OF THE SECOND PSYCHOLOGIST AND 
THE MALE STUDENT FOLLOWING THE 


ANCHORAGE EXPERIENCE 
Tere Stimulus Stimulus Person 
Order “Student Psychologist 

A. Adjective 

Check List 

Scores 

Belligerent P-S 16.50p* 18.19, 

Belligerent S-P 13.564 22.474 

Friendly P-S 15.53 19.315 

Friendly S-P 17.08bs 19.11» 
B. Questionnaire 

Scores 

Belligerent P-S 73.56¢ _98.33np 

Belligerent SP 66.83, 114.53, 

Friendly P-S 71.64, 109.08, 

Friendly S-P 91.755 103.36. 


& Cells containing the same subscript are not significantly dif- 
ferent from each other (at the .05 level of confidence) by Duncan 
multiple range test. Separate analyses were conducted for the 
adjective check list and questionnaire scores. For both measures 
the higher the mean, the more unfavorable the evaluation. 


manner than the “friendly psychologist” (p < 
.01). 

By and large, as can be seen in Table 1, the 
presumed comparison person affected the evalu- 
ations of at least the male student on both meas- 
uring instruments. The latter stimulus person 
received the most favorable evaluations when the 
subjects were given the experience with the 
“belligerent psychologist” as a comparison and 
they were asked to judge the student before they 
judged the second, neutral psychologist. Similarly, 
he was evaluated most unfavorably when the 
subjects were provided with the “friendly psy- 
chologist” as the anchorage and then heard the 
student’s voice before they heard the voice of the 
second psychologist. In both instances there is a 
contrast effect, which apparently is sharpest 
when the extreme cases follow in succession. 

The table also shows that there is no simple 
generalization of evaluations from the first to the 
second psychologist, Instead, here too there is an 
indication of judgmental contrast. The neutral 
psychologist received the most unfavorable 
evaluations when the “belligerent psychologist” 
served as the standard, but only if the subjects 
had listened to the student before hearing this 
stimulus person. It may well be that the second 
psychologist was contrasted with the extremely 
favorably regarded student. 


Discussion 


The results obtained lend weight to the view 
that the judgmental phenomena obtained in the 
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psychophysics laboratory often have their counter- 
parts in the perception and evaluation of people. 
Effects such as those demonstrated in this experi- 
ment seem important for at least two main reasons: 
(a), their probable operation in everyday life; and 
(b), their possible operation in psychological 
experiments designed to isolate quite different 
phenomena (such as experiments on hostility). For 
example, in studies of hostility displacement, the 
subjects’ experience with the frustrator may serve 
as the anchorage in their judgments of other 
people. If these others are relatively similar to the 
frustrator there should be a judgmental assimila- 
tion which would result in their receiving un- 
favorable evaluations, On the other hand, if the 
others are considerably different from the per- 
ceived frustrator, there should be a contrast 
effect in which the others are regarded more 
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favorably than they would have been in the 
absence of the anchorage experience, a 
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A NOTE ON ANXIETY, INSTRUCTIONS, AND 
WORD ASSOCIATION PERFORMANCE! 


IRWIN G. SARASON 
University of Washington 


N an earlier experiment (Sarason, 1959), 

measures of test anxiety and general anxiety 
were related to the subject’s performance ina situ- 
ation involving two consecutive group adminis- 
trations of a word association test (WAT). It was 
then possible to compare WAT responses on the 
first administration with those given on the second 
and also to obtain commonality scores by com- 
parisons with a normative group. In this earlier 
study, half of the subjects took WAT under 
instructions stating that it was a sensitive per- 
sonality test (experimental condition); the other 
subjects were merely told the rules for responding 
(control condition). It was predicted that the 
Test Anxiety Scale (TAS) (Sarason, 1958) would 
relate, particularly in the form of an interaction 
with instructions, more significantly to WAT 
performance than would two measures of general 
anxiety, Bendig’s short form of Taylor’s Manifest 
Anxiety Scale (MAS) (Bendig, 1956) and the 
Lack of Protection Scale (LP) (Sarason, 1958). 
The results of the experiment did not support 
this expectation; in fact, there were suggestions 
in the commonality analyses of greater interaction 
between MAS and experimental instructions than 
for the comparable interaction with TAS. 

The present experiment was designed to follow 
up a problem posed by these results, The in- 
structions originally given to the subjects stressed 
WAT as a personality test. The TAS items, 
however, deal largely with individuals’ reactions 
to achievement and intelligence tests. If the 
earlier experiment were to be repeated with 
subjects told that WAT is a sensitive measure 
of intelligence rather than of personality, would 
the prediction of a more significant interaction of 
instructions with TAS than with MAS and LP 
be supported? The present experiment was 
carried out to obtain an answer to this question. 


METHOD 


Subjects. Subjects were 80 students in introductory 
psychology courses at the University of Washington. 
Independent of the WAT situation, subjects had been 
given the TAS, MAS, and LP. On each of these scales, 


1 This research was supported by grants from the 
Agnes H. Anderson Fund and the United States 
National Institute of Mental Health (M-2397). A 
modification of this paper was read at the American 
Psychological Association meetings in Cincinnati, 
Ohio, September 5, 1959. 


subjects were divided at the median into high and low 
groups. High TAS subjects had scores of five or above; 
low TAS subjects had scores of four or below. High 
MAS subjects had scores of six or above; low MAS 
subjects had scores of five or below. High LP subjects 
had scores of eight or above; low LP subjects had 
scores of seven or below. Since the collection of the 
WAT data was independent of the personality test 
data, it was necessary in analyses of variance to exclude 
certain subjects randomly from some experimental 
groups in order to obtain proportionality of cell fre- 
quencies. Thus, the Ws for the various analyses were 
not identical. 

Procedure. All subjects received standard control 
instructions (Sarason, 1959) in the group administration 
of WAT. Control subjects received only these instruc- 
tions, whereas the experimental groups also were told 
the following: 

As you will see, the test you are about to take is 
a word association test. This is one of the most 
sensitive tests of intelligence, From your perform- 
ance on this test, we will obtain an IQ measuring 
your level of general intellectual ability. 

Responses were untimed. When the subjects com- 
pleted the first WAT, they were given the second test 
with instructions to give the same responses as they 
had given on the first administration. 


RESULTS 


The data were analyzed in terms of 2 X 2 
analyses of variance. Three measures of WAT 
performance were used: The reproduction dis- 
crepancy score (RDS), defined as the number of 
times subject’s responses on the second WAT were 
different from the responses given on the first 
administration; The Commonality-1 score, the 
number of times subject’s responses on the first 
WAT agreed with the most frequent response 
given by Russell and Jenkins’ normative group 
(Russell & Jenkins, 1954); The Commonality-2 
score, the number of times subject’s responses on 
the first WAT were the same as words which 100 
or more subjects in the normative group gave to 
the stimulus. j 

TAS. The analysis of reproduction discrepancy 
scores revealed a significant effect for the variable 
of Instructions (F = 4.56, p < .05, df 1, 70) with 
higher scores in the experimental than in the 
control group. High and low TAS groups did not 
differ, although the interaction between TAS and 
Instructions approached significance (F = 3.11, 
05 < p < -10, df 1, 70) with a tendency for 
greater RDSs for high TAS than low TAS subjects 
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TABLE 1 
MEAN REPRODUCTION DiscrEPANCY Scores (RDS) 


AND ComMMONALITY SCORES FOR GROUP DIVDED 
on Basis OF Test ANXIETY ScaLE (TAS) AND 


INSTRUCTIONS 
FAS mon |o | RDS (Oya aiy 
High | Intelligence | 19 | 25.53 | 26.79 | 38.16 
High | Control 18 | 14.83 | 39.28 | 54.17 
Low Intelligence 19 | 18.74 | 33.63 | 48.58 
Low Control 18 | 17.72 | 30.39 | 44.11 


under the intelligence instructions and vice versa 
under standard instructions. The high TAS group 
under the intelligence instructions showed a 
significantly higher mean RDS than all other 
groups combined (p < .01). The relevant data are 
presented in Table 1. 

There was only one significant effect in each of 
the commonality analyses. For Commonality-1, 
there was a significant TAS X Instructions inter- 
action (F = 7.04, p < .01, df 1, 70). This effect 
was also significant for Commonality-2 (F = 
7.55, p < .01, df 1, 70). In both cases the high 
TAS group under intelligence instructions showed 
lower commonality scores than the comparable 
low TAS groups. Under standard instructions, 
the tendency was for the high TAS subjects to 
obtain higher commonality scores. 

MAS and LP. For both MAS and LP, there 
were higher RDSs under the intelligence instruc- 
tions than under the standard condition, but 
neither the MAS nor the LP analyses showed any 
effects due to anxiety or anxiety X instructions. 


DISCUSSION 


This study began with the question of whether 
TAS interacts with instructions more significantly 
than MAS and LP. The answer appears to be 
affirmative. As in the earlier experiment, the 
TAS X Instructions interaction tended toward 
significance when the measure was the number 
of disagreements between subject’s responses to 
two word association tests (RDS). However, the 
commonality measures of subject’s agreement with 
frequent response words in a normative group 
yielded highly significant interactions in the 
present study but not in the earlier one. 

In the earlier experiment, there were significant 
effects due to both MAS and MAS X Instructions 
for the RDS and commonality measures. No such 
effects were observed here. While previous research 
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(Sarason, 1956) has shown MAS to relate si 
nificantly to certain kinds of intelligence instruc 
tions, the nature of the task and the ac 
instructions used may be highly important. 
the present task, it would appear likely that 
change in experimental instructions from 4 
personality to an intelligence emphasis contribute 
to the absence of significant effects involvit 
MAS. A similar explanation seems appropriate i 
the LP results. 4 
The present TAS results support the hypothi 
that the closer the nature of the experimer 
stress is to the content of anxiety items, the 
predictive the items are. One interpretation 
that under the intelligence instructions, subj 
with high TAS scores tended to become threaten 
and disorganized and emitted relatively mo 
interfering and idiosyncratic self-centered 
sponses than did other subjects. This ten 
was reflected in the greater number of relai 
unusual or less frequently occurring responses ai 
greater RDSs in the high TAS intelligence 
than in the high TAS control group. No s 
differences were found for the experimental 
control low TAS groups. 
The findings of both the earlier experiment an 
the present one are consistent with a construc! 
anxiety conceived as a class of self-orien 
responses which interfere with ongoing 
formance. Anxiety scales in this view might 
considered as devices for selecting individuals 
differ in the tendency to emit these responses 
appropriate motivational conditions. 
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SOME VERBAL ASPECTS OF PRIMARY PROCESS THOUGHT IN SCHIZOPHRENIA! 


ALVIN G. BURSTEIN? 
University of Chicago 


N connection with his study of dreams, Freud 

developed the concept of primary process, a 
mode of mental functioning possessing charac- 
teristics much different from those of typical 
waking thought. Primary process has been said to 
characterize not only dreams but also the thinking 
of schizophrenics. Freud (1925) specifically related 
primary process to the verbal behavior of schizo- 
phrenics when he stated, “In schizophrenia words 
are subject to the same process as that which 
makes dream images out of dream thought, the 
one we have called the primary mental process” 
(p. 131). 

The present study attempts to test the hy- 
pothesis that the effects of primary process are 
evident in the verbal behavior of schizophrenics. 
Two aspects of primary process thought were 
chosen: the treatment of antonyms as synonyms, 
and the treatment of homonyms as synonyms. 


METHOD 


Antonyms. Freud (1949), describing one aspect 
of primary process thought, said, “ . . . contraries 
are not kept apart from each other, but are treated 
as though they were identical” (p. 53). This 
confusion of opposites and identicals is specifically 
related to the area of language by Freud (1956): 
“,, the most ancient languages behave exactly 
like dreams in this respect... (strong-weak, 
old-young, far-near, bind-sever), . . . ” (p. 318). 


As a measure of this presumed tendency, a task was 
developed consisting of 97 multiple-choice items, on 
each of which subject was instructed to choose the 
synonym for a given word from among three alter- 
natives consisting of a synonym, an antonym, and a 
third alternative unrelated to the word to be defined. 
This third type of alternative is called here the uni- 
dentified error alternative. An example will serve to 
illustrate: 


Circle the word that means almost the same as BIG: 
Large (syn.) Small (ant.) Individual (ue.) 


1This paper is based upon a thesis submitted in 
partial fulfillment of the requirements for the degree 
of Doctor of Philosophy, University of Chicago, 1959. 
The study was supported in part by Research Grant 
M-1416 from the National Institute of Mental Health, 
United States Public Health Service. 

*The author is indebted to the members of his 
graduate study committee, and especially to his 
sponsor, Loren J. Chapman, for invaluable guidance 
at €very stage of the research. 

Currently on the staff of the Department of Psy- 
chology at the University of Michigan. 


The purpose of the unidentified error alternative is to 
measure the number of antonym markings made upon 
bases other than those anticipated by the hypothesis, 
e.g., obscure communalities, idiosyncratic associations, 
random markings, etc. Since choices made upon such 
unanticipated bases would by chance result as often 
in marking the antonym alternative as in marking the 
unidentified error alternative itself, a score of antonym 
markings minus unidentified error markings would 
represent a relatively pure measure of the number of 
choices of antonyms per se. Another way of putting 
it is that there should be as many errors made for 
unidentified reasons hidden within the antonym 
markings as there are markings of the unidentified 
error alternatives themselves, so that subtracting out 
the unidentified error markings yields a corrected 
measure of sheer choice of antonyms per se. 

In order to control for the effect upon choices of 
frequency of occurrence in the language, the antonyms 
and unidentified error alternatives were roughly 
equated in the task as a whole by Thorndike and 
Lorge’s (1944) summary of four different word fre- 
quency counts. 

To control for the effects of position markings, each 
type of alternative appeared equally often in each of 
the three possible positions, with the possible sequences 
of alternatives appearing randomly. 


Homonyms, Freud gave many examples of 
primary process in which similarity of sound is 
treated as similarity of meaning. For example, a 
dreamer is said to have substituted “ears” (Ahren) 
of barley and wheat for “honor” (Ehren) on the 
basis of their being homonyms (1956, p. 406). 


Accordingly, a task was constructed consisting of 48 
items parallel in form to those discussed above. Each 
item required the subject to choose the synonym for a 
given word from among three alternatives, a synonym, 
a homonym, and an unidentified error alternative. 
The following sample is illustrative: 


Circle the word that means almost the same as SEA 
Ocean (syn.) See (hom,) Idol (ue.) 


This design presupposes, of course, that in reading the 
items silently the subject is aware of how the words 
would sound if read aloud. If this were not the case, 
the subject could not respond on the basis of two words 
as homonyms. ste 

Once again, the purpose of the unidentified error 
alternative was to measure choices of the predicted 
error that were made upon bases other than those 
anticipated. In a like manner, an error score of 
homonym markings minus unidentified error markings 
would yield a relatively pure measure of the treatment 
of homonyms as synonyms. In order to control for the 
effects of frequency of occurrence in the language, each 
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homonym appeared also as an unidentified error 
alternative in another item. 

We may now state the hypothesis in operational 
terms: Schizophrenic subjects when instructed to 
choose synonyms, exceed normal subjects in their 
choice of homonyms and antonyms as measured by 
the difference score of predicted error markings minus 
unidentified error markings. 

Subjects. The tasks described above were adminis- 
tered to 26 chronic schizophrenics and to 24 normal 
control subjects. The CVS, a brief verbal intelligence 
test, (Hunt, French, Klebanoff, Mensh, & Williams, 
1948) was given to each subject. All subjects were 
Caucasians whose native tongue was English, and who 
gave no evidence in either their records or in interviews 
of central nervous system pathology, drug addiction, 
alcoholism, or mental deficiency. 

The schizophrenics had been hospitalized continu- 
ously for at least 5 years with a diagnosis of schizo- 
phrenia or dementia praecox and had not received 
chemotherapy within the previous month or shock 
therapy within the previous year. Of the normal 
controls, 7 were recruited from ambulatory medical 
and surgical patients at a general hospital, 12 were 
employees of a large Chicago hotel, and the remaining 
5 were Chicago firemen. 

In all cases the CVS was administered either indi- 
vidually or in groups of two or three. The subjects were 
required to answer all questions on the paper-and- 
pencil tasks. Both the schizophrenic and the normal 
groups had a mean age of 44 years with a standard 
deviation of 4 years. Mean years of school completed 
were 10.5 years for the schizophrenics and 10.6 years 
for the normals, with standard deviations of 1.9 and 
1.7 years, respectively. 

Two measures of intellectual functioning were 
computed for the groups. Present functioning intel- 
ligence was estimated by the IQ score obtained from 
the CVS. As might be expected, the normals received 
significantly higher IQ scores, with a mean of 116.7 as 
compared with 90.7 for the schizophrenics. This 
disparity in present functioning intelligence may 
reasonably beattributed to the schizophrenic pathology. 
On the basis of the equivalent age and education, 
the two groups were assumed to be of equal premorbid 
intelligence. 

In addition, premorbid intelligence was estimated 
by the item number of the highest vocabulary item 


TABLE 1 
Error SCORES on ANTONYM Task 
Predicted Unidenti- tae 
Unidentified 
Normals 
Mean 1.3 1.1 0.2 
Median 0.5 0.9 =0.1 
SD 1.7 1.3 2.1 
Schizophrenics 
Mean 9.2 3.2 6.0 
Median 4.5 1.8 2.3 
SD 11.0 3.8 9.1 
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TABLE 2 
ERROR SCORES on Homonya Task 
————————— 
Predicted Unidentifed "Sete 
Unidentified 
-e 
Normals 
Mean 0.2 0.5 —0.3 
Median 0.1 0.3 —0.2 
SD 06 08 0.9 
Schizophrenics 
Mean 3.4 1.1 2.2 
Median 1.5 1.0 0.8 
SD 4.9 1.2 4.6 


passed, a measure introduced by Becker (1956). This 
would appear to be a better estimate than the more 


RESULTS 


The means, medians, and standard deviations 
of the error scores were computed for both groups. 
These results are summarized for the Antonym 
and Homonym tasks in Tables 1 and 2, respec- 
tively. 

Within each group, comparisons of distracter 
and irrelevant markings were made for each task, 
using two-tailed Wilcoxon tests. It was found that 
on both tasks, the schizophrenics chose the 
predicted error alternative more often than the 
unidentified error alternatives. On the other hand, 
the normals showed no significant preference fot 
either type of error. 

In order to compare the groups more directly 
on this differential error preference, there was 
computed for each subject the difference score of 
predicted error markings minus unidentified 
error markings. As discussed above, this is the ~ 
corrected measure of antonym or homonym i 
choice. On both tasks, the schizophrenics exceeded 
the normals (6 < .001). This confirmed the 
hypothesis that schizophrenics, more than normals, 
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treat antonyms as synonyms and treat homonyms 
as Synonyms. 

In order to study the relationship of intelligence 
to this feature of thinking disorder, the rank order 
correlation, corrected for ties, was computed 
between the CVS score and each of the two 
difference scores. Although one might expect 
negative correlations of sizable magnitudes 
between intelligence measures and this measure of 
cognitive disturbance, for the normal subjects 
there appeared to be essentially no correlation, the 
actual values being .16 for the Antonym task and 
.18 for the Homonym task. For the schizophrenics, 
the correlation for both tasks with the CVS IQ 
was —.44, a significant negative relationship 
(p < .05), These significant values for the patient 
group probably reflect the fact that both func- 
tioning IQ decrement and cognitive disturbance 
increase with an increase in overall pathology. 

In an earlier study by Burstein (1959), the 
Antonym task was given to third and sixth grade 
children, and, as predicted, the younger children 
scored significantly higher than the older on the 
corrected measure of antonym choices (antonym 
markings minus unidentified error markings). 
Since in the present study schizophrenics were 
distinguished from normals on this same measure, 
the results support a regression theory of schizo- 
phrenic thinking, at least on a descriptive level. 

The formulation of the hypothesis in primary 
process terms does not, of course, preclude other 
explanations of the data. There exists a consider- 
able body of research with children dating from 
Razran (1939) in which conditioning techniques 
have been used to investigate verbal relationships 
of a sort similar to those considered here. Typically, 
the approach is to use a visually presented word 
as the conditioned stimulus and a GSR as the 
conditioned response. After conditioning is 
established, homonyms, antonyms, etc. are 
presented and generalization curves established. 
These curves are usually found to vary as a func- 
tion of age, and the gradient of generalization is 
said to have altered. 

One may interpret the greater treatment by 
schizophrenics of antonyms and homonyms as 
Synomyms as an expression of increased gen- 
eralization, following interpretations of similar 
Tesults with children. Mednick (1958) has spec- 
ulated that schizophrenia in general is inter- 
Pretable in terms of broadened generalization 
curves, 
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SUMMARY 


On the basis of the psychoanalytic concept of 
primary process, it was predicted that schizo- 
phrenics would show a greater tendency than 
normals to treat both antonyms and homonyms as 
synonyms. The instrument used was a multiple- 
choice paper-and-pencil test. Each item required 
the subject to select a synonym from among three 
alternatives, a synonym, either an antonym or a 
homonym, and a third word which was neither 
and was included to measure choices made upon 
unidentified bases (e.g., obscure communalities, 
random markings, etc.) which might tend to 
inflate the number of distracter markings. A 
corrected score consisting of the difference be- 
tween predicted error markings and unidentified 
error markings was therefore taken as a relatively 
pure measure of the tendency to treat antonyms 
or homonyms as synonyms, independent of other 
error tendencies. 

The schizophrenics exceeded the normal sub- 
jects on this corrected score for both the antonym 
task and the homonym task. 

A possible non-Freudian interpretation of the 
data in terms of learning theory generalization 
is discussed. 
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SOME INCONSISTENCIES IN THE EFFECT OF MOTIVATION AROUSAL ON 
HUMOR PREFERENCES! 


DONN BYRNE? 
University of Texas 


recent article by Strickland (1959) reported 
A the results of a study in which annoyed and 
frustrated subjects showed the greatest preference 
for hostile cartoons while sexually aroused sub- 
jects preferred cartoons with a sexual theme. With 
the Zeitgeist operating, both Strickland (1958) 
and the present author (Byrne, 1957) attacked an 
almost identical problem in their doctoral dis- 
sertations, but with different results. This paper is 
designed to point out the similarities and differ- 
ences between the two studies in the hope of 
leading to a discovery of the reasons for the in- 
consistencies, While a large portion of the present 
study was also concerned with the measurement of 
defense mechanisms, only the relevant aspects of 
the experiment will be described. 


' METHOD 

Cartoons. Using a pool of 400 magazine cartoons, 
five experienced clinical psychologists sorted the 
humorous material into four content categories (and an 
unclassifiable group). The final scale was drawn from 
those 


The cartoons, in random order, were presented to 
the subjects in a group situation. Each subject was 
required to rate the cartoons in four degrees of fun- 
niness (most funny to least funny) with 16 cartoons in 
each category. A cartoon placed in the most funny 
category received a score of three, in the next most 
funny a score of two, etc. Each subject received a score 
on each of the four cartoon types, which could theo- 
retically range from 0 to 48, 

Subjects. The subjects consisted of 76 male under- 
graduates enrolled in an introductory psychology 
course at Stanford University. The experimental 
ponm aen arousal ra sex arousal) each 
consist subj an control group was 
made up of 30 thei 

Conditions. The method selected to arouse hostile 
feelings was one employed successfully by Weatherley 
——— ee 


1 This paper is based on a dissertation submitted in 
partial fulfillment of the requirements for the PhD 
degree at Stanford University. 

2 The author wishes to express his thanks to CL 
Winder, Albert Bandura, Paul R. Farnsworth, and 
Quinn McNemar for their advice and assistance. 


(1956). Briefly, a second experimenter asked for § 
minutes of the subjects’ time for a pilot study in whid 
a confusing “spatial concentration test” was adminis 
tered in an insulting and provoking manner, edi- 
ately afterward, the cartoon test was given. 
The sex arousal group read 10 literary pa ges 
selected on the basis of their sexual content ( 
God's Little Acre, Ulysses, Aphrodite). Each subject 
read these passages (5.5 dittoed pages) and then 
asked to rank them in terms of the desirability of th 
described behavior. When this task was finished, the 
cartoons were administered. 
Half of the control group took the spatial concen- 
tration test from a mild-mannered, polite experimenter, 
The other half read neutral passages from the san 
books from which the sexual material was taken, These 
were ranked by the subjects with respect to how much 
they would like to see the described scene, Again, the 
cartoon test was given immediately afterward. 


RESULTS 


In Table 1, the means and standard deviations 
of the four cartoon scores in the three conditions 
are given. The effect of the conditions on the 
humor scores was determined by analysis of 
variance. For the hostile, sex, and nonsense 
cartoons, motivation arousal did not exert @ 
significant influence on cartoon ratings. The 
ridicule cartoons were rated significantly less 
funny (p < .01) in the sex arousal condition. The 
meaning of this finding, should it be verified in 
future studies, is unclear. | 

Within each condition, there was a good deal of 
consistency in that the hostile cartoons wert 
considered the most funny while the other three 
types received very similar ratings (with the 
exception noted above). Were it not for Strick 
land’s findings, it would be concluded that humot 
preferences are independent of motivational states: 


Discussion 


Independent experimental results which are nót 
Congruent are always a source of embarrassment 
and concern. It is, of course, possible that motiva- 
tion arousal does not influence humor preferences | 
and that Strickland unwittingly reported chance 
findings. Actually, other attempts to relate drivé 
arousal to humor response have yielded uni- 
formly negative results (Byrne, 1958; O’Connell, © 
1960; Sears, Hovland, & Miller, 1940). un 

However, the high level of statistical signif- “ 
cance reported by Strickland (p < .005) makes it 
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CRITIQUE AND NOTES 


plausible to assume that his findings are real ones; 
therefore, the reasons for the conflicting results 
should be sought. Since neither of these two studies 
was designed as a replication of the other, there 
are numerous minor differences in method. Even 
though the populations from which the subjects 
were drawn and the overall designs of the two 
experiments were remarkably similar, they 
differed in several respects as is shown in Table 2. 

Any one or a combination of these differences 
could conceivably account for the divergent 
results. Perhaps the two most important factors 
were the differences in the administration of the 
cartoon test and in the methods used to rate 
cartoon funniness. It has been found that general 
ratings of humor differ in group and individual 
situations (Perl, 1933), and it may be that the 
influence of motivation arousal is dependent on 
such conditions as this. It should be noted that 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF HUMOR 
Scores ror EACH CONDITION 


159 


none of the negative studies mentioned above used 
individual administration of the humorous ma- 
terial. Also, the way the cartoons were rated 
could have been the important difference. In the 
Strickland study, the scores of a given cartoon 
category were free to vary independently of one 
another. In the present study, a given category 
could not be rated more amusing without causing 
a corresponding decrease in the rating of another 
category. It might well be that an independent 
rating of each cartoon is preferable to having 
four categories with an equal number of cartoons 
required in each. 

Additional work will be necessary in order to 
isolate the crucial factors. In any event, the 
generality of Strickland’s findings unfortunately 
must be restricted until further research is under- 
taken. 
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TABLE 2 


COMPARISON OF THE Two EXPERIMENTS WITH RESPECT TO METHOD 


Strickland 


Byrne 


Definition of Cartoon Content 
same. 


The exact wording of the definitions presented to the judges was not the 
The specific cartoons used as stimulus material were not the same. 


5 experienced clinical psychologists 
Agreement of four out of five judges 


16 each of hostile, sexual, ridi 
and nonsense from a pool of 


luate stu- 


oup 

From least funny to most funny in 
a 4-point categorization (16 car- 
toons in each) 

Experimenter administered a con- 


Cartoons 
Judges 15 faculty members, 
redni dents, and under; uates 
Criterion of Cartoon Accept- Agreement of all judges 
ance 
Number of Cartoons 11 each of hostile, sexual, and non- 
nM sensical from a pool of 150 
Administration of Cartoons Individual 
Rating of Funniness From not funny to extremely funny 
on 15-point scale 
Aggression Arousal Experimenter showed displeasure, 
left subject alone for 20 minutes, 
and then gave cartoon test. 
Sex Arousal 


Subjects viewed photos of nude 
models. 


4 
= 


fusing task in an insulting and 
provoking manner; then another 
‘imenter gave cartoon test. 
Subjects read sexual passages from 
novels. 
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THE EFFECT OF INDUCED STRESS ON FREE ASSOCIATIONS! 


ELI SALTZ 
Center for the Study of Cognitive Processes, Wayne State University 


YIDENCE to date is equivocal concerning the 

effect of stress on such complex cognitive 
activities as free associations. Clinicians typically 
assume that stress or anxiety associated with 
specific concepts will reduce the likelihood of 
common associations to these concepts. Neurotics 
and psychotics do tend to give fewer common 
responses than normals on free association tests 
(Woodworth & Schlosberg, 1954), and Buchwald 
(1959) reported that subjects who are anxious on 
the Taylor scale also give fewer common responses. 
These results raise the question: Does stress or 
anxiety reduce the strength of strong associations 
in complex verbal materials? The Taylor-Spence 
(1952) position argues that anxiety should increase 
the strength of a dominant response, but results 
like those of Saltz and Hoehn (1957) have thrown 
serious doubt on this prediction. 

The point of view taken here holds that in the 
acquisition of any verbal connection, the major 
problem is not the formation of the individual 
association. It is the differentiation of the given 
association from all other associations. This is 
Obvious in paired associates learning. Any single 
pair can usually be learned in one trial. Difficulty 
in learning is a consequence of learning many 
pairs in a single list. Reinforcement of a given 
pair can be conceptualized as strengthening the 

u strength between this pair and other 
pairs, thus producing differentiation. Stress acts to 
break down these boundaries and to produce 
dedifferentiation. Thus, stress reduces the fre- 
quency of common responses in free associations. 

the present study, stress was induced in a 
manner similar to that suggested by Beam (1955). 
Beam had college, subjects learn material im- 
mediately prior to course examinations and 

nd significant decrements in learning. Pre- 
to the present study, subjective reports 

vies obtained from students concerning their 
felt anxiety” in examination situations. More 
Students indicated anxiety after taking a difficult 
exam and immediately before receiving their 
exam grades than reported anxiety immediately 
ore taking an exam. Consequently, the stress 
Condition consisted of administering the free 
association test immediately after the first full 


‘The present study is part of a program of research, 
Pe} ported by a grant to the writer from the National 


a a on the influence of differentiation 


exam of the course. The instructor informed the 
subjects that he would grade their papers while 
they were taking the free association test and 
would give them their grades at the end of the 
hour when all students had completed the experi- 
ment. The course exam was a difficult one, the 
highest grade being 49 correct items out of 60. 


METHOD 


Subjects. Fifty-seven Wayne State University 
students, enrolled in summer classes in introductory 
psychology, served as subjects. 

Material and Procedure. The 100-item Kent-Rosanoff 
free association list was used. Instructions and pro- 
cedure for administration were identical to those 
described by Russell and Jenkins (1954): The 100 items 
were typed on two pages, two columns of 25 stimuli 
on each page, with a blank line for subject to write his 
association following each stimulus word. Subjects 
were told to write the first word that came to mind 
upon seeing each stimulus word. There was no time 
limit. 

Subjects took the test twice, the administrations 
separated by one week, once under no stress and once 
under stress. Thirty subjects had the nonstress con- 
dition first; these were tested in two subgroups, one 
of 12 subjects and the other of 18 subjects. Twenty- 
seven subjects had the stress condition first; these 
were tested in two subgroups, one of 13 subjects, the 
other of 14 subjects. Since the results for subgroups 
within each order of presentation were similar, the 
subgroups within each order were combined for pur- 

of analysis. 
UTEN “Only those stimuli were used for which 
some single response was clearly dominant in the 
Russell and Jenkins restandardization of the Kent- 
Rosanoff. Items were divided into three categories on 
the basis of the frequency with which the dominant 
response to each of the selected stimuli occurred in 
the Jenkins study: 20% to 40% occurrence (21 words), 
41% to 60% (17 words), 61% and above (19 words). 
The words in each category are indicated in Table 1. 
For each subject, the percentage of common responses 
in each Cease was computed. These data were 
analyzed by means of analysis of variance for related 
measures. 
RESULTS 


As can be seen in Figure 1, the effect of stress 
was to reduce the number of common associations 
elicited during free associations. This result 
occurred both when the stress condition preceded 
the nonstress and when the stress trials followed 
the nonstress. The reduction in common associ- 
ations as a function of stress is significant at 
beyond the .001 level. 
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TABLE 1 
SrmuLus Worps, AND THEIR DOMINANT Associations, USED IN THE PRESENT STUDY 


CRITIQUE AND NOTES 


Frequency of Occurrence of Dominant Response* 


o40% 41% 60% 61% and over 
PE an p Semt A one Stimulus—! 

1. Sickness—health (38%) Soft—hard (44%) Table—chair (84: ) 
2. Deep—shallow (32%) Chair—table (49%) Dark—Light (83 
3. Eating—food (39%) Sweet—sour (43%) Man—woman (77%) 
4. Mountain—hills (27%) Ror mooth (44%) Black—white (75%) 
5. House—home (25%) Aish ieta (45%) oman—man (65%) 
6. Fruit—apple (38%) eedle—t 46%) White—black (62%) 
7. Smooth—rough (33%) Carpet—rug (46%) Slow—fast (75%) 
8. Command—order (20%) Sour—sweet (57%) irl—boy (oe 
9. Cold—hot (35%; Eagle—bird (55%) High—low (68% 
10. Sleep—bed (24% Stem—flower (40%) Hard—soft (67%) 
11. mad (35%) Dream—sleep (45%) Lamp—light (63%) 
12. Bath—clean (31%) Hammer—nails (54%) Bread—butter (61%) 
13. Cottage—house (30%) Loud—soft (54! ) Boy—girl (77%; 
14. Swift—fast (37%) Bed—sleep (58%, Light—dark ( va 
15. Hungry—food (36%) Heavy—light (58%) rt (16% 
16. Priest—chi 33%) Tobacco—smoke (56%; Butter—b: (64%) 
17. Square—round (37% Salt—pepper (43%) Scissors—cut (67 
18. or—nurse (24%) King—queen (75%; 
19. Thief—steal (29%) Blossom—flower (67%) 
20. Lion—tiger Colo 
21. Quiet—loud (35%) 


"The percentages indicate the frequency with which the dominant response to each stimulus was elicited in the Russell 4 


Jenkins (1954) standardization. 


The number of common associations given 
during nonstress is almost identical in the two 
orders of presentation, as can be seen in Figure 1. 
While there appears to be a tendency for stress to 
produce somewhat less of a decrement when the 
stress condition precedes the nonstress, this 
difference does not approach the .05 level, sug- 
gesting that order of presentation may not be a 
critical variable. 

Association level, on the other hand, was a 
significant variable here in determining the drop 
in frequency of common responses as a function 
of stress (F significant at beyond the .05 level). 
From Figure 1 it is apparent that the smallest 
decrements occur to the low association value 
items. For the nonstress-stress order of presenta- 
tion, there is little difference between high associ- 
ation value items and middle association items. 
For high association value items, the drop was 
10.8 percentage points. On the other hand, the 
stress-nonstress order does show some tendency 
for a greater drop in the middle association value 
items (7.0 percentage points drop) than by the 
high association value items (3.7 percentage 
points drop). Since neither the latter difference 
nor the difference between orders is significant, 
the safest induction for the present is that there 
is no real difference between high and medium 


association values in the decrement produc 
by stress. 

The writer analyzed the associations to d 
termine if stress tended to produce a tendency t 
give responses which were outside of the total li 
of responses reported by Russell and Jenki 
(1954). No such tendency was observable. Un de 
both stress and nonstress conditions, failure ti 
give the common response usually resulted | 
giving some other word in the Russell and Jenkin 
list for the particular stimulus. " 


Discussion 


The results of the present study are consisten 
with Buchwald’s (1959) findings with 
(high and low scores on the Taylor scale). 
suggest that stress and anxiety reduce the str 
of common responses in free associations, con 
to what is anticipated on the basis of a drive the 
of stress such as proposed by Castaneda a 
Palermo (1955). It might be pointed out that 
their studies of the effect of stress on discri 
tion learning (Castaneda, 1956; Castaneda 
Palermo, 1955; Palermo, 1957), these wri er’ 
found no significant effects upon well-learné 
discriminations as a result of stress. This may be 
consequence of the type of stress used in 
studies: Castaneda and Palermo attempted ti 
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70 


60 


50 


40 


30 


4—A STRESS NONSTRESS 
A-A NONSTRESSJS FIRST 


O—o STRESS STRESS 
@---@ NONSTRESS{ FIRST 


MEAN PERCENTAGE OF COMMON RESPONSES 


Low MEDIUM HIGH 
ASSOCIATION LEVEL 


Fic. 1. Mean percentages of common responses in 
free associations as a function of stress, order of 
Presentation, and strength of association. 


induce stress by telling their subjects to work 
Ne or by excessive pacing of the learning 
s. 

Similarly the paucity of novel responses found 
among the free associations during stress suggests 
that the type of explanation proposed by Child 
(1954) to account for anxiety scale results is not 
applicable here. Child suggests that anxiety has 

„unique stimulus characteristics which produce 
responses which interfere with the responses to 
the stimulus items presented by the experimenter. 
Other methods of producing stress (e.g., electric 
shock) may have more characteristic response 
associates. The present study argues that such 
Tesponse associates are not essential for producing 
decrements in the common responses. 

A simple response competition theory (modeled 
after the theory proposed by Child to account for 
anxiety scale results) would lead to the prediction 
that stress has a linear effect on the reduction of 
common response. This is true because the amount 
of stress induced response competition is constant, 
no matter what the strength of the word associ- 
ations involved at any given moment. 

Farber (1954) has suggested a two-factor 
Position, containing both the drive notion and 
Child’s notion concerning the stimulus character- 
Istics of stress, As a stimulus, stress produces 
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characteristic stress responses which may compete 
with the common free associations. As a drive, 
stress multiplies the strength of both the common 
free associations and their competing stress 
responses. Consequently, the drive characteristic 
of stress should strengthen the strong common 
free associations disproportionately more than the 
weak common associations. Thus, the strong 
common associations should suffer less of a decre- 
ment than the weak common associations. Just 
the opposite occurred. 

The results of the interaction between stress 
and association strength of the common responses 
suggests that dedifferentiation is a curvilinear 
function of boundary strength. This notion seems 
best to explain the phenomena under consideration. 


SUMMARY 


The purpose of the present paper was to in- 
vestigate the influence of stress on the common 
responses in free associations. On the basis of a 
dedifferentiation theory, it was predicted that 
stress would reduce the frequency of common 
responses. On the basis of a drive theory, one 
might anticipate an increase in frequency of 
common responses. 

Stress was operationally defined as a situation 
in which subjects had just taken a difficult course 
examination and were waiting to be told their 


les. 
Results indicated highly significant decrements 
in common responses as a function of stress, 
There was no tendency toward “novel” responses 
(responses not given to a stimulus in the Russell 
and Jenkins standardization). The theoretical 
implications of these observations were discussed. 
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THE EFFECTS OF ANXIETY AND THREAT ON THE SOLUTION OF A 
DIFFICULT TASK! 


IRWIN G. SARASON? 
University of Washington 


“HIS experiment was concerned with the 

performance of subjects differing in anxiety 
scale scores on a concept formation task known 
to be a very difficult one. The subjects differed 
not only with respect to anxiety scores but also 
with respect to the instructions administered to 
them prior to performance. Underlying the re- 
search was the hypothesis that instructions and 
degree of anxiety interact in affecting performance. 
In view of previous research, it seemed that an 
adequate test of this hypothesis could best be 
made using a task high in difficulty level (Nichol- 
son, 1958; Sarason & Palola, 1960). 

There are at least two important aspects of a 
difficult or complex task. Obviously, a complex 
task is hard, and successful performance on it 
Tequires intellectual ability commensurate with 
the level of the problem. But there is another 
aspect of a highly complex task—its potential for 
arousing emotional reactions in some subjects. 
Several studies using anxiety scales have reported 
Poorer performance for high anxious than low 
anxious subjects on complex or difficult tasks 
(Montague, 1953; Ramond, 1953; Taylor & 
Spence, 1952). Other investigators, however, 
have not found significant differences due to 
anxiety on tasks which appear to be relatively 
high in difficulty level (Axelrod, Cowen, & Heilizer, 
1956; Saltz & Hoehn, 1957). It would appear that 
the usefulness of present conceptions of anxiety in 
Predicting subjects’ reactions to challenging, 
difficult tests of intellectual ability is as yet quite 
limited. 

į Better assessment of individuals’ reactions to a 
dificult intellectual problem may be developed if, 
ìn addition to the difficulty and anxiety variables, 
the factor of the conditions under which per- 
formance takes place is better understood. One 
Important aspect of these conditions is the nature 
of the instructions administered to subjects im- 
mediately prior to performance. In the present 
Study, prior to subjects’ attempts at solving 13 
very difficult anagrams, half the subjects were 
told that the anagrams could be easily solved by 
individuals of average intelligence. These in- 


‘This research was supported by a grant from the 
National Institute of Mental Health (M2397). 

*The writer is indebted to William R. Crow and 
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this study. 


structions were called the personal threat condition 
because all subjects quickly discovered that they 
could not solve all of the anagrams. Under this 
condition, it was predicted that low anxious 
subjects would be superior to high anxious sub- 
jects. The prediction is based on the assumption 
that high anxious subjects respond to a condition 
of personal threat with more personalized anxiety 
responses than do low anxious subjects. 

The remaining subjects were told, prior to 
performance, that the anagrams were actually 
quite difficult and that successful solution of all of 
them in the allotted time was highly unlikely, It 
was predicted that under this condition low anxious 
subjects would not perform at a significantly 
higher level than would high anxious subjects. 
This was expected on the assumption that this 
set of instructions would reassure high anxious 
subjects and reduce their anxiety over poor 
performance. 

An additional aspect of the present study 
concerns the comparison of three anxiety scales 
with respect to their usefulness in assessing sub- 
jects’ potential for emission of interfering re- 
sponses under the personal threat condition. One 
of the scales, the Test Anxiety Scale (Sarason, 
1958b), dealing specifically with subjects’ reactions 
to testing situations, was expected to interact 
more significantly with instructions than would 
two indices of general anxiety employed. 


METHOD 


Subjects 

The subjects were 191 students in introductory 
psychology courses at the University of Washington. 
Independent of and prior to participation in the experi- 
ment proper, these subjects were administered the 
Test Anxiety Scale (TAS) and Lack of Protection 
Scale (LP) of the Autobiographical Survey (Sarason, 
1958a, 1958b). Ninety-one of the subjects were also 
administered the Bendig short form of the Taylor 
Manifest Anxiety Scale (MAS) (Bendig, 1956). Since 
the personality test data and the performance scores 
of the subjects were obtained independently of each 
other, certain of the subjects had to be randomly 
omitted from each of the statistical analyses in order 
to meet the analysis of variance requirement of pro- 

tionality of cell frequencies. ‘ j 
A TAS, the subjects were divided into high, 
middle, and low anxious groups. The extreme groups 
corresponded approximately to the upper and lower 
30% of the TAS score distribution. The middle TAS 
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TABLE 1 
THIRTEEN ANAGRAMS ADMINISTERED TO ALL 
SUBJECTS 
1. ETLHHA 8. SUTCBII 
2. ETROS 9. SCLIAO 
3. RECMI 10. EVSUORN 
4. CNEGAH 11. RSANEO 
5. NMGOINR 12. IMTCELA 
6. NSRWAE 13, ELSAUX 
7. SPRUUE 


group corresponded to the middle 40% of the dis- 
tribution. High TAS subjects had scores of 6-16; 
middle TAS subjects had scores of 3-5; and low TAS 
subjects had scores of 0-2. 

On LP the subjects were also divided into three 
groups corresponding to the upper and lower 30% of 
the distribution and the middle 40%. High LP scores 
ranged between 9-20; middle LP scores ranged between 
4-8; and low LP scores ranged between 0-3. 

Because of the smaller MAS N, the subjects were 
divided at the median on this scale. High MAS subjects 
had a range of 6-17, and low MAS had a range of 0-5. 


Procedure 


The anagrams used were selected from Sargent’s 
list of those of high difficulty (Sargent, 1940). Table 1 
gives the anagrams employed. 

All subjects were tested in groups ranging in size 
from 35-50. Each subject was given a booklet con- 
sisting of two pages with the instructions on the first 
page and the list of anagrams to be solved on the 
second. The time limit for all subjects was 18 minutes. 

The instructions page consisted of two paragraphs. 
For all subjects the first paragraph provided the 
following information: 

On the next page you will see a series of dis- 
arranged words. Your job will be to rearrange each 
group of letters so that they make a meaningful 
English word. Start when you are so instructed. 
Stop at the stop signal. Write your name at the top 
of the next page when given the signal. 

The subjects under the personal threat condition 
were given booklets which had the following second 
paragraph on the first page: 

Ability to organize material such as the letters 
on the next page has been found to be directly 
related to intelligence level. High school students 
of above average intelligence (IQ greater than 100) 
and most college students should be able successfully 
to complete the task. You will have 18 minutes in 
which to complete it. 

The subjects under the nonthreat conditions were 
given booklets which had the following second para- 
graph on the first page: 

Most of you probably have worked anagrams, 
The task on the next page works the same way. 
These anagrams, however, are harder than most 
you have seen in books and magazines. Conse- 
quently, you may not finish all of them and you 
may find some of the anagrams very difficult. If 
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this happens, don’t worry about it. No one will 
find the anagrams easy. 
The seating arrangement of the testing room was 


such that none of the subjects could have known that 


there were two forms of the instructions. 
RESULTS 


A 2 X 3 analysis of variance was performed on 
the number of correct anagram solutions with 
three levels of TAS (high, middle, and low) and 
two levels of instruction (threat and nonthreat), 
Only one F was statistically significant, that for 
the interaction between test anxiety and instruc- 
tion (p < .01). Under the threat condition, the 
middle and low TAS subjects performed at a 
higher level than did the high TAS subjects, 
However, just the opposite was the case under 
the nonthreat condition, where the high TAS 
subjects were superior to middle and low subjects, 
Table 2 presents the means and standard devi- 
ations for the six experimental groups. 

A similar 2 X 3 analysis for LP also revealed 
only one significant effect, that for the anxiety by 
instructions interaction (p < .025). The nature of 
this interaction was identical to the one obtained 
when subjects were divided on the basis of TAS. 
Table 3 presents the means and standard devi- 
ations for the LP analysis. 

A 2 X 2 analysis of variance carried out on the 
MAS data failed to reveal any significant effects, 


TABLE 2 
MEANS AND STANDARD DEVIATIONS FOR NUMBER 
or Correct SOLUTIONS For SIX EXPERIMENTAL 
Groups DIVDED on THE Basis or TEST 
Anxtety Scare (TAS) 
AND INSTRUCTIONS 


Threat Instructions |Nonthreat Instructions 
TAS Mu SD N M SD N 
High 3.54 | 2.13 | 26 | 5.31 | 3.22 | 26 
Middle 4.73 | 2.64 | 37 | 3.49 | 2.15 | 37 
Low 4.92 | 2.61 | 25 | 4.04 | 3.17 | 25 
TABLE 3 


MEANS AND STANDARD DEVIATIONS FOR NUMBER OF 
Correct SOLUTIONS FOR Six EXPERIMENTAL 
Groups DIVIDED on THE Basis OF LACK OF 
PROTECTION SCALE (LP) AND INSTRUCTIONS 


Threat Instructions Nonthreat Instructions 
LP M SD N M SD N 
High 3.93 | 2.33 | 29 | 5.38 | 2.88 | 29 
Middle 4.54 | 2.70 | 37 | 3.92 | 2.61 | 37 
Low 4.90 | 2.50 | 21 | 3.48 | 2.40 | 21 
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although the anxiety by instructions interaction 
was in the same direction as the significant inter- 
actions reported above. 


Discussion 


The empirical findings of the present research 
are quite clear-cut. For two of the three anxiety 
scales used, it was found that, according to the 
predictions made, threat and nonthreat conditions 
differentially affected high, middle, and low 
anxious subjects. Under the threat conditions, the 
performance of high anxious subjects was on a 
significantly lower level than the performance of 
subjects in the rest of the anxiety score distribu- 
tion. Just the opposite result emerged under the 
nonthreat condition. Although the anagrams task 
was clearly a very difficult one and, judging from 
informal comments of subjects after the experi- 
ment, a frustrating one also, it would seem that 
the nonthreat instructions may have functioned 
as a reassurance to high anxious subjects. 

Studies by Sarason and Palola (1960) and 
Nicholson (1958) have provided evidence suggest- 
ing that the optimal conditions for arousal of 
interfering responses in high anxious subjects are a 
combination of high difficulty level and personal 
threat, Although the difficulty variable was not 
manipulated in the present experiment, the results 
are consistent with this generalization. The 
finding here of a superiority of high to middle 
and low anxious subjects under the nonthreat 
condition seems to be a potentially important one. 
It is possible that a condition which provides 
reassurance and consequently better performance 
for a high anxious individual may actually serve 
to lower the motivational level and performance 
of less anxious individuals (Sarason, 1958a). 

If these inferences are extended to the general 
problem of intellectual evaluation, it may well be 
that when intelligence or aptitude testing is 
conducted under conditions of personal threat 
(as perceived by the subject), high anxious sub- 
Jects’ ability will be systematically underestimated 
due to their emission of interfering anxiety re- 
Sponses. This possibility would seem particularly 
likely in the case of college entrance examinations, 
where the student is well aware of the evaluative 
and selective purposes of testing. Certainly, the 
Present study, together with previous research, 
Suggests the need for more intensive study of the 
tole of individual difference variables such as 
on in intellectual performance (Sarason, 1957, 

It had been expected on the basis of previous 
Tesearch that the test anxiety measure would 
More significantly interact with instructions than 
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would the LP and MAS measures (Sarason, 
1958a, 1959b, 1959c). However, significant inter- 
actions were found for both TAS and LP. It would 
seem that there are serious shortcomings in present 
knowledge of (a) the conditions under which only 
the specific test anxiety index relates to perform- 
ance, and (b) the conditions under which both 
general and specific indices of anxiety relate to 
performance. 


SUMMARY 


‘Two variables were manipulated in this research, 
anxiety and experimental instructions admin- 
istered prior to the subjects’ performance on a 
series of anagrams high in difficulty level. Three 
measures of anxiety were used: the Test Anxiety 
Scale (TAS), the Lack of Protection Scale (LP), 
and Bendig’s version of the Manifest Anxiety 
Scale (MAS). The two sets of instructions differed 
in that one set was designed to pose a personal 
threat to the subject and the other set was de- 
signed to reassure the subject. 

The results showed that TAS and LP, but not 
MAS, significantly interacted with the instruc- 
tions. High anxious subjects under the threat 
condition performed at a lower level than did low 
and middle anxious subjects. Just the reverse was 
true under the nonthreat conditions, where high 
anxious subjects were superior to low and middle 
anxious subjects, 

The results were interpreted in terms of the 
interfering responses aroused in high anxious 
individuals under personal threat conditions. 
The problem of the specificity of anxiety scales 
was also discussed. 
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EGO STRENGTH AND PERCEIVED COMPETENCE AS CONFORMITY VARIABLES! 


KAY H. SMITH? 
Wayne State University 


RESSURE toward conformity is generated 

whenever a person, as a member of a small 
group, finds his perceptions or judgments at 
variance with a majority of the other members of 
the group. The amount of yielding to such pressure 
logically should vary with the value placed by the 
deviate on the ability of the other group members 
to judge correctly in the situation relative to his 
perception of his own ability to do so. The two 
studies available in the literature on this point 
used ambiguous material. Hochbaum (1954) used 
as a task the prediction of the outcome of cases 
from case materials. Kelman (1950) found similar 
results using the autokinetic phenomenon. The 
present study is an attempt to validate these 
findings employing a situation similar to that 
used by Asch (1952), in which it is rather obvious 
to the deviate that the other group members are 
giving erroneous judgments. 

Resistance by the deviate to such majority 
pressure would seem to be dependent upon his 
capacity to assimilate discrepancies in informa- 
tional input without being threatened. A general 
name given to this attribute is “ego strength.” 
Barron (1953, 1954) proposes his Es scale as a 
measure of this variable. Daniels (1957) found 
that an index using the Es scale and the Self- 
Contextual-Influence test was related to suscep- 
tibility to persuasive communication. Crutchfield 
(1955), using an Asch-type situation (1952), 
found a correlation of —.33 between conformity 
and Es scores, 

The hypotheses to be tested are: 

Hypothesis I. Persons who place higher value 
on their own ability to judge yield less under 
Pressure to conformity than those who place a 
lower value on their own ability relative to that 
of the other group members. 

Hypothesis II. Persons who have high ego 
strength yield less under pressure to conformity 
than persons with low ego strength. 


MeEtHoD 


Es scale. This scale consists of 68 items taken from 
the MMPI. The specific items and the method of 


1The data reported here are taken from a thesis 
submitted by the author in partial fulfillment of the 
requirements for the Master of Arts degree at Brigham 
Young University. 

* Acknowledgment is made to Kenneth Hardy whose 
advice and guidance on this project were greatly 
appreciated. Gratitude is also expressed to Ross 
Stagner for his helpful suggestions in the preparation 
of this manuscript. 


scoring are found in Barron (1953, 1954). The scale 
was administered without any filler items. The items 
were arranged in the order in which they appear in 
the long form of the MMPI. 

Judgment apparatus. Judgments were made on the 
standard Galton bar, with the addition of a sliding 
clamp fitted with a thumb screw that could be tightened 
or loosened. The clamp was so arranged that it was 
hidden from the sight of anyone seated in front of the 
apparatus, and enabled the confederates in the experi- 
ment to make accurately any judgment desired by the 
experimenter. Each time a confederate made his 
judgment he would simply adjust the bar until its 
movement was arrested by the clamp. The clamp was 
loosened when the critical subject was adjusting the 
bar to allow freedom of movement in any direction. 
The side of the bar used as the standard of judgment 
was set at 30 centimeters. 

Subjects. The subjects were 48 students from three 
introductory psychology classes at Brigham Young 
University. The Es scale was administered in class, 
the total number of students tested being 253. 

Since Daniels (1957) had reported a sex difference in 
ego strength scores, the means and standard deviations 
of Es scores were computed separately for males and 
females, For the males the mean was 46.90 and the SD 
4.55, and for the females the mean was 43.25 and the 
SD 3.68, the means being significantly different at well 
beyond the .01 level. To take into account this sex 
difference 12 males and 12 females were counted off 
from each extreme of the distribution. These subjects 
comprised the sample for the remainder of the experi- 
ment. 

Ten male and 12 female Brigham Young University 
students served as confederates. With the exception of 
2 male graduate students all were undergraduates, 

Procedure. Each of the selected subjects was taken 
individually into a room containing the Galton bar and 
given three trials, each beginning at zero, at adjusting 
the apparatus so that the sides appeared equal, This 
yielded the individual average judgment. 

At this point in the experiment the subjects were 
assigned at random to the three experimental groups, 
four high males, four low males, four high females, 
and four low females to each group, according to the 
original ego strength score dichotomy. The procedure 
iod for these different experi- 


in all cases along with one subject—respectively, either 
male or female. These three people were introduced to 
each other as subjects and by name, but beyond this, 
verbal interaction was kept to a minimum. Each group 
was told that the group had been assembled to assess 
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the difference between individual judgments and 
judgments arrived at in cooperation with others. Next 
the three group members were given percentile scores 
supposedly indicating how they compared in accuracy 
with all the people participating in the experiment on 
the original judgment. For Group A (high self-valua- 
tion) the subject was always given a percentile score of 
95, the male confederate one of 73, and the female 
confederate one of 54. For Group B (low self-valuation) 
the subject was always given the percentile score of 54, 
the female confederate the score of 73, and the male 
confederate the score of 95. For Group C (no differ- 
ential in self-valuation) all three were given the per- 
centile score of 73. These percentile rankings were 
given orally before the group and the meaning of the 
score was explained. 

The procedure for the group judgment was then 
explained. The first person was to begin from zero and 
adjust the bar until he thought the sides were equal, 
then the second would make whatever adjustments he 
felt necessary from this point until he was satisfied with 
the judgment, then the third would do likewise. This 
final adjustment, they were told, would be taken as 
the group measure. This procedure was to be followed 
three times with a different order each time so that 
each would have an opportunity to make the final 
adjustment. The three series followed this order: 
male confederate, subject, female confederate; subject, 
female confederate, male confederate; female con- 
federate, male confederate, subject. The final measure- 
ment of the subject in Series 3 was taken as the critical 
group measure, since at this point the greatest cumu- 
lative effect of the discrepancy should have been 
achieved between his perception of equality and that 
demonstrated by the other group members. The 
difference between this measure and the subject’s 
individual mean measure was taken as the conformity 
score. 

After the session was completed the subject was 
informed of the purposes of the experiment and of the 
deception. He was also asked if at any time he had 
suspicions as to the authenticity of the instructions 
given, or of any aspect of the situation. In all cases 
the deception seemed to be complete. 


RESULTS 


The two hypotheses were tested by means of an 
analysis of variance. The summary table is given 
in Table 1, which indicates that the first hypothesis 
may be accepted as tenable. The second, however, 
cannot be accepted on the basis of these results. 

To test the directionality of difference on the 
ranking variable, ¢ tests were run between the 
three experimental groups. The differences be- 
tween means of the three experimental groups 
were all in the predicted direction. The difference 
between the high (A) and low (B) ranked groups 
(p < .01) and the equal (C) and low (B) ranked 
groups (p < .05) were significant. The difference 
between the high (A) and equal (C) ranked groups 
approached but did not quite reach the level of 
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TABLE 1 
ANALYSIS OF VARIANCE FOR CONFORMITY SCORES 


Source of Variation SS | df | MS R p3 
Ego strength -73| 1| .73 | 1.53 |>.20 
Experimental 
Groups (ranking) | 14.60] 2] 7.30 | 6.50 |<.01 
Interaction 1.47] 2] .73 | 1.53 |>.20 
Within cells 47.04 | 42 | 1.12 

Total 63.84 | 47 


significance (p > .05). Upon this evidence it 
seems logical to consider the directionality of the 
relative ability value hypothesis supported. 


Discussion 


The results obtained here on the relationship 
of Es scores and yielding to group pressure seem 
to be in opposition to the findings of Daniels 
(1957) and Crutchfield (1955). With the Daniels 
work the discrepancy might possibly be due to 
the use of another measure in addition to the Es 
scale. The major difference between the present 
study and that of Crutchfield seems to be in the 
sample used. Since Es scores are correlated with 
intelligence, and since Crutchfield used Air Force 
officers, there is a possibility that his sample was 
biased toward the high end of the scale. The 
present sample included groups from each extreme. 
A possible consequence of such a difference in 
sample comes from the work of Janis (1954). 
Janis reported that while those of his subjects who 
evidenced the most change seemed to be passive 
and dependent, those who resisted change deviated 
from the norm in the direction of neurosis, com- 
pulsion, and hypochondriasis. While those scoring 
low on the Es scale have been described as passive 
and dependent, the scale also contains items 
indicating possible neurosis, compulsion, and 
hypochondriasis. Perhaps indeed Es scores bear a 
curvilinear relationship with susceptibility to 
pressure toward change of judgment. 

The results of this study support fully the 
conclusions reached by Hochbaum (1954) and 
Kelman (1950) regarding the way in which per 
ceived competence is related to susceptibility to 
pressure toward conformity. Their conclusions 
can now be extended to instances in which pressure 
favors a judgment that is in direct opposition to 
sensory cues, 


SUMMARY 
The problem investigated in this research 
concerned the relationship of two personal vari 
ables to susceptibility to group pressure to change: 
The personal variables used were ego strength, aS 


at 
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measured by the Barron Es scale, and perceived 
competence for the task. Subjects were 12 males, 
and 12 females selected from each extreme of the 
distribution of Es scores. These subjects were 
given three trials each on the Galton bar to es- 
tablish an individual mean. They were then 
assigned at random to groups and given instruction 
to establish expectancies of high, low, or average 
competence on the task. In a group situation 
confederates gave judgments four centimeters 
longer than the subject’s previous mean. The 
difference between the subject’s final measure in 
this group situation and his individual mean was 
his conformity score. 

It was concluded that: 

1. Subjects selected from the extremes of a 
distribution of Es scale scores do not differ in their 
susceptibility to group pressure to change. 

2. The degree of susceptibility to group pressure 
to change is related to the subject’s own perceived 
competence for the task involved. 
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THE “YEASAYER” AND SOCIAL DESIRABILITY: 
A COMMENT ON THE COUCH AND KENISTON PAPER 


JAMES B. TAYLOR 
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recent paper in this journal investigates 

the influences of an “agreeing response set” 
on a number of scales, and explores the relationship 
of this set to other personality factors. A wealth of 
data is presented, all of it seeming to demonstrate 
“both the far-reaching importance of response set 
in the area of psychological tests and the major 
proposition that the agreeing response tendency is 
based on a central personality syndrome” (Couch 
& Keniston, 1960, p. 173). 

There is a possibility, however, that these 
many findings can also be explained as reflections 
of the social desirability variable. Couch and 
Keniston’s results may be viewed as once again 
demonstrating the importance of social desirability 
in personality scaling, and as giving further 
information on the “social desirability set”—i.e., 
the set to endorse socially undesirable statements. 

It will be recalled that Couch and Keniston 
devised a scale to measure agreeing response set. 
The scale consisted of 360 items drawn from a 
variety of personality questionnaires. Its items 
were chosen so as to minimize the influence of 
specific content. Subjects rated each item on a 
seven-point scale and were assigned an Over-all 
Agreement Score (OAS). The OAS had an ade- 
quate split-half reliability (85) and was fairly 
consistent over 1 or 2 weeks’ time. 

Now it seems possible that the OAS does not 
measure an agreeing response set per se, but rather 
measures the tendency to endorse socially unde- 
sirable items. The OAS is based on statements 
drawn from a number of Likert-type scales. 
Previous research has demonstrated that the 
majority of such statements can be located on a 
social desirability continuum. Further, there is a 
strong tendency for personality scales to be loaded 
with socially undesirable items. There is also 


considerable evidence to show that people respond 
differentially to items on the basis of social desir- 
ability: the more socially desirable an item is the 
more likely it is to be endorsed. 

Willingness to endorse socially undesirable items 
seems to be a general response trait that is mani- 
fested irrespective of the obvious content of the 
item (Edwards, 1957). By choosing a heterogeneous 
group of items, there is a good chance that Couch 
and Keniston developed an Over-all Agreement 
Scale largely made up of socially undesirable items. 
If this happened, the OAS should be regarded as 
another measure of the social desirability set—a 
measure similar to the SD scales reported in 
several other papers (Edwards, 1953; Fordyce, 
1956). 

The issue seems worth raising, not only for this 
study but also for other studies that ostensibly 
measure the agreeing response set. If such studies 
are measuring the social desirability set as well as 
(or instead of) the agreeing response set, then it 
may be that their conclusions need re-evaluation. 
Perhaps future studies should make an effort to 
differentiate between the two response tendencies. 
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ALLEN L. EDWARDS 


[Pp warns (1953) has shown that there is a high 
AČ positive correlation between the social 
bility scale value of a personality item 
the probability of endorsement of the item. 
is finding has been confirmed in a number 


1959; Hanley, 1956; 
1956; Taylor, 1959; 
& Rumrill, 1959). 


er consideration of the relationship 
the social desirability scale value and the 
ility of endorsement of a personality item 
Tesulted in the concept of a socially desirable 

to a personality item. A socially desirable 

is defined as a True response to a person- 
“ality item with a socially desirable scale value or a 
response to a personality item with a socially 
Undesirable scale value (Edwards, 1957b). A 39- 
“item Social Desirability scale (SD) has been de- 
a measure the tendency to give socially 
‘scribed 


s, 1955, 1957a, 
1958; Kenny, 


sirable responses to personality items and is de- 
by Edwards (1957b). Scores on the SD have 
been found to correlate substantially with scores on 
many other personality scales of the True-False 
kind (Edwards, 1957b; Edwards, Heathers, & For- 
is dyce, 1960). In particular, it has been found that if 
| 4 scale contains a large percentage of items keyed 
-lor Socially desirable responses, then scores on this 
will correlate positively with scores on the 
an regardless of the True-False keying of the 
Mems in the scale, Similarly, it has been found 
that if a scale contains a small percentage of 
items keyed for socially desirable responses, then 
“Scores on this scale will correlate negatively with 
ret on the SD, again, regardless of the True- 
raise keying of the items in the scale. 
„In a recent article, Couch and Keniston (1960) 
Bepitined Over-all Agreement Scores (OAS) on 
i items. They describe the OAS as measuring 
oy of subjects to agree or respond 
to items regardless of the content of the 
tems. Part of the evidence reported by Couch and 
ce ow with respect to the construct validity 
OAS OAS consisted of the correlations of the 
With 32 Minnesota Multiphasic Personality 
a” (MMPI) scales. They found that MMPI 
which had a large percentage of items keyed 
“are tended to correlate positively with the OAS 
tems po IPI scales with a large percentage of 
_ “tS keyed False tended to correlate negatively 
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with the OAS. They interpret these findings as 
“... indicating that scales with an unusually 
high (or low) percentage of items scored True are 
significantly influenced by the agreeing response 
set” (p. 154) which they regard the OAS as pri- 
marily measuring. 

An alternative hypothesis, which Couch and 
Keniston did not consider, is that scores on both 
the OAS and on the MMPI scales may also be 
influenced by the tendency of subjects to give 
socially undesirable responses.! The present study 
is concerned with the evidence concerning this 
alternative hypothesis. 


Merxop 


The percentage of items keyed for socially 
undesirable responses, based upon the social 
desirability scale values of the MMPI items 
obtained by Heineman (1953) and reproduced in 
Dahlstrom and Welsh (1960) was obtained for 31 
of the 32 MMPI scales investigated by Couch 
and Keniston.? The product-moment correlation 
between the percentage of keyed socially undesir- 
able responses in each scale with the correlation 
of the scale with the OAS was found to be 77. 
The corresponding product-moment correlation 
between the percentage of keyed True responses 


1In view of the Perna of a socially eae 
response, it should be obvious t a socially undesirable 
reponse is defined as a False response to a personality 
item with a socially desirable scale value or a True 
response to a personality item with a socially undesir- 
able scale value. 

2 Scoring keys for many of the 32 MMPI scales are 
given by Dahlstrom and Welsh (1960). Harrison 
Gough kindly made available to us the scoring keys for 
the scales not given in Dahlstrom and Welsh. We used 
only 31 of the 32 scales because for one scale, Social 
Presence (X1), 


items reported as 
and the scoring keys 


seriously influence any 


no way P or in the present paper. 


by Couch and K 
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in each scale and the correlation of the scale with 
the OAS was .85. 

For the 31 scales the correlation between the 
percentage of keyed True items and the percentage 
of keyed socially undesirable items is .55. The 
partial correlation between the percentage of 
socially undesirable items and the OAS correla- 
tions, with the percentage of True items held 
constant, is .69. The corresponding partial correla- 
tion between the percentage of True responses and 
the OAS correlations, with the percentage of 
socially undesirable items held constant, is .80. 


Discussion 


On the basis of the results reported in the present 
study, the OAS correlations with the MMPI 
scales are consistent with both the hypothesis 
that scores on the OAS (and MMPI scales) are 
influenced by acquiescent tendencies and the 
hypothesis that they are influenced by social 
desirability tendencies. There is no compelling 
evidence in favor of either hypothesis. 

It is unfortunate that the possible confounding 
of the acquiescence and social desirability sets 
was not given consideration in the development 
of the OAS. An acquiescence scale independent of 
the tendency to give socially desirable (or socially 
undesirable) responses would be of obvious value 
in personality research. 
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‘AYLOR (1961) and Edwards and Walker 

(1961) have suggested that our previous 
study of “Yeasayers and Naysayers” (Couch & 
Keniston, 1960) should be re-examined for mani- 
festations of the social desirability variable 
(Edwards, 1957), This note reports further 
analyses of our data to this end. 

Edwards (1957) has defined the social desir- 
ability variable as the tendency of subjects to 
answer “Yes,” “True,” or “Agree” to statements 
having consensually-defined social desirability, 
and to answer “No,” “False,” or “Disagree” to 
statements having consensually-defined social 
undesirability. Two different operational ap- 
proaches have been used to determine the degree 
of social desirability of personality test items: 
(a) a panel of judges rates each item in terms of 
this dimension, then these ratings are scaled by 
appropriate methods to arrive at the social desir- 
ability value of the statement; (b) the probability 
of endorsement of the item by a population of 
subjects can serve as an inductive indicator of 
the same variable: high frequency of endorsement 
defines a socially desirable item, while low fre- 
quency of endorsement defines a socially undesir- 
able item. The high relationship between these two 
methods of assessment has been amply demon- 
strated by Edwards (1957) and others. 

Edwards attempted to develop a short measure 
of this general response tendency by having a 
Panel of 10 judges evaluate a set of MMPI items 
from the F, L, K, and A scales (Edwards, 1957, 
Pp. 29-30) for the social desirability of a “True” 
or “False” answer. The 79 items with perfect 
agreement as to social desirability were subjected 
to further item analysis. The subsequent selection 
Tesulted in a 39-item scale that Edwards has 
termed the Social Desirability scale (SD). 


Relationship between the SD and the OAS 


The SD (Edwards, 1957, p. 32) score was 
computed for the 61 subjects (Harvard sopho- 
mores) in our previously reported study. The 
mean SD score was 29.6 with a standard deviation 
of 6.4. These values are quite similar to those 
Teported by Edwards (1957). 

In our previous article (Couch & Keniston, 
1960, pp. 152-153) we described our procedures 
for assessing the agreeing response tendency 
from the sum of Likert-style responses on an 
Agree to Disagree continuum to 360 personality 
test items that had been carefully selected for 


diversity of content. This Over-all Agreement 
Score (OAS) provided our basic measure of the 
agreeing response set. 

Taylor (1961) and Edwards and Walker (1961) 
both suggest that our OAS measure may be 
highly negatively related to social desirability. 
They hypothesize that subjects high on the OAS 
(“yeasayers”) are endorsing socially undesirable 
items, while those low on the OAS are endorsing 
socially desirable items (and rejecting the unde- 
sirable ones). 

In order to test this hypothesis, the correlation 
between the OAS and the SD was computed for 
our population, A coefficient of r = — .34 resulted. 
While this correlation is significant at the .01 
level in the predicted direction, there are three 
points to be considered in interpreting the finding. 
First, the level of the correlation with the SD 
is relatively low in comparison with the level 
obtained with other personality variables that are 
highly related to the OAS; for example, the Anal 
Preoccupation scale (z = .75), the Excitability 
scale (r = .57), and the Conventional Anxiety 
scale (r = .57) (Couch & Keniston, 1960, p. 157). 
Second, the SD is so constructed that approxi- 
mately 77% of the items are keyed “False.” 
Thus, a negative correlation with the OAS would 
be expected on the basis of the agreeing response 
tendency alone. Third, it is pertinent to question 
whether the SD actually represents a pure and 
valid measure of the social desirability variable. 
The SD was developed from a rather limited 
array of items on a single test (MMPI) in an 
area of content centered around the admission or 
denial of anxiety and mental disturbance. This 
component of abnormality (which is also socially 
undesirable) is clearly evident in the following 


items of the SD: 


My hands and feet are usually warm enough. (True) 

I am very seldom troubled by constipation. (True) 

Most of the time I would rather sit and daydream 
than to do anything else. (False) 

My sleep is fitful and disturbed. (False) hes: 

I have had periods in which I carried on activities 
without knowing later what I had been doing. 
(False) 

6. Lam not afraid to handle money. (True) 

7. I frequently notice my hand shakes when I try to 

do something. (False) 

8. I sweat very easily even on cool days. one 

9. 


ind on one thing. (False) 
Brera EEEE someone almost 


wr She 


10. I feel anxiety about something or 
all the time. (False) 
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11. I have been afraid of things or people that I knew 
could not hurt me. (False) 

12. I feel hungry almost all the time. (False) 

13. It makes me nervous to have to wait. (False) 

14. I shrink from facing a crisis or difficulty. (False) 

15. I sometimes feel that I am about to go to pieces. 
(False) 


The admission or denial of anxiety symptoms is 
probably, in part, a reflection of the person’s 
actual degree of disturbance vs. normality. Thus, 
it is not clear what generality the SD has as a 
measure of response set, nor is it clear to what 
extent the variable of “Ego-strength vs. Anxiety” 
is permeating the measure and thereby contami- 
nating the scale. 


Development of the High Mean and Low Mean Scales 


In order to investigate further the relationship 
between agreeing response set and social desir- 
ability, alternative measures of social desirability 
were developed from our own assessment data. 
Following Edwards’ demonstration that the 
probability of endorsement of items is highly 
related to their social desirability, we reasoned 
that items with high means in our population 
would reflect sentiments which are the most 
acceptable (or socially desirable) in this cultural 
setting. Conversely, items with very low means 
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should tend to reflect a lack of social desirability, 
and, thus, promote a high frequency of disagree- 
ment. 

From the total set of 360 items in our OAS 
measure, we selected the 35 items with the highest 
means to form a High Mean scale (Mean 5.22; 
SD 0.35), and the 35 items with the lowest means 
to form a Low Mean scale (Mean 2.41; SD 0.44), 
All items in the High Mean scale had means 
greater than 5.0; while the Low Mean scale 
contained items with means less than 3.0. The 
Likert scale was scored: Strongly Agree (7), 
Agree (6), Slightly Agree (5), No Answer (4), 
Slightly Disagree (3), Disagree (2), and Strongly 
Disagree (1). A sample of the items in these 
scales! is presented in Table 1. 


1 The complete set of 360 items in the OAS measure 
has been deposited with the American Documentation 
Institute. We have also included the means and 
standard deviations of each item, as well as an indica- 
tion of the 35 items in the High Mean and Low Mean 
scales. Also, the complete set of centroid and rotated 
factor loadings for all eight factors has been deposited 
with the American Documentation Institute. Order 
Document No. 6694 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Congress; 
Washington, D. C., remitting in advance $1.75 for 
microfilm or $2.50 for photocopies. Make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 


TABLE 1 
SAMPLE ITEMS FROM THE HiGH MEAN AND Low MEAN SCALES 


Sample Items 


x 
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igh Mean Scale 


ere are days when one awakes from sea eii a care in the world, full of zest 
of ¥ 


and eagerness for whatever lies ahead 
Happiness is one of the primary goals of life. 
A man should be his own harshest judge. 


The world is teeming with opportunities and promises of success for anyone with 


sufficient imagination to ive them. 
Most Beppe you meet are Fiend 


aid. 
T like to think things out ahead of time. 


‘The vast majority of men are truthful and dependable. 


I get along well with le. 
I persist in the face of difficulties. 


Don’t encourage fears and anxieties by dwelling on them. 


Low Mean Scale 


The future looks black as pitch, with little in view to justify a core of hope or faith. 


It is misery to be born, pain to live, grief to die. 


Beneath the polite and smiling surface of man’s nature 
I cannot escape the conviction that fate somehow has 
The prospect is pretty hopeless: it looks as if the nations were heading for their 


doom—one final, fateful, suicidal, global war. 
Shower your friends with gifts; that is the 


of energy in the service of others. 
No one is of great emotional significance to me. 
I couldn’t care less what others think of me. 


The only reason for being “moral” is that you usually get caught if you aren’t, 


lly and obliging, more disposed to aid you than to 


is a bottomless pit of evil. 
it in for me. 


o y way to a full life. 
Tf a man is to fulfill his destiny he can expend no more than a fraction of his supply 


m 


he e aaa b 
bohh h Bho o 
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pet ped pad ped pad O pd p pa pt 
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CRITIQUE AND NOTES 


An inspection of these items lends subjective 
support to the thesis that a high probability of 
endorsement (agreement in this case) indicates a 
high social desirability of the attitude implied by 
the item, for example, “I get along well with 
people.” and “I persist in the face of difficulties.” 
On the other hand, a low probability of endorse- 
ment (disagreement in this case) seems to indicate 
a low social desirability of the item, for example, 
“T couldn’t care less what others think of me.” 
and “The only reason for being ‘moral’ is that you 
usually get caught if you aren’t.” 

A high score on the High Mean scale should 
measure the positive side of the social desirability 
tendency, while a high score on the Low Mean 
scale should measure the opposite tendency to 
endorse socially undesirable items. The nature of 
these two measures of social desirability permits a 
more direct test of the relationship of this ten- 
dency to the agreeing response set. Both scales are 
susceptible to the agreeing response tendency, but 
differ as to whether this effect is compounded with 
a positive or negative measure of social desirability. 
On either the High Mean or the Low Mean 
scales, a high score should reflect a similar positive 
contribution of the agreement tendency; however, 
the scales should involve opposite contributions 
from the social desirability tendency. Thus, both 
scales should have positive correlations to the 
OAS in so far as agreeing response set permeates 
the measures. 

Reasoning on the basis of the social desirability 
variable, the High Mean and Low Mean scales 
should have opposite relationships to each other 
and to the SD—since the High Mean scale is a 
Positive measure of the social desirability tendency 
while the Low Mean scale is a measure of the 
reversed tendency (social undesirability). If both 
response set tendencies are operating to equal 
degrees, then there should be no relationship 
between the two Mean scales. The computed 
intercorrelations between these response set scales 
are presented in Table 2. 

Table 2 shows that both the High Mean and 
Low Mean scales are significantly related in a 
positive direction (r = .65 and .42, respectively) 


TABLE 2 
INTERCORRELATIONS OF RESPONSE SET MEASURES 


peels OAS Mean | Mean 
OAS 1.00 —.34| .63] 42 
SD 1.00 | —.16 | —.38 
High Mean 1.00 | —.23 
Low Mean (N = 61)* 1.00 


* 105 level of significance (r = .26). 
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with the OAS measure. The fact that the correla- 
tion with the OAS is considerably higher for the 
High Mean scale than for the Low Mean scale 
appears to contradict Taylor’s (1961) contention 
that agreeing response set would be negatively 
related to the social desirability tendency. The 
evidence, in fact, points in the opposite direction. 

The expected negative correlation between the 
Low Mean scale and the SD is found (r = —.38); 
however, there is a rather surprising lack of 
relationship between the High Mean scale and the 
SD (r = —.16). In our population, there is no 
relationship between the tendency to agree with 
the most frequently endorsed (agreed with) items 
and scores on the SD. 

The expected lack of relationship between the 
High Mean and Low Mean scales is found (r = 
—.23), thus, lending some support to our hy- 
pothesis of opposing components of response sets. ? 
However, the explanation of this finding is not 
clear from the present set of rather confusing 
intercorrelations. We therefore turned to a more 
thorough approach using factor analysis. 


Factor Analysis of MMPI Scales. and Response 
Set Measures 


Much previous research on the determinants of 
personality test scores has turned up convincing 
evidence for the presence of two pervasive response 
sets—namely, the tendency to agree (or disagree) 
with items regardless of their content, and the 
tendency to endorse (or reject) items that appear 
socially desirable. Since the effects of these response 
sets are frequently compounded with one another, 
it is difficult to identify their independent corre- 
lates in the personality realm or even to decide 
which response set explains a particular pattern 
of findings. Having developed measures that 
assess these tendencies in different ways, it should 
be more possible to study their unique charac- 
teristics in a domain of personality variables. 

Since much previous work as well as the present 
controversy has centered on the personality 
correlates of response set in the MMPI, it seemed 
appropriate to factor analyze a combined domain 
of MMPI scales and response set measures. For 
each subject, we had scores for 32 MMPI scales 
that measured personality variables, a Sum True 
score that represented the total number of “True” 
responses given on the MMPI, the SD, the OAS, 
and the High Mean and Low Mean scales. These 37 
variables were intercorrelated and the resulting 
correlational matrix was factor analyzed by the 
Thurstone Complete Centroid method. Eight 
factors were extracted which altogether accounted 
for approximately 787% of the total variance. 

Since our interest is focused on the response set 
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tendencies, the first two factors were rotated 
through the main measures under discussion : 
namely, the SD and the OAS. A third factor was 
positioned in reference to the High Mean and 
Low Mean scales. Subsequent factors have been 
disregarded in this analysis. In Table 3 we have 
presented these rotated orthogonal factor loadings. 

The very high loadings of the SDS on Factor I 
(88) and the OAS on Factor II (.87) allow us to 
conclude that the first two dimensions clearly 
reflect the main aspects of these Tesponse set 
measures. It should be noted that the OAS 
measure is not loaded on the SD factor, and the 


TABLE 3 
Rortatep Factor Loapincs or MMPI AND RESPONSE 
ET 


Scales* I pig m hb 
—___| 
1-L -06 | —.40 | —.17 61 
2-F —.50 «15 | —.23 78 
3-K +64 | —.46 | —.06 86 
4-Hs =.95 -30 -05 64 
5-D —.61 +22 | —.41 86 
6-Hy —.25 | —.09 +09 58 
1-På —.49 -18 | —.27 61 
8-Mf —.23 +08 -18 31 
9-Pa —.4 .12 -11 70 
10-Pt —.78 -53 | —.09 94 
11-Se —.80 40 | —.13 98 
12-Ma 22 44 17 76 
13-St -55 | —.05 32 80 


Ẹ 
Y 
1 
a 
a 
8 
l 
N 
© 
s 


15-Do «S1 | —.15 +12 67 
16-Re +32 | —.58 06 81 
17-Ac -63 | —.39 +13 79 
18-5, -76 .07 -06 82 
19-Te -59 | —.26 -12 84 
20-Ai -22 | —.47 -13 75 
21-Si —.80 04 | —.15 88 
22-Es -59 | —.35 -20 78 
23-A —.76 54] —. 
24-R —.36 | —.44 | —.27 -62 
25-Dp —.69 -58 | —.05 .96 
26-Fm —.47 -46 | —.29 -64 
27-R; «56 | —.44 | —.07 73 
28-To -46 | —.61 -33 -98 
29-X1 -58 | —.02 +25 74 
30-X2 —.45 55 -06 73 
31-0i «06 | —.27 +19 50 
32-Lp -76 | —.36 | —.01 90 
33-Sum True — .43 74 -06 99 
34-SD -88 | —.38 -00 92 
35-OAS -00 -87 -00 76 
36-High Meane -16 17 -55 +92 
37-Low Mean —.33 -23 | —.58 -52 
Za = 10.56 | 6.74 | 1.82 | 28.72 


^ The MMPI scales are given standard abbreviations. These 
are explained more fully in our previous report (Couch & Keniston, 
1960, p. 155). 

P The communalities are for eight factors. 

° An additional High Mean scale of 50 items has ‘been omit- 
ted from this analysis. 
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SD measure has only a small loading (—.38) on 
the OAS factor. Since the factors are orthogonal, 
this pattern implies that the response set ten- 
dencies reflected in these measures are essentially 
independent of each other, 

The Factor I loadings on the MMPI scales 
correspond closely to the same pattern of relation- 
ships that Edwards (1957) has previously noted as 
the correlates of the SD. In terms of previous 
factor analytic studies of the MMPI (Kassebaum, 
Couch, & Slater, 1959), this “SD dimension” 
Seems to represent the general disposition of 
Anxious Social Withdrawal vs. Normal Social 
Participation. 

The Factor I loadings of the MMPI scales show 
very much the same pattern as the correlations of 
these scales with the OAS measure, as reported 
in our previous article (Couch & Keniston, 1960, 
p. 155). The general dimension might best be 
termed Emotional Impulsivity vs. Control. 

Of particular interest are the loadings of the 
Several response set measures on the first two ] 
factors. The Sum True measure from the MMPI 
is negatively related (—.43) to the SD Factor I 
and positively related (.74) to the OAS Factor II, 
The tendency to answer “True” on the MMPI 
apparently involves both the tendency to endorse 
socially undesirable admissions of anxiety and the 
agreeing response tendency itself. 

As already noted, the SD has a small negative 
loading (—.38) on the OAS Factor I. This part of 
the SD’s variance may be that portion due to 
the high percentage of “False” responses on the 
scale, 

In view of the present controversy, it is quite 
significant that the High Mean and Low Mean 
scales do not have any substantial loadings on the 
first SD factor. The only high loading of these 
scales on the first two dimensions is the positive 
one (.77) of the High Mean scale on the OAS 
Factor II. Apparently, agreement with “socially 
desirable” items is a relatively good measure of 
the agreeing response tendency; whereas, agree- 
ment with “socially undesirable” items is a far less 
appropriate indicator of that same tendency. 

The lack of substantial loadings of the High 
Mean and Low Mean scales on the SD Factor I 
make it clear again that these new response set 
scales of social desirability are not highly related 
to the SD measure. This finding throws some 
doubt upon the validity of the SD as a general 
measure of the social desirability variable. 

The most interesting new finding is that the 
High Mean and Low Mean scales emerge as the 
clear defining points of a third bipolar factor. The 
high positive vs. negative loadings suggest that 
these two scales are measuring some bipolar 
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response tendency related to the “endorsement” 
variable used in their method of construction. 
The best measure of this third factor would be a 
Balanced (High-Low) Mean scale that would 
cancel out the agreeing response set. The factor 
loadings of this scale as estimated by Thomson’s 
Pooling Square technique are: 


I I 
Balanced Mean scale 33 A ery 3 74 


The loadings of this Balanced Mean scale on 
the first two factors indicate quite low relationships 
to the general dimensions reflected in the SDS 
and the OAS measure. The high loading on the 
independent third dimension suggests that these 
new scales may represent a major aspect of the 
social desirability tendency that is not related to 
the SD. 


SUMMARY 


Some questions have been raised concerning 
the relationship of the agreeing response tendency 
to the social desirability variable. In an attempt to 
investigate this problem we have presented evi- 
dence suggesting that agreeing response set as 
measured by the OAS is independent of whatever 
aspect of the social desirability tendency is 
measured by the Social Desirability scale. 

Certain new measures—the High Mean and 
Low Mean scales—were developed to assess the 
social desirability tendency, using the degree of 
endorsement by the subject population as a 
criterion of construction. These inductive measures 
of social desirability were not highly related to 
the SD. 

A factor analysis of the several response set 
measures and an array of standard MMPI scales 
supports our contention that the agreeing response 
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tendency and the social desirability variable are 
independent of one another. The emergence of a 
third orthogonal factor defined by the High Mean 
and Low Mean scales suggests the possibility that 
a major aspect of the social desirability tendency 
is unrelated to the SD. 

Although we have not carefully examined the 
Personality correlates of the different social 
desirability measures in this paper, the factor 
analysis implies that the SD measure is centered 
around the admission or denial of anxiety, while 
the High Mean and Low Mean measures are 
centered around a conformity to conventional 
values, 

The manifestation of different response sets in 
the area of personality tests is quite obviously a 
challenging and complex problem—and well 
merits further examination by interested 
researchers. 
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T view of the considerable body of research 
published during the past 9 years on the 
pervasive influence of the social desirability 
variable on scores on personality scales of the 
self-report type, we regard it as unfortunate that 
our note (Edwards & Walker, 1961) was necessary 
in order to call the attention of Couch and Keniston 
(1961) to the possible confounding of social 
desirability and acquiescence in their OAS. 

In their original article, Couch and Keniston 
(1960) found that MMPI scales with a large 
percentage of items keyed True tended to correlate 
positively with the OAS while MMPI scales with 
a large percentage of items keyed False tended to 
correlate negatively with the OAS. They inter- 
preted these findings as “... indicating that 
scales with an unusually high (or low) percentage 
of items scored True are significantly influenced 
by the agreeing response set” (p. 154) which they 
regard the OAS as primarily measuring.’ 

In our note (Edwards & Walker, 1961) we 
called attention to an alternative hypothesis: 
that scores on both the OAS and MMPI scales 
may also be influenced by the tendency of subjects 
to give socially desirable (or undesirable) responses. 
Using their data, we reported a correlation of .77 
between the OAS correlations with the MMPI 
scales and the percentage of keyed socially unde- 
sirable responses in the MMPI scales. The corre- 
sponding correlation between the OAS correlations 
with the MMPI scales and the percentage of 
keyed True items in the MMPI scales was .85. 
On the basis of these results, we concluded that 
the OAS correlations with the MMPI scales 
were consistent with both the hypothesis that 
scores on the OAS (and MMPI scales) are influ- 
enced by social desirability tendencies and the 
hypothesis that they are influenced by acquiescent 
tendencies. We also stated that there was no 
compelling evidence in favor of either hypothesis. 

After reading their second article (Couch & 
Keniston, 1961), we arrive at exactly the same 
conclusion and point of view we expressed in our 
note. We still believe, for reasons developed 
below, that they have not presented any conclusive 
evidence in favor of either hypothesis. 


1If this statement is taken seriously, then one 
may ask why it is that the OAS correlates only .01, 
-16, .19, .10, and .14 with five of the Thurstone Tem- 
perament scales in which the corresponding proportion 
of keyed Yes responses is 1.00, .95, -90, .80, and .70, 
respectively. Couch and Keniston offer no explanation 
for these “exceptions.” 


CorreLations or ARS witH THE SD 
OAS Scale 


Couch and Keniston find that the OAS scale 
correlates negatively and significantly with the 
SD. They suggest that the negative correlation 
can be accounted for by the fact that 77% of the 
items in the SD are keyed False; thus, a negative 
correlation is to be expected with the OAS on the 
basis of the agreeing response tendency operating 
in the SD.* We suggest, as an alternative hypoth- 
esis, that the negative correlation is the result of 
the fact that subjects who obtain high scores on 
the OAS are giving, in general, somewhat more 
socially undesirable responses to the items in the 
OAS than subjects with low scores. In other words, 
we would predict that the OAS contains more 
items with socially undesirable scale values than 
items with socially desirable scale values. If this 
is the case, then a subject who has a tendency to 
give socially desirable responses would be more 
likely to disagree with the items with socially 
undesirable scale values and, thus, obtain a low 
score on the OAS. Do the facts support this 
alternative hypothesis? 

Social desirability scale values are not available 
for all 360 items in the OAS, but we do have the 
mean ratings assigned the items on the seven-point 
agreement scale. The proportion of the total 
number of items with mean ratings in each scale 
category is as follows: Strongly Agree (.000), Agree 
(.000), Slightly Agree (.105), No Answer (.410), 
Slightly Disagree (.321), Disagree (.152), and 
Strongly Disagree (.011). Thus, none of the items 
in the OAS have mean ratings in the Strongly 
Agree, or Agree category. The distribution of these 
ratings, in view of the consistently found relation- 
ship between probability of endorsement of an 
item and its social desirability scale value, suggests 
to us that the OAS is more heavily weighted with 
with socially undesirable scale values than it is 
with items with socially desirable scale values. 
The negative correlation of the OAS with the SD 
is compatible with this analysis. 

One may raise the question, as Couch and 
Keniston do, as to why the negative correlation, 
although significant, is not higher than it is. It has 
been suggested elsewhere (Edwards, 1957; Ed- 


*The problem of acquiescence in the SD has not 
been ignored. It was first discussed by Edwards 
(1957, pp. 34-39) and evidence was presented bearing 
upon the issue. A more detailed discussion with further 
evidence is given by Edwards (1961). 
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wards, Heathers, & Fordyce, 1960) that relatively 
low correlations of personality scales with the SD 
may be obtained when the scale has (a) a relatively 
large number of items with neutral social desir- 
ability scale values, or (b) a fair degree of balance 
in the social desirability keying of the items. 
Assuming, as Couch and Keniston do, that the 
mean rating assigned the items on the agree 
response continuum is some indication of the 
social desirability scale value of the item, we note 
that .836 of the total number of items have mean 
ratings in the middle three categories (Slightly 
Agree, No Answer, and Slightly Disagree) of their 
continuum. No items have mean ratings in the two 
extreme (socially desirable) Agree and Strongly 
Agree categories and, thus, the remaining rela- 
tively small proportion, .164, of the items fall in 
the two extreme (socially undesirable) Disagree 
categories, If the distribution of items had been 
symmetrical about the middle category, we believe 
the correlation between the OAS and the SD 
would have been lower than the reported value. 
It is the loading of the OAS with many neutral 
items and the slight weighting in favor of the 
Disagree items that we believe results in the 
relatively low negative correlation of the OAS 
with the SD. 


High Mean Scale 


The High Mean scale is found by Couch and 
Keniston to have a low and insignificant negative 
correlation with the SD. They argue that since 
the 35 items in this scale all have “high” mean 
agreement ratings, then they must also have 
“high” social desirability scale values, Of impor- 
tance here is what is meant by “high.” The mean 
agreement rating of the 35 items in the High 
Mean scale is 5.22 and this is their Slightly Agree 
category. None of the items have mean ratings in 
the Agree or Strongly Agree category. It seemed 
questionable to us, therefore, whether one could 
assume that all of these items had “high” social 
desirability scale values, and we decided to have 
the 35 items rated for social desirability. 

We presented the 35 items in the High Mean 
scale to 46 students at the University of Washing- 
ton and obtained ratings of social desirability for 
each item on a nine-point scale, using methods 
described by Edwards (1957). The mean social 
desirability rating of the 35 items on the nine- 
point scale was 6.3 with a standard deviation of 
1.66. This category on the social desirability 
continuum is labeled “Mildly Desirable” and is 
adjacent to the “Neutral” category. We believe, 
on the basis of these findings, that Couch and 
Keniston are incorrect in their belief that the 
items in the High Mean scale all have “high” 
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social desirability scale values and that, therefore, 
the High Mean scale should have a high positive 
correlation with the SD. As we shall show later, 
we believe, because of the nature of the items in 
the High Mean scale, that it is a better measure of 
the agreement response set than either the OAS 
or the Low Mean scale. 


Low Mean Scale 


That a negative and significant correlation 
should be obtained between the Low Mean scale 
and the SD is again consistent with the fact that 
the 35 items in this scale have mean agreement 
ratings in the two extreme (Disagree and Strongly 
Disagree) categories and, thus, this scale may be 
expected to be more sensitive to social desirability 
tendencies. We suspect, however, that if the 
items in the Low Mean scale were scaled for 
social desirability some of them would be fairly 
close to the neutral section of the social desirability 
continuum and that possibly some would fall on 
the socially desirable side of the continuum. It is 
possible, in other words, that there is also some 
degree of balance in the social desirability keying 
of the items in this scale. 


Tue FACTOR ANALYSIS 


As anyone who has ever worked with factor 
analytic techniques knows, rotations, unless one is 
seeking simple structure, are completely arbitrary. 
A plot of the rotated factor loadings given by 
Couch and Keniston will show convincingly that 
their choice of a rotation did not result in simple 
structure and, for that matter, neither does the 
rotation we have chosen to make.* We leave it to 
the reader, therefore, to decide whether the rota- 
tion made by Couch and Keniston makes better 
psychological and psychometric sense than the 
rotation we have made. We believe that it would 
be very informative for the reader to make, in 
the best Thurstone tradition, the actual plots of 
the rotated factor loadings as given by us in 
Table 1 and as given by Couch and Keniston. 

The loadings of various reference scales on our 
two orthogonal factors are given in parentheses in 
our discussion with the loading on the first factor 
given first and the second factor second. The posi- 
tive pole of our first factor is defined by the SD 
(.90, .00) and various other scales which have 
been shown in the past to have high positive 
correlations with the SD. Similarly, the negative 
pole is defined by various MMPI scales which 


3 We chose a rotation that would maximize the 
loading of the SD on the first factor and minimize it 
on the second, whereas Couch and Keniston chose a 
rotation which would maximize the OAS on the second 
factor and minimize it on the first. 
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TABLE 1 
Rotatep Factor Loaprncs or MMPI AND RESPONSE 
Ser SCALES 
Scales I Ir 
L +13 —.56 
F —.63 — .33 
K 73 —.36 
Hs —.70 —.03 
D -.71 —.41 
Hy +22 EJ 
På —.55 —.19 
Mf —.07 -06 
Pa — 48 125 
Pt —.92 +14 
Sc —.94 —.10 
Ma —.40 47 
St 65 37 
Pr — .82 -28 
Do 59 -03 
Re -67 —.34 
Ac 81 +02 
Sp 68 37 
Te 79 13 
At -60 —.18 
Si —.69 —,29 
Es -16 +10 
A =:91 -16 
R =.11 — .66 
Dp — .86 .22 
Fm meee g 
R : =. 
To :79 —.29 
X1 -63 52 
X2 —.61 44 
Oi .33 —.15 
Lp 85 -00 
Sum True —.74 „55 
SD -90 -00 
High Mean (omitted)* 12 78 
High Mean =r Wy hae 
ean oa . 

Low Mean —.46 —.09 

Za? 15.96 4.87 


sss 
This High Mean scale was included in the Couch and Keniston 
(1961) factor analysis, but omitted from their discussion. 


have been shown in the past to have high negative 
correlations with the SD. We believe our first 
factor can be simply and accurately described as a 
social desirability factor. Further evidence on 
this point is given by Fordyce (1956) and Edwards 
and Heathers (1961). 

The negative pole of the second factor is defined 
by two scales, R (—.11, —.66) and Z (.13, —.56), 
two scales which in the past have been found to 
have relatively low correlations with the SD 
It is of significance to note that the R scale con- 
tains 40 items and that all of the items are keyed 


4 Psychometric explanations of why R and L should 
and do correlate to a relatively low degree with the 
SD are presented by Edwards, Heathers, and Fordyce 
(1960) and in greater detail by Edwards (1961). 
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False and that the 15 items in the L scale 
also all keyed False. We believe that the seco 
factor can be described as acquiescence and th 
and L define the negative pole of the factor, 
Positive pole of the second factor is defined by ti 
Couch and Keniston High Mean scale (—.17, 
and by another High Mean scale (.12, .78) 
sisting of 50 items which they included in thei 
factor analysis but omitted from their discussi 
Our original note stated that we believed 
scores on the OAS might possibly be the result 
both acquiescence and social desirability. TI 
rather substantial loadings of the OAS (—.45 
on both the social desirability and acqui 
factor support our original contention. 


CONCLUSION 


One may, of course, speculate at consid 
length as to whether what has been consis 
identified, in considerable research during 
past 9 years, as a social desirability variable 
really Anxious Social Withdrawal vs. Norms 
Social Participation, Psychoticism vs. No: malit 
Ego-Strength vs. Ego-Weakness or some oth 
equally dynamic variable. We do not believe tha 
these and various other reinterpretations of th 
social desirability variable serve to clarify mat 
in any degree, although the new labels may 
more satisfying to the psychologist who is clinicall} 
and dynamically oriented. i 

We have nothing against personality dynam ics, 
but we do not believe they should be invol ed 
when relatively simple psychometric consideration 
are adequate to account for the correlations 
have been found between various perso: J 
scales and the SD. And, just as the correlati 
of personality scales with the SD can be ac 
counted for in terms of the psychometric proper! 
of the scales (Edwards, 1957, 1961; Edw 
Heathers, & Fordyce, 1960) so also can 
first factor loading of the MMPI scales be 
counted for in terms of the zero-order correla 
of the MMPI scales with the SD (Edwards 
Heathers, 1961). y 
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A NOTE ON ROKEACH’S THEORY OF PREJUDICE 


HARRY C. TRIANDIS 
University of Illinois 


Res Smith, and Evans (1960), in 
Rokeach’s stimulating book The Open and 
Closed Mind, state that “insofar as psychological 
processes are involved, belief is more important 
than ethnic or racial membership as a determinant 
of social discrimination” (p. 135). To prove their 
point, they asked subjects to respond to a nine- 
point scale ranging from “I can’t see myself 
being friends with such a person” to “I can very 
easily see myself being friends with such a person.” 
The stimulus persons used were persons of the 
same or different race or religion as the subjects, 
who had the same or different beliefs as the 
subjects. All possible combinations of race and 
belief were used. The beliefs dealt with God, 
communism, labor unions, socialized medicine, 
etc. The writers state that “most of the time they 
(the subjects) discriminate on the basis of belief 
and not on the basis of racial or ethnic group when 
they are given the opportunity to react to social 
stimuli differing simultaneously on both charac- 
teristics” (p. 153). Thus, they suggest that the 
subjects prefer people of a different race or religion 
who agree with them on questions of belief over 
people of the same race or religion who disagree 
with them. 

If Rokeach et al. were limiting their argument to 
friendship choices there would be no reason to 
quarrel with their interpretation of their findings. 
The view that friendship choices are based on 
belief similarity has been suggested by other 
writers (e.g, Newcomb, 1953, 1956, 1958). 
Triandis (1959) has found that similarity in 
aspects of the belief systems of subordinates and 
supervisors is related to interpersonal liking 
between them. But when Rokeach et al. discuss 
their findings under the title of “prejudice” the 
Present writer must object. A concept more general 
than prejudice is the concept of social distance. 
Rokeach’s research technique used friendship as 
the variable under investigation, and so inevitably 
dealt only with a relationship involving small 
social distance. Prejudice and discrimination are 
much more relevant to acceptance or rejection of 
a relationship involving relatively large social 
distance, such as acceptance of a person as a 
neighbor or as a student in one’s university. In 
the present study, Rokeach’s hypothesis was 
tested by means of a questionnaire covering a 
wide range of social distances and was not sup- 
ported by the evidence. 


Mersop 


A questionnaire was used that involved a 100-point 
scale of social distance ranging from willingness to 
marry a person (0 social distance) to willingness to 
participate in the lynching of the person (social distance 
of 97 points). This scale was developed by the present 
writer in a previous study (Triandis & Triandis, 1960), 
The scale values were computed using the Thurstone 
successive intervals procedure, which yields an equal 
interval scale, The scale included statements concern- 
ing the acceptance of the person as an intimate friend 
(social distance of 11 points), as a roommate (30 
points), as a personal chum in the respondent’s club 
(31 points), as a speaking acquaintance (52 points), 
and also statements referring to the unwillingness of 
the respondent to accept the stimulus person as a 
neighbor (70 points), or “in our universities” (81 
points). 2 

The subjects were presented with descriptions of 
16 stimulus persons arranged in a factorial design. The 
16 stimuli appear in Table 1. Each stimulus included 
the specification of the race, religion, belief system, 
and occupation of the person who was to be judged on 
the scale of social distance. Two values of race (Negro 
and white), two values of religion (same denomination 
as the subject and different religion from the subject!), 
two values of occupation (bank manager and coal 
miner), and two values of belief system were used. 
The two values of belief system were obtained by 
asking the subjects to read the 13 ways of life that 
were prepared by Morris (1956). The subjects were 
told that they would be examined on their content later 
and were asked to indicate the most preferred and the 
most rejected way of life. The subjects were then 
instructed to think of the most preferred way of life 
when the words “has the same philosophy as you’ 
appeared in the questionnaire and to think of the most 
rejected way of life when the words “has a different 
Philosophy” appeared. 5 

For each of the 16 stimulus persons, subjects re- 
sponded to the 15 items of the social distance question- 
naire. These responses were scored as in Triandis and 
Triandis (1960, p. 112). Thus, for each stimulus person 
a single score representing the social distance experi- 
enced towards “him” was obtained. The 16 scores 
obtained from each of the 112 subjects were subjected 
to an analysis of variance. 7 

The subjects were 112 University of Illinois white 
undergraduates of both sexes. 


Resutts 3 
The means and standard deviations obtained 
from the 112 subjects to the 16 stimuli appear in 
1 Roman Catholic, Protestant, or Jewish denomina- 


tions, depending upon which appeared to the subject 
to be the most different from his own religion. 
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Table 1. The analysis of variance of these mean 
scores is shown in Column 1 of Table 2 and that of 
the 1,792 scores obtained from the 112 subjects is 
shown in Table 3. 

Table 1 shows, as expected, that the same 
philosophy, same religion, white bank manager is 
acceptable as an intimate friend while the different 
religion, different philosophy, Negro coal miner is, 
on the average, not accepted as any kind of 
friend, although he is accepted in the same apart- 


TABLE 1 
Meran SOCIAL DISTANCE SCORES OBTAINED FOR EACH 
STIMULUS 
Stimulus Person Manot 
White, same philosophy, same re- 11.1 0.014 
ligion, manager 
Same philosophy, white, bank 12.5 0.39 
manager, different religion 
Same philosophy, white, coal 17.0 2.22 
miner, same religion 
Different philosophy, white, bank 16.2 0.67 
manager, same religion 
Same philosophy, white, coal 18.7 24 
miner, different religion 
Different philosophy, white, bank 20.0 1.46 
manager, different religion 
Same philosophy, Negro, bank 21.6 3.98 
manager, same religion 
Different philosophy, white, coal 23.5 3.28 
miner, same religion 
Same philosophy, Negro, bank 27.2 5.36 
Dineen, different religion 
ifferent philosophy, white, coal 29.6 4.00 
aea ifferent religion 
i SN Pet, Negro, 31.5 4.04 
anager, same religion 
Same Philosophy, Negro, coal 32.3 6.00 
2 Dia, same religion 
ee philosophy, Negro, 35.6 4.95 
ink manager, different religion 
ae Philosophy, Negro, coal 37.1 5.92 
DR, different religion 
‘erent philosophy, Negro, coal 39.0 4.83 
Dita same religion 
erent religion, Negro, coal 45.8 4.96 


miner, different philosophy 


TABLE 2 
ANALYsis OF VARIANCE OF MEAN SCORES 
; Oth minded 
oan All Subjects | Other, | Oat fects 
ss |%| ss |%| ss |% 
Race eat 
0 A 2.71 | 59 | 3.68 | 64 | 1.93 | 50 
nee "3 | 18 | a2 | 14 | <85 | 22 
R p y Es 16 -79 | 14 -70 | 18 
Tnteractiong AEE 4] $ 
Te 
otal 4.62 5.74 3.87 
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TABLE 3 
ANALYsIS OF VARIANCE OF INDIVIDUAL SCORES 


Source ss | %| ay |Y lr} » 


ance 
Race 302.70/15.7| 1{302. 701378 <.001 
Philosophy 83.27] 4.3] 1| 83.27/104 |<.001 
Occupation | 93.21] 4.9] 1| 93.211115 |<.001 
Religion 24.26) 1.2) 1| 24.26) 30 }<.001 
Subjects 1415.60|73.5|1776} 80] 
Interactions 7.33) .3) 11) — 

Total 1926.37 1791 


ment house. The large standard deviations of the 
Negro stimuli reflect the exclusion of all Negroes 
from the neighborhood (scale value 70) in the 
case of many subjects, while the majority of 
subjects accepted some Negroes as friends of 
some sort (scale value 41 or lower). It is interesting 
that the Negro coal miner of the same philosophy 
and religion has the highest standard deviation, 
a fact that indicates considerable ambivalence 
among subjects. 

Table 3 shows that while all treatments produce 

ignificant differences in social distance, race is 
by far the most important determinant, in the 
sense that it controls more variance than the 
other factors, Race is about four times as important 
as belief system, which is about as important as 
occupation. The analysis of variance that is based 
on the means of the 112 subjects for the 16 stimulus 
persons (Table 2) shows similar results. 

Table 1 throws more light on the matter under 
investigation. We find that neither the person who 
has all positive characteristics except belief 
system nor the person who has all positive charac- 
teristics except race are acceptable as intimate 
friends. However, the latter receives a significantly 
larger (p < -0001) mean social distance score than 
the person who differs in belief system. For all 
similar comparisons using the data in Table 1 we 
obtain the same results. There is no doubt that 
race is a more important determinant of social 
distance than is belief system. ei 

Many writers in the field of prejudice have 
argued that people who are characterized by high 
tolerance for ambiguity and open-mindedness 
are less prejudiced. Since one of Morris’ (1956) 
ways seems to deal with this matter (Way No. 7: 
“We should at various times and in various ways 

from all other paths of 


something 

life....”), We compared the scores of the 53 
subjects who had chosen this way as their most 
preferred way with those of the 59 subjects who 
picked other ways. The analyses of variance of 
these two groups are shown in Columns 2 and 3 
of Table 2. The open-minded subjects tend to be 
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less prejudiced (p < .20), yet race is still a very 
important determinant of the variance of their 
social distance scores. 

It is also of interest to consider whether the 
results obtained by Rokeach can be replicated 
with our data. One of Rokeach’s approaches was 
to list the number of times a person with a different 
race or religion was preferred over a person with a 
different belief system. To approximate this 
measure, we compared the social distance due to 
race and that due to differences in beliefs and 
tabulated how frequently one was greater than, 
smaller than, or equal to the other. We found 
that 41 subjects showed greater social distance 
due to race than due to belief. The social distances 
due to race and belief were equal for 37 subjects, 
and a smaller social distance due to race than due 
to belief was obtained in 34 cases. The trend goes 
against Rokeach’s findings, but the chi square is 
not significant. Thus, our data suggest that race is 
as frequently a factor in social distance as differ- 
ences in the belief systems of the two persons. 
But while belief system frequently affects social 
distance, it does not often involve a great deal of 
social distance, People do not exclude other 
people from their neighborhood, for instance, 
because the other people have different belief 
systems, but they do exclude them because they 
are Negroes. Rokeach’s magnificent attempt at 
subsuming all the data under one simple notion 
succeeded only because of the design of his study. 
Had he considered something more than friendship 
he would not have obtained his results. It seems 
that an adequate theory of prejudice is much 
more complicated than Rokeach’s. The author 
(Triandis & Triandis, 1960) has suggested a theory 
that can subsume Rokeach’s as a special case. 

The present study is also open to criticisms, 
For instance, is the distance between the most 
preferred and the most rejected value system, in 
Morris’ (1956) scheme, as great a distance between 
belief systems as one could obtain in life? Probably 
not. On the other hand, the distance between 
communist and anticommunist (beliefs used by 
Rokeach) may in fact be smaller than the distance 
between some of Morris’ ways and yet might 
have controlled much more variance than the 
differences in philosophy used in the present study. 
“Communism” in the present climate of inter- 
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national relations includes “surplus meaning” 
involving visions of saboteurs, murderers, etc, 
It seems likely that the use of beliefs involving 
communism, God, etc. accentuated Rokeach’s 
results and favored his hypothesis. His data 
dealing with labor unions (p. 141, Table 7.1) were 
mush less compelling than his data on the other 
belief systems. 

The present note has been quite negative to 
Rokeach’s theory of prejudice. This in no way 
detracts from the great value of the other parts 
of his most stimulating book, 


SUMMARY 


Rokeach’s hypothesis that prejudice can be 
accounted for primarily by differences in the 
belief systems between two persons, or between a 
person and members of a racial or religious group, 
was tested and rejected. Rokeach’s own experi- 
mental confirmation of his hypothesis is due to 
an artifact of his method of measuring prejudice. 
His position is re-evaluated in the light of the 
present findings. 
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E ooe (1961) method is, in my judgment, 

inappropriate and so also is his conclusion. 

_ The essence of the difference between his 
method and ours is that while we asked subjects 
to respond to specific social stimuli such as “Negro 
who believes in God” and “White atheist,” 
Triandis asks his subjects to respond to “Negro, 
same philosophy” and “White, different philoso- 

_ phy.” Triandis reports that race is now a far more 

~ important determinant of social distance than 

belief, quite the opposite to our findings. 

‘Triandis’ conclusion stems not so much from 
his use of a social distance scale, which he makes 
so much of, but from his use of the terms “same 
philosophy-different philosophy,” which he makes 
so little of. What do these terms mean to Triandis’ 
subjects? Triandis tells us that his subjects first 
studied Morris’ (1956) Thirteen Ways of Life, 
then picked the “most preferred” and the “most 
rejected” of these 13, and then imagined the 
“most preferred Way” when the stimulus “same 
Philosophy” appeared, and imagined the “most 
_ Tejected Way” when the stimulus “different 
philosophy” appeared. 

„By way of example, Morris reports that Way of 
Life 7 (“preserve the best that man has attained”) 
is generally the most preferred, and Way of Life 13 
(“obey the cosmic purposes”) the least preferred. 
_ These Ways are presented to subjects in the form 

of paragraphs containing many complex ideas. 

f Each paragraph is over 100 words long. Thus 
Triandis’ subjects must somehow have substituted 

for “Negro, same philosophy” and “White, 

different philosophy”: “Negro, that-long-and- 

_ involved-paragraph-on-Way-7” and “White, that- 

long-and-involved-paragraph-on-Way-13.” 

Triandis does not tell us what his subjects 
remembered of these two paragraphs (not to 
Speak of the 13). Were these paragraphs ego 
involving? I doubt it. The belief variable pitted 
against race is vague, abstract, overintellectual, 
and complicated; hence weak! And the polarities, 

Same” and “different philosophy,” are not 
really opposites, just different. They are not Ways 
producing social conflict in everyday life. They 
_ Tepresent ideals by which “people say they would 
like to live, and not how they do live” (Morris 
& Jones, 1955, p. 534). Thus the difference between 

same” and “different philosophy” is a difference 

_ Which, psychologically and sociologically, makes 
little difference, 
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Compare the impact of “Negro, same philos- 
ophy” and “White, different philosophy” with the 
impact of “Negro who believes in God” and 
“White atheist.” Here both race and belief are 
salient. Both variables are specific, concrete, 
easy to imagine, and easy to remember. Both 
variables are ego involving, and it is easy to 
imagine that such people exist even if one has 
not personally met them. 

Triandis suggests that our results are artifacts 
due to our selection of a particular socially po- 
larized belief (communism) to pit against the 
race variable, To make sure our results were not 
artifacts we repeated our research with eight 
widely differing issues: God, communism, desegre- 
gation, etc. These issues are socially salient and 
controversial, though not equally so, On all eight 
issues we found friendship ratings to be more a 
function of belief than race, though not equally so. 
These results hold in the South as well as in the 
North. To these eight issues Triandis now adds a 
ninth: “same philosophy-different philosophy,” 
and his results are now exactly the opposite. I 
have tried to state why, in my opinion, this 
ninth replication is not a crucial one, but rather a 
trivial one, All our data and Triandis’ are con- 
sistent with the following proposition: the more 
salient a belief, the more will belief congruence 
override racial or ethnic congruence as a determi- 
nant of social distance. 

The preceding considerations notwithstanding, 
the issue of race versus belief as determinants of 
discrimination is far from settled. A crucial test 
of the belief congruence hypothesis will not be 
forthcoming from more questionnaire studies 
using different wordings, but from research on 
what people do in real life. A preliminary study 
along this line is reported in The Open and Closed 
Mind (1960, p. 156). $ 

Perhaps our findings relevant to the belief 
congruence hypothesis can be made more palatable 
if certain ideas previously discussed (p. 164) are 
rephrased. Only when social constraints are 
placed on a person would we expect him to dis- 
criminate in a way he otherwise would not, if 
left alone by society. No society constrains all 
the actions of its members, and in those actions 
not subject to social sanction discrimination 
along racial or ethnic lines would not take place, 
not even in the South. In other words, the locus 
of racial and ethnic discrimination is to be sought 


187 


188 


in society, not in the individual’s psyche. If 
society’s constraints were altogether removed 
from a particular segment of life man would 
still discriminate, if discriminate he must, not in 
terms of race or ethnic grouping, but in accord 
with his basic psychological predisposition, 
characteristic of all human beings, to organize 
the world of people in terms of the principle of 
belief congruence. 


CRITIQUE AND NOTES 
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ERRATUM 


On page 145 of the article “Identifying Jewish Names,” by P. F. Secord and Ellen Saumer 


(J. abnorm. soc. Psychol., 61, 144-145), a mean response bias score of 5.17 for high and low — 
prejudice army personnel is erroneously reported as not significant at the .05 level, This mean _ 
is significant at the .01 level, a result which is consistent with Previous studies of response bias . 


| 


in the identification of Jews from photographs. 
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“CREDULOUS” AND “SKEPTICAL” VIEWS 
OF HYPNOTIC PHENOMENA: 


EXPERIMENTS ON ESTHESIA, HALLUCINATION, AND DELUSION! 


J. P. SUTCLIFFE? 


of hypnosis so long as terms like hallu- 
cination, delusion, and amnesia are used 
in a purely descriptive sense. Concerning the 
interpretation to be put upon the hypnotic 
subject’s behavior and testimony about his 
subjective experiences—that is, concerning 
the nature of hypnotic phenomena—there is 
traditional disagreement. 

Some assume a priori a distinction between 
trance behavior and simulation, and accept 
in good faith the subject’s testimony concern- 
ing the occurrence of “genuine” hypnotic 
_ phenomena. They believe that hypnosis can 

produce aberrations in any given mental or 
_ behavioral function and transcendence of 
~ normal capacity. They assume that hypnotic 
- suggestions are substitutable for real stimuli as 
conditions of perception, and consequently 
that hypnotic fantasy is akin in its sensory 
content to perception produced by parallel real 
“stimuli. Erickson (1952) is representative of 
this “credulous” view. 

The “skeptics” do not accept hypnotic be- 
havior and testimony at face value, as no 
“unequivocal criterion for distinguishing trance 
behavior and simulation has yet been found. 
Furthermore they note that at least some 
hypnotic subjects adopt devious and mislead- 
ing means of achieving suggested effects and 
-misreport their actual experience (Pattie, 
_ 1933). They leave it an open question whether 
any given trance performance may not be 
matched in a nontrance state. They accede 
that an hypnotic subject may act as if the 
world were as suggested, taking his cues for a 
_ successful performance from the hypnotist’s 
explicit and implicit directions (Sarbin, 1950; 
White, 1941) and from the “demand charac- 

1 Arran; ion and publication of 
this ieee E io Etag E eera tenure of 
a Rockfeller Foundation Fellowship in Social Science. 

2 For their help in various ways, I wish to express my 
sincere appreciation to A. G. Hammer, R. W. White, 
Martin T. Orne, and M. Brewster Smith. 
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teristics” of: the situation (Orne, 1959); and 
they allow that there may be differences of 
subjective experience in trance and nontrance 
states; but they doubt that hypnotic fantasies 
have the same sensory content as parallel 
perceptions of real stimuli. 

The main points at issue may be summarized 
in two questions of fact: Is hypnotic behavior 
like voluntary behavior? Is it akin to an actor’s 
performance of a role? or Is hypnotic behavior 
relative to a “subjective reality” the sensory 
content of which is identical in form with that 
produced by the stimuli of a parallel objective 
reality? These questions may be answered by 
comparing hypnotic behavior with normal 
waking behavior and with waking acting or 
simulation. 

For experimental study the two viewpoints 
need to be formulated in relation to specific 
hypnotic phenomena. A major difference be- 
tween them concerns the subject’s perceptions, 
so attention can reasonably be restricted to the 
cognitive phenomena of hypnosis. These may 
be classified in pairs such as paresthesia and 
anesthesia, positive and negative hallucination, 
and the like, according to whether the subject 
is required to produce or inhibit an experience. 
Tt then remains to select given modalities of 
sensory experience. For illustration, the two 
views are stated in relation to hypnotic anes- 
thesia and paresthesia for painful stimulation. 

The credulous view is that with hypnotic 
suggestion of anesthesia the subject has no 
more sensory experience in the presence of 
painful stimulation than he would have in the 
absence of such stimulation. Similarly, with 
hypnotic suggestion of paresthesia the subject 
has sensory experience in the absence of pain- 
ful stimulation exactly as in the case where 
such stimulation is present. On this view, 
hypnotic suggestions are substitutable for real 
stimuli as conditions of sensory experience. 

The skeptic doubts that painful sensory 
content is eliminated by suggestions of hyp- 
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notic anesthesia and that it is produced in 
paresthesia, He holds that the subject may 
agree with the hypnotist’s suggestions in acting 
as if the suggested state of affairs were so. 
There are two possibilities, however, concern- 
ing the subject’s perceptions: 

1. Simulation. Although the subject com- 
Plies with the suggestions he knows to the 
contrary that he is or is not being subjected 
to painful stimulation. Typically he misreports 
his true experience. If this be so, the central 
problem of hypnosis concerns the subject’s 
motivation for misreport as a function of the 
hypnotic relationship and its procedures. In 
other respects no distinctions would be drawn 
between hypnotic behavior, voluntary be- 
havior, and role playing. 

2. Delusion. In complying with the hyp- 
notist’s suggestions the subject in some sense 
does not know that the real state of affairs is 
otherwise than suggested. If the subject is 
receiving painful stimulation there can be no 
Passive ignorance, and “not knowing” must 
mean an active denial of reality. This version 
of the skeptical view which allows that the 
hypnotized subject may be deluded about some 
features of the trance situation has affinities 
with the credulous view. It may, however, be 

' held independently of those aspects of the 
credulous view which concern sensory content. 

Methodological weaknesses render equivocal 
much of the available, clinical and experi- 
mental evidence on these issues (see Sutcliffe, 
1960). Evidence alleged to support the credu- 
lous view can be shown to be consistent with 
the skeptical view, and most points at issue 
Temain unresolved. A necessary albeit insuffi- 
cient condition for resolution is improvement 
in the design of experiments on hypnosis. This 
Paper reports a series of experiments conducted 
in awareness of these difficulties: a discussion 
of the experimental design is followed by 
descriptions of experiments on esthesia, hallu- 
cination, and delusion that bear on the credu- 
lous and skeptical views. 


DESIGN or ExPerments 
General requirements for the design of 
experiments on hypnosis are discussed else- 


*Orne’s (1959) work is a notable exception. It was 
not published, however, at the time the studies reported 
in this paper were conducted. 
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where (Sutcliffe, 1958, 1960). The basic de 
is set out in Table 1. The major comparisons 
bear upon the questions of fact. The contro 
conditions constitute the “parallel objectiy 
reality” of the second question; while s gges- 
tions given in hypnotic trance, on the credu 
lous view, set up a “subjective reality.” These 
are referred to, respectively, as the C and the 
HIS conditions. Comparison of behaviors 
under these two conditions bears upon the 
second question of fact and thus enables a test 
of the credulous view. The two hypnotic 
conditions may be compared to distinguish 
the effects of hypnotic induction, HI, per se 
from the effects of experimental suggestions 
given to the subject following upon the trance 
induction procedures, HIS. In the WA condi 
tions the subjects are required to act in the 
waking state as if conditions were as suggested 
by the experimenter; and Comparison of this 
behavior with that found in the other condi- 
tions enables a test of the skeptical view. 
The presence and absence of stimuli refer 
to the nature of the real situation. Fantasy 
adds something that does not exist or affirms 
something false, or it takes away something 
that does exist or denies something true. One 
may thus denote absence in a real situation as 
the positive case, and presence in a real situa- 
tion as the negative case. For example, C+ 
refers to the control condition with the experi- 
mental stimuli absent. To test the credulous 
view, one compares behavior under positive 
hypnotic fantasy, HIS-++, with behavior under 
negative control conditions, C—: in the latter 
the stimulus is present, in the former the 
stimulus is actually absent but the hypnotized 


TABLE 1 
Basic DESIGN FOR EXPERIMENTS ON HYPNOSIS 


Hypnotizability: 
Be e E 
Major Comparisons Stimuli Non- 
bule | pule 
Control (C) Present | C— c3 
Absent | C+ | C+ 
Hypnotic induction | Present | HI— | HI- 
Absent | HI+ | HI+ 
Hypnotic induction | Present | HIS— | HIS— 
and suggestion | Absent | HIS+ | HIS+ 
Acting when awake | Present | WA— | WA- 
(WA) Absent | WA+ | WA+ 
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subject has to fantasy its presence. Conversely, 
one compares negative hypnotic fantasy with 
positive control behavior. If behaviors are 
alike in these comparisons the credulous view is 
confirmed; but if they are unlike, and especially 
if HIS+ is like C+, and HIS— is like C—, 
real stimuli are dominant and the skeptical 
view is confirmed. For these various compari- 
sons to have any force, it is of course necessary 
that behaviors differ in the presence and 
absence of the control stimuli. 

The experimental comparisons are made for 
both somnambulistic and nonsomnambulistic 
subjects. This provision rules out the possi- 
bility that failure to confirm the credulous 
view in certain cases is due to the use of in- 
susceptible subjects, The complete design thus 
entails eight distinct experimental conditions 
replicated for two types of subjects. No one 
subject performs under more than one condi- 
tion in a given experiment, that is, an experi- 
ment entails 16 independent groups of subjects. 

Experiments on esthesia, hallucination, and 
delusion were conducted in conformity with 
this basic design. Each experiment entailed 
three main observational stages: pretesting for 
all three spanned 2 months and preceded 
major experimentation for all three which 
spanned 3 months before final subsidiary 
experimentation. The three experiments were 
thus conducted at the same time. They drew 
upon a common pool of subjects, but these were 
allocated among the experimental groups in 
different ways from experiment to experiment. 
A typical subject, for instance, could be a 
control in one case, a hypnotic subject in 
another, and an actor in the third. 

Sampling and Allocation of Subjects 

The experimental subjects were drawn from 
undergraduate psychology classes at the Uni- 
versity of Sydney. Such students are highly 
selected from the general population in age, 
ability, and education. For this group women 
tend to outnumber men, and interests tend 
more to the humanistic than to the natural 
scientific. 

For the experiments, two pools of subjects 
were to be obtained from this population: 
somnambules and nonsomnambules. These 
were obtained by selection from volunteers. 
The major criterion for selection was reaction 
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to hypnosis. Subjects were handled individ- 
ually, the initial session spanning 1.5 to 2 
hours. Adapted from Bernheim (1890), Davis 
and Husband (1931), Friedlander and Sarbin 
(1938), White (1941), and Erickson (1952), 
the particular criteria were as follows: for 
somnambules manifestations on the first occur- 
rence of hypnotic trance and at subsequent 
sessions of complete spontaneous posthypnotic 
amnesia; manifestations on the first or at 
subsequent sessions of “convincing” esthesias, 
hallucinations, and delusions, and of the ability 
to open the eyes and converse without coming 
out of trance; positive motivation; and for 
nonsomnambules reality contact and self- 
awareness throughout all hypnotic procedures 
with complete absence of spontaneous and 
suggested amnesia; compliant or willed rather 
than compulsive or involuntary reactions to 
suggestion; positive motivation. 

Erickson’s (1952) claim that success in 
hypnotic experimentation cannot be expected 
unless subjects are “trained” for deep trance is 
contentious; but to ensure that conditions were 
not unfavorable for a test of the credulous view, 
all somnambules were trained by the methods 
he describes to ensure clear-cut clinical mani- 
festations of hypnotic esthesia, hallucination, 
and delusion prior to the beginning of experi- 
mentation. There was a minimum interval of 2 
months between selection and training on the 
one hand and participation in an experiment 
on the other. During the experiments proper 
which involved hypnosis, a subject was run 
through a trance induction procedure that took 
30 minutes. For a subject chosen for the HIS 
conditions, this was followed by an additional 
15 minutes of hypnotic procedure for the 
production of esthesia, hallucination, or delu- 
sion as required (see Sutcliffe, 1958, pp. 105- 
108, 219-249, for details). å 

Of 487 students approached, hypnotic proce- 
dures were applied to 156 volunteers; and of 
these, 108 were rejected as recalcitrant or as 
intermediate or uncertain in their degree of 
hypnotizability. Selection continued as sub- 
jects came to hand until 24 somnambules and 
24 nonsomnambules had been found, each 
pool comprising 12 men and 12 women. There 
were no significant differences in age for these 
four subgroups, and the overall mean was 22.9 
years with standard deviation 3.2, For a given 
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type the incidence out of 156 is approximately 
16%, which agrees with the statistics compiled 
by Bramwell (1906). Thus although the specific 
aim in selection was to obtain given numbers 
of subjects, it happened that the two extreme 
types of subjects occurred at about the same 
tate as they do for the general population 
insofar as that can be determined from in- 
adequate statistics; and so the experimental 
universe is not atypical. Experimental results 
obtained from the samples have generality in 
that the latter are at least representative of 
the extremes of hypnotizability for young 
adult men and women at the university. 

The primary concern in testing the credu- 
lous and skeptical views is to compare the 
control, hypnotic, and acting conditions; 
studies of the effects of trance per se, of indi- 
vidual differences and reliability are subsid- 
jary. Attention might thus be restricted to 
methods of allocating subjects to the six 
experimental conditions: C+, C—, HIS+, 
HIS—, WA+, and WA—. For the experiment 
on esthesia, six independent random samples 
of four subjects were drawn separately from 
each of the somnambule and nonsomnambule 
pools, and one of the six experimental condi- 
tions was assigned at random to each group. 
With the other two experiments a matching 
procedure was used to minimize intersubject 
variation which, in samples as small as four, 
could blur group differences. To achieve this 
matching, subjects were listed in rank order 
on a relevant pretest, the parallel form of 
which was to be used as the dependent varia- 
ble in an experiment. Twenty-four somnam- 
bules could thus be subdivided into six sub- 
groups each containing four serially ordered 
subjects. The same procedure was applied to 
the nonsomnambules. Although the common 
pool of 48 subjects was used for all three 
experiments the experimental allocations for a 
given subject varied from experiment to experi- 
ment. No subject was aware of his group 
allocation for any given experiment, and his 
Participation could give him knowledge of 
only 1 of 12 conditions in a given experiment. 
The experimental testing was embedded in a 
context of other testing procedures; and in 
these various ways all subjects were kept 
ignorant of the overall design of the experi- 
ment and of the experimenter’s aims. 
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EXPERMENT I: ESTHESIA 


Esthesia was chosen for investigation as 
is regularly cited in support of the credi 
view. Most commonly, hypnotic anesth 
has been studied; here paresthesia is 
considered. Most studies have used some 
of a probe as a pain stimulus; here ele 
shock is used. The subject’s testimony, gr 
motor movements, and GSR to the shock 
stimuli are observed. 


Method 


the left hand. A switch in the latter circuit enabled 
experimenter to deliver either a buzz without s 
or buzz and shock simultaneously. P 
Pre-experimental testing. The subjects were given 
series of shocks with the apparatus described. 
provided them with experience necessary for subseqi 
paresthesia if it were to occur. Also it enabled the 
perimenter to determine a fixed level of electric s 
that produced pain in all subjects. 
Experimental procedures. The experiment was 
ducted in a sound-dampened room. The subject sai 
a small table placed in a corner so that ahead and 
the right he faced bare walls, while to the left and 
hind he was bounded by a wooden screen. The ex] 
menter sat behind the subject, or moved out wh 
necessary to control the apparatus concealed by tl 
screen. The table was bare and only the electrodes 
visible. A fluorescent source provided constant ill 
nation, The only stimuli of a variable character 
the experimenter’s instructions, and the buzz or b 7 
shock of the apparatus as delivered by the experimel 
ter. ib 


The particular experimental conditions were 
follows: 
C+ The subject was linked to both the shock 
GSR apparatus but received no shocks, Wher 
the stimulus key was depressed by the e 
perimenter, buzz alone occurred. The only in 
structions given to the subject were that h 
should remain passive and refrain from g 
motor movements, 
C- This condition was identical with C+ excep 
that when the experimenter depressed 
stimulus key, the subject received an ele 
shock simultaneously with hearing the buzz. 
HIS* The objective conditions were those of C+. In 
addition, the subject was induced into di 
trance and paresthesia for shock suggested, th 
“shock” to be felt simultaneously with heari 
the buzz. , 
HIS- The objective conditions were of C-. In 
tion, the subject was induced into deep ti 
and anesthesia was suggested for the left 
WA* With the objective conditions of C+ the s 
ject was asked to recall a circumstance 
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which, though not in pain, he had behaved as 
if he were, specifically, touching an electric 
jron to see whether it was hot; and the subject 
was then asked to respond at the buzz as if he 
had received a painful electric shock of the 
magnitude previously experienced. 

WA- With the objective conditions of C~ the sub- 
ject was asked to recall a circumstance in 
which, though in pain, he had behaved as if he 
were not; specifically, having fallen over as a 
child when playing with other children; and 
the subject was then asked to respond at the 
buzz as if there were no shock. 

In each case, the subject received four stimuli at 
intervals of approximately 1.5 minutes. In the intervals 
the experimenter recorded the subject’s GSR and the 
base level of skin resistance prior to giving the next 
stimulus, and he repeated crucial instructions pertain- 
ing to the experimental condition. Hypnotized subjects 
were specifically questioned about their experiences. 
Movements and comments from subjects provided 
qualitative data to supplement the quantitative GSR 
records, The room was sound dampened, the experi- 
menter worked quietly, the subject remained passive, 
thus irrelevant stimuli to the GSR were absent, and 
the GSR can be taken as indicative of the subject’s 
reaction to the experimental condition. 


Predictions 


On the credulous view, one would expect the GSR 
for hypnotic paresthesia to be the same as the GSR for 
the C~ condition: that is, fantasied shock should be 
equivalent to experience of real shock. Also, GSR for 
hypnotic anesthesia should not differ from GSR in the 
C* condition if fantasied absence of shock is equivalent 
to actual absence. At the same time it would be ex- 
pected that waking subjects cannot imitate the hypnotic 
performance. Furthermore, while nonsomnambules may 
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be like somnambules in the control and acting condi- 
tions they should differ from them in the hypnotic 
conditions. Complete failure of these outcomes would 
confirm the skeptical view. 


Results 


Qualitative data are reported in Tables 2 
and 3 for the eight somnambules who were 
tested under the HIS+ and HIS— conditions. 
These describe movements and comments 
made by the subject as trance reactions and 
in a comparable waking condition. These data 
show that as far as overt signs and the sub- 
ject’s testimony go, hypnotic paresthesia and 
anesthesia were invariably experienced. 

GSR data are presented as log conductance 
change (after Haggard, 1949). If R is a base 
level of skin resistance immediately prior to 
stimulation, and R’ is level of resistance at 
the peak of reaction to the stimulus, the 
measure of reaction used is (40 + 10 log— 
1/R’ — 1/R). This scaling with log zero 
denoted bar four gives a range of values from 
0 (mo reaction) to 30, the largest GSR for 
these data. Mean values for the various con- 
ditions are set out in Table 4. 

The data were tested for significance by F 
and ¢ tests via the analysis of variance, and 
the main findings were: Measurement of the 
GSR was reliable, intersubject variation being 
significantly (p < .001) greater than intra- 
subject variation, There was a significant 


TABLE 2 } 
QUALITATIVE OBSERVATIONS ON REACTIONS TO ReaL Smock (Pre-experimental) AND 
TO Hypnotic PARESTHESIA OF SHOCK 


Subject’s Waking Reactions to Real Shock 


Subjects Subject’s Reactions with Paresthesia 
1 Flinches, grimaces, moves hand. In reply to Most apprehensive of apparatus; needs reas- 
questions about what he felt, the subject surance of low dc amps before proceeding; 
says dislikes shocks and would prefer that shakes electrodes off fingers at each sl . 
no more be given. Needs reassurance to con- na to be fee to replace electrodes 
tinue. and to continue. 

2 No gross movements. When questioned, the No gross movements a] art from fn ane 

subject reports that he feels strong shocks. tion but says with vehemence: “Wow 
ve me a belt. : He 
„ “I know it’s 


3 Minor hand movement, facial grimace. Says 
first shock was only mild. Says that subse- 
quent shocks were getting stronger and not 


to increase the strength further. 


A Movement of hand; otherwise no signs. When 
questioned says she feels shocks but can take 


them if she has to. 


Reluctant to participate, sa; 
silly but I hate shocks.” Needs much reas- 
surance before proceeding. Exclaims loudly 
and shakes hand at each shock. Strong an- 
ticipatory avoidance both in hand move- 
ae A etree finger movement, 

i and, aj rom 10" 4 

SS aan "ofp ain. When questioned the 
subject reports that shocks were strong, can 
take them, but would not like them any 


stronger. 
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TABLE 3 


QUALITATIVE OBSERVATIONS ON REACTIONS TO ACTUAL ABSENCE OF SHOCK 
(Post-experimental) AND TO Hypnotic ANESTHESIA TO SHOCK 


Subjects 


1 


Subject’s Reactions with Anesthesia 


Subject’s Waking Reactions Without Shock 


Slight hand movement but reports feeling 
nothing. Visible increase in respiration sub- 
sequent to shock. 

Stuporous passivity; no movement in reaction 
to shock. When questioned, has to be urged 
to reply and mumbles that he feels nothing: 
“Hand is stone.” 

Stuporous passivity; no movement apart from 
hand jerk to shock. When questioned, de- 
nies feeling anything. 


Whole body jumped. When questioned the 
subject said, “I don’t feel anything, but she 
seems uncomfortable.” Subsequent move- 
ment restricted to fingers, but denies any 


Keyed up ordog shock; exhibits startle at 
“buzz” and relaxes when he finds he is not 
shocked; thereafter passive. 

Apprehensive of shock; but reassured when 
only “buzz” occurs. Startled at first stimu- 
lus, but thereafter passive. 


Says, “I hope there won’t be any shocks,” 

me startle in response to “buzz,” says, 
“That didn’t hurt” and laughs. Subse- 
quently passive. 

Matter of fact approach to the situation; no 
outward reactions. After second stimulus 
asks whether she should be receiving shocks, 
Accepts avoidance of reply with equanim- 


feeling. 


ity. 


TABLE 4 
MEAN CHANGES IN GSR EXPRESSED AS 
Loc CONDUCTANCE CHANGE 


Somnambule Nonsomnambule 


Nonshock Shock 
19.7 
18.9 
18.5 


(p < .0001) overall mean difference of 8 
reaction units between the shocked and the 
nonshocked subjects, with all shock groups 
averaging about 18 to 19 while the nonshock 
groups differed widely. In the nonshock condi- 
tions, hypnotic subjects gave reactions con- 
sistent with the absence of shock, whereas the 
actors gave greater reactions approaching in 
magnitude those given by shocked subjects. 
That is, neither hypnotic anesthesia nor wak- 
ing acting was able to inhibit shock reaction 
and in this they were alike; while acting could 
produce a shock-like reaction and hypnotic 
paresthesia did not and in this they were not 
alike. In reaction through GSR the somnam- 
bules and nonsomnambules were similar. In 
the comparisons that bear most directly upon 
the credulous and skeptical views, HIS+ was 
much more like C+ than like C—, and HIS— 
was much more like C— than like C+, and 
in this the data do not support the credulous 
view. 

To check on the effects of trance induction 
per se, the eight somnambules of the C+ and 


C— groups were subsequently hypnotized, 
and with this difference the previous control 
procedure was repeated. The two groups, 
which averaged 3.3 and 18.1 in the waking 
state, average 3.5 and 17.4, respectively, in the 
trance state, so that the sheer fact of being in 
trance did not affect the GSR. 

In case intersubject variability contributed 
to the nonsignificance of certain differences, 
given conditions were compared using each 
subject as his own control. The 16 somnam- 
bules comprising the HIS+, HIS—, WA-+, 
and WA— groups were retested under the 
C+ and C— conditions; and thus a given 
subject’s experimental performance could be 
compared with his normal waking reactions 
in the presence and absence of shock. These 
results fully confirmed those obtained with 
the independent group comparisons. 


Conclusions 

The credulous view that hypnotic suggestion 
is substitutable for real states of affairs as 
conditions of sensory content is clearly refuted 
by the quantitative data. The real state of 
affairs, shock or its absence, is the determinant 
of reaction even in the face of contradictory 
fantasy. It is quite clear that hypnotic sug- 
gestion does not produce “anesthesia” if by 
this is meant “absence of GSR to electric 
shock”; nor does it produce “paresthesia” if 
by this is meant “occurrence of shock-ap- 
propriate GSR without electric shock.” On 
the other hand, problems remain when these 


“CREDULOUS” AND “SKEPTICAL” Views oF Hypnotic PHENOMENA 


findings are considered in relation to the 
subject’s testimony. Hypnotically anesthetic 
subjects report that they feel no pain when 
given shocks known otherwise to be painful 
and when their bodies are definitely reacting 
to the stimuli. Hypnotically paresthetic sub- 
jects report that they feel pain despite the 
absence of shock and bodily reaction. As re- 
ported, subjective experience is at variance 
with bodily reaction. 


EXPERIMENT II; HALLUCINATION 


To investigate hypnotic hallucination—cen- 
tral to the credulous view—the auditory 
modality was chosen. Subjects were required 
to hallucinate, positively or negatively, sounds 
known to have a measurably distracting 
effect upon performance. This would provide 
an objective check upon the subject’s testi- 
mony about hallucination. 


Method 


Apparatus. Tt is known that delayed feedback of 
speech can disrupt and slow down speech (Spilka, 19545 
Tiffany & Harley, 1952) and this fact was utilized in 
the experiment. The method used to produce the effect 
was to have the subject speak into a microphone at- 
tached to a multihead magnetic tape recorder arranged 
so that the input was fed into the subject’s earphones 
after a delay of .27 seconds. (The reported optimal delay 
of .20 seconds could not be set up as the machine avail- 
able provided only for delays of .07, .13, and .27 sec- 
onds,) The subject’s vocalizing was an integral part of 
his test performance which then became subject to 
distraction effects. 

The test performance required the subject to say 
aloud each step and the answer to simple addition 
problems. An item comprised the addition of three 
two-place numbers; there were 20 items to a trial, and 
five trials to a form. The form was prepared as a printed 
sheet of five rows of addition items to be presented to 
the subject for working and recording his answers. A 
50-second time limit was set for each trial, and within 
this time the average subject could complete 10 items 
correctly. Parallel Forms A and B were prepared with 
constant mean and variance and estimated reliability 
greater than .8. 

In preliminary standardization of the delay tech- 
nique, there were observed progressive adaptation ef- 
fects with practice and individual differences in magni- 
tude of distraction effect positively correlated with 
speech rate. By comparison with normal average per- 
formance on the addition test, however, subjects per- 
formed significantly less well with delayed speech 
feedback. 

Preliminary testing. Subjects were tested on Form B 
of the addition test under normal conditions. This 
provided a criterion for matching subjects for the ex- 
periment to follow; it familiarized the subjects with the 


195 


test procedure so that detailed explanations could sub- 
sequently be avoided, especially in the experimental 
hypnotic sessions; it provided baseline data for assess- 
ing the effects of trance per se on test performance; 
and it provided sensory experience for subsequent 
hallucination should it occur. 

Experimental procedures. The experimental setting 
was similar to that described for the experiment on 
esthesia. The table was bare except for the test form 
and pencil which the experimenter provided when the 
test was to begin. Whether subjected to delayed feed- 
back or not, every subject wore earphones fitted tightly 
to minimize normal air conduction of sound, The ex- 
perimenter’s instructions and the subject’s vocalizings 
were routed to the earphones through a microphone 
located to the left front of the subject at a maximum 
distance of 18 inches from his mouth. 

The particular experimental conditions were as fol- 
lows: 

C+ In the waking state the subject performed on 
Form A of the addition test. He wore the ear- 
phones and the recorder was set to produce 
zero delay. The subject did not speak aloud 
but mouthed words exactly as when speaking 
except for the absence of sound. 

C- Tn the waking state the subject performed on 
Form A. He wore the earphones and the re- 
corder was set to provide .27 seconds feedback 
delay, The subject spoke aloud as he worked 
at the test and suffered distraction. 

HIS+ The objective conditions were those for Ch 
In addition, the subject was induced into deep 
trance and positive hallucination for his own 
voice was suggested. The experimenter illus- 
trated the interval of delay at which the sub- 
ject was to hear his voice whenever he mouthed 
words, and the procedure was repeated until 
correct. 

HIS- The objective conditions were those for ©. 
In addition, the subject was induced into deep 
trance and negative hallucination for his own 
voice was suggested. 

WA+ The objective conditions were those for Ct. In 
addition, the subject was asked to recall a 
circumstance where he behaved as if he had 
heard a noise when there was in fact none, 
specifically, querying a person who appeared 
to have spoken; the experimenter illustrated 
the interval of delay at which the subject was 
to imagine hearing his voice whenever he 
mouthed words, repeating the procedure until 
correct; and the subject was then asked to per- 
form the task as if, while mouthing words, he 
were actually hearing his own voice slightly 
delayed. 

WA- The objective conditions were those for C~. 
In addition, the subject was asked to recall a 
circumstance where he behaved as if he had 
not heard some sound, specifically, by ignoring 
a call when engrossed in an activity; and the 
subject was then asked to perform the task as 
if he could not hear his own voice. r 

In each case, the subject performed five trials at 
intervals of approximately 30 seconds, Crucial instruc- 
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tions were repeated between trials. Qualitative observa- 
tions were restricted to the subject’s testimony con- 
cerning the presence of hallucination. Quantitative data 
comprise the five trial scores for each subject. 


Predictions 


On the credulous view one would expect the per- 
formance of subjects with positive auditory hallucina- 
tion to be like that for the negative control condition; 
that is, fantasied distraction is supposedly like real dis- 
traction. Also, performance under HIS- conditions 
should be like that for the positive controls if negative 
auditory hallucination does in fact offset actual delayed 
feedback distraction. At the same time it would be 
expected that waking actors cannot imitate these per- 
formances, Somnambules would be expected to differ 
from nonsomnambules under HIS conditions. Complete 
failure of these outcomes would confirm the skeptical 
view. 


Results 


The qualitative data for the hypnotic condi- 
tions were as follows. With the somnambules, 
testing was not begun until appropriate hal- 
lucination was reported; affirmative testimony 
was given in all cases. With the nonsomnam- 
bules, in no case did a subject accept the 
suggestions. 

The addition test data are presented as 
number of correct items per trial with a pos- 
sible range from 0 to 20. Mean values for the 
various groups are set out in Table 5. The 
results for the 12 groups on the Form B pre- 
test under normal conditions are set out on the 
left-hand side; and their results on Form A 
under experimental conditions are set out on 
the right. These data were tested for signifi- 
cance via analysis of variance. There is a 
small but significant pretest difference in 
performance between the somnambules and 
nonsomnambules which does not affect the 


TABLE 5 
MEAN PERFORMANCE ON ADDITION 
Test Forms A AND B 


Form B, Pretest | Form A, Experimental 

- | Non- | Somnam- |Nonsomnam- 
Sopnam- | somnam- | `” bule bule 

bule | Distraction | Distraction 


Normal Testing ‘Ab- y ~ | Pres- 
Cond dona dar ale ai e 
Control |10.8/11.0| 9.2| 9.0/10.8| 8.2 | 9.3| 7.3 
Hypnosis |11.0/10.5| 9.2| 9.5] 8.5| 7.0 | 9.6| 7.4 
Acting 10.2/11.5| 9.2| 9.2/10.0| 9.0 | 9.1| 7.0 
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main outcomes of the experiment proper. 
Test performance is reliable, intersubject 
variation being significantly (p < -001) 
greater than intrasubject variation. The only 
significant effect for the experimental data 
( < .05) is the overall mean difference of 
approximately two items per trial between 
presence and absence of delayed speech feed- 
back distraction. The data for the nonsom- 
nambules are accounted for in this way: in the 
absence of distraction the pretest and experi- 
mental performances are the same; in the 
presence of distraction performance drops from 
the pretest level by an average of approxi- 
mately two items per trial. The data for the 
somnambules in the HIS conditions cannot be 
explained simply in terms of distraction, and 
account must be taken of the effects of hyp- 
nosis per se. 

The 16 somnambules of the C+, C—, WA+, 
and WA— groups were retested with Form B 
when later induced into trance. Comparison 
of individual results for the two testings re- 
vealed uniformly poorer performance in trance, 
the average drop exceeding two items per 
trial. Whether lethargic performance is an 
intrinsic feature of deep trance or a specific 
effect of the sleep suggestion procedure for 
inducing trance in these experiments is not 
established. The drop due to trance lethargy 
must be allowed for in this instance; together 
with the effects of delayed feedback distrac- 
tion it accounts for the results of the somnam- 
bules in the HIS groups. The drop from 11.0 
to 8.5 for the HIS+ group is explicable in 
terms of trance lethargy rather than positive 
hallucination of distraction; while a combina- 
tion of trance lethargy and actual distraction 
can account for the drop from 10.5 to 7.0 
for the HIS— group. 

The eight somnambules of the HIS+ and 
HIS— groups were retested with Form A 
when in trance but without suggestions of 
hallucination. This procedure enabled com- 
parison for a given subject of performance of 
the addition test under the three conditions: 
waking, trance, and trance plus experimental 
suggestions. These data, with intersubject 
variability controlled, corroborated the ex- 
planation of the results for the independent 
groups in terms of trance lethargy and speech 
feedback distraction. 
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Conclusions 

Waking acting and hypnotic “hallucination” 
are alike in this case in that they are equally 
unsuccessful in opposing the effects of real 
stimuli. As with the experiment on esthesia, 
it is the real state of affairs rather than the 
subjective experience that determines the 
subject’s objective behavior. Hypnotic sug- 
gestion does not produce positive hallucina- 
tion if by this is meant sensory content com- 
parable to that produced by actual speech 
feedback delay; neither does it produce nega- 
tive hallucination if by this is meant elimina- 
tion of distraction effects. The credulous view 
is clearly refuted by these data. There remains, 
however, the puzzling discrepancy between 
the subject’s reports of hallucination and the 
evidence of their objective performance. 


EXPERMENT III: DELUSION 


As a major category of fantasy, delusion 
recommends itself for study; but there is a 
dearth of experimental evidence. Delusion is 
typically defined as a false belief resistant to 
modification by contrary evidence. Thus a 
prerequisite for study is the choice of a propo- 
sition for belief. Of the infinite number avail- 
able it is convenient to choose one the subject 
of which is the person or his ego: then the 
delusion has immediate and wide implications 
for the subject’s behavior. Thus something is 
to be falsely predicated of the subject’s ego. 
In terms of the design, the proposition must 
take both a positive and a negative form. The 
predicate must thus be such that it can be 
truthfully affirmed of one group and falsely 
affirmed of another, and at the same time, it 
can be truthfully denied of one group and 
falsely denied of another. As the experimental 
groups to be used contained equal numbers of 
men and women, “sex” was a predicate that 
fulfilled all of these conditions. The logical 
form of the positive and negative propositions 
for men and women is given as: 


MAN WOMAN 


TRUE + You area man You are a woman 
— Youare nota woman You are not a man 
FALSE + You area woman You are a man 


You are not a man You are not a woman 


„The positive and negative conditions are 
different in that the statements that form a 
Positive-negative pair are not equivalent. The 


positive statement is unambiguous, but the 
negative allows several interpretations: for 
example, “You are not a man” need not 
imply “You are a woman”; the subject’s 
interpretation might be “I am a child.” These 
direct statements were not used; the intention 
of the logical form was achieved by more 
elaborate suggestions. The details are given in 
Sutcliffe (1958). 

The experimental test of the credulous view 
of delusion involves the subject in behavior 
relative to a belief about his sex. Of the many 
ways of behaving characteristic of the sexes, 
moral and ethical considerations ruled out 
direct observation of the more obvious genital 
behaviors. Interrogation relative to “hypo- 
thetical situations” might have yielded infor- 
mation, but this would have been limited to 
the subject’s testimony. Accordingly attitudes 
and interests characteristic of the sexes were 
investigated. 


Method 


Materials. Two parallel Forms A and B, each of 92 
items, were prepared from the Emotions and Interests 
sections of the Terman and Miles (1936) Attitude- 
Interest Inventory. An item is scored +1 for a masculine 
response, ~1 for a feminine response, and 0 for a neutral 
response by reference to the Terman-Miles scoring key. 
‘The algebraic sum of the weights gives a subject’s score 
for the form. The correlation between the forms is .81 
under normal conditions, The average score for women 
on the forms is just less than 0, SD = 13, The average 
for men is about 18 points higher, SD = 10. There is 
little overlap of the distributions for men and women; 
but it is known from the Terman-Miles study that a 
subject can, upon request, make answers conform or 
overconform to those of the opposite sex. 

Preliminary testing. All subjects were pretested on 
Forms A and B of the Attitude-Interest Inventory. 
This provided a criterion, Form B, for matching sub- 
jects in the experiment to follow, and a baseline, Form 
‘A, from which change in the experiment could be meas- 
ured. While subjects would be most familiar with their 
own sex roles, they would also have knowledge of op- 
posite sex roles, and accordingly no special experience 
had to be provided to enable subsequent delusion were 
it to occur. 

Experimental procedures. The setting was that de- 
scribed for the experiments on esthesia and hallucina- 
tion. The table was bare except for the test Form A 
and pencil which the experimenter provided at the 
appropriate time. When the subject had completed the 
inventory, the experimenter questioned the subject 
about his responses and sexual identity, and these data 
were recorded on tape. 

The particular experimental conditions were as 


follows: 4 
C+ Half the subjects were men and for them it was 
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simply true that they would fill out the in- 
ventory as men. The other half were women 
and would fill out the inventory as such. 
Each subject was required to enter name, age, 
and sex at the beginning of the test form 
before proceeding. 

C- Half the subjects were women and hence not 
men; while the other half were men and hence 
not women. The manner of filling out the in- 
ventory was as in the C+ condition. 

HIS+ The objective conditions were those of C+. In 
addition, the subject was induced into deep 
trance when a delusion about sex was suggested 
in line with the logical form: “You are a 
woman” for a man; and “You are a man” 
for a woman. 

HIS- The objective conditions were those of C~. In 
addition, the subject was induced into deep 
trance when a delusion about sex was suggested 
in line with the logical form: “You are not a 
man” for a man; and “You are not a woman” 
for a woman, 

WA? The objective conditions were those of C+, In 
addition, the subject was asked to recall a 
circumstance where he (she) played the part of 
the opposite sex; and the subject was then 
asked to fill out the inventory as if he were a 
woman (as if she were a man). 

WA- The objective conditions were those of C~, In 
addition, the subject was asked to recall a cir- 
cumstance in which he (she) had denied his 
(her) sex; and the subject was then asked to 
fill out the inventory as if he were not a man 
(as if she were not a woman). 

Predictions 

Whereas with the experiments on esthesia and hal- 
lucination it was not known in advance whether the 
subjects could perform the tasks set, in the experiment 
on delusion it was known that a waking subject can to 
some extent simulate opposite sex responses on the 
Attitude-Interest Inventory. Thus instead of asking 
whether an actor may do as well as a hypnotized sub- 
ject, one must ask whether they differ in any way. On 
the credulous view, a deluded subject should behave 
just as a subject who holds the same belief in truth. 
Thus not only would one expect a change in perform- 
ance with a suggested change in sex, but also it should 
be appropriate in degree. While alike in the content of 
their beliefs, however, the two groups may well differ 
in their reactions to interrogation and contrary evi- 
dence. Somnambules and nonsomnambules would be 
expected to differ primarily in the hypnotic conditions. 


Results 


The interrogation sought answers to two 
questions: what does the subject believe his 
sex to be? and how would he prove this belief? 
The details of the questioning and the answers 
given varied from case to case, but the results 
may be summarized as follows: 

All control subjects correctly recorded their 
sex, and when questioned about it they in- 
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variably cited biological sexual characteristics 
as proof. Less frequently cited in addition 
were the judgment of others, birth certificate, 
clothing and hair styles, and interests. Sub- 
jects showed that they regarded their sex as 
self-evident, and they were puzzled by the 
experimenter’s questioning of the obvious. 

All “acting” subjects correctly recorded 
their sex, and they cited supporting evidence 
of the same kind as that given by the controls, 
For all but one subject, a nonsomnambule, 
the act terminated with the completion of the 
inventory. This subject carried the act into the 
interrogation, but when pressed for evidence 
gave up the act and changed to an admission 
of his correct sex. In no case was the act taken 
as far as entering the incorrect sex at the head 
of the inventory. When asked about this, 
subjects gave the stock reply that this was 
identifying information which stood apart 
from the experimental context. Two nonsom- 
nambules and four somnambules reported 
difficulty with acting, and one of the latter 
said that she tried to give answers typical of 
neither men nor women. ; 

All nonsomnambules in the hypnotic condi- 
tions correctly recorded their sex and gave 
appropriate evidence in support. All reported 
difficulty in imagining the state of affairs 
suggested by the hypnotist, and two of the 
men took objection to the suggestion that 
they had become women and actively opposed 
it. 

The somnambules in the hypnotic condi- 
tions differ from the other subjects in several 
ways: While all 40 subjects so far considered 
correctly recorded their sex at the head of the 
form, only 2 of these remaining 8 subjects did 
so, one woman entered “male,” and 5 made 
no entry. The 4 subjects in the HIS+ group, 
though in deep trance, resisted the suggestion 
that they were of the opposite sex and passively 
but continuously reiterated their true sex. 
This behavior was unlike that of the controls 
in its repetition and failure to mention any 
evidence. While the 4 somnambule and 2 non- 
somnambule subjects of the HIS+ condition 
objected to the suggestion that they were of 
the opposite sex, the 4 somnambules of the 
HIS— group more readily accepted the denial 
of their sex. They made statements such as 
“T feel it quite strongly, I’m sure of it” and 
“I feel it’s just obvious”; and in this their 
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conviction about the assumed sexual identity 


was like the controls’ conviction about real 
sexual identity. They did not however cite any 
evidence. 

Quantitative data were analyzed for change 
as follows. The difference in score between pre- 
test and experiment on Form A was found 
for each subject. The average change for a 
control group was subtracted from each sub- 
ject’s change score. This correction was 
carried out separately for the somnambule and 
nonsomnambule groups as the two sets of 
control groups differed in retest variation. The 
difference in direction of change due to sex was 
corrected by denoting change scores positive 
if the change was in the suggested direction, 
and negative if the change was in the contra- 
suggested direction. It will be seen that this 
procedure sets the change scores for the control 
groups to zero. The mean values of change 
scores for the eight experimental groups are 
set out in Table 6. Differences between nega- 
tive and zero, and between zero and positive 
change are significant. 

Under normal conditions there is a mean 
difference of 18.6 between men and women on 
Form A. Thus for a given sex to successfully 
simulate the performance of the opposite sex 
group, it must change from its normal mean 
by 18.6. This provides a criterion for evaluat- 
ing the data of Table 6. The nonsomnambules 
in the two hypnotic conditions did not change 
at all, consistent with their insusceptibility 
to hypnosis. On the other hand the nonsom- 
nambules, when asked to simulate in the 
waking state, were able to change their per- 
formance and if anything they overacted the 
part. The somnambules in the HIS+ condi- 
tion changed somewhat but inappropriately; 
the overemphasis in test performance upon 
true sex was consistent with their subsequent 
reaction to interrogation. The somnambules in 
the HIS— and WA— conditions changed by 
about the predicted amount, while those in 


TABLE 6 
MEANS OF CHANGE SCORES ON THE ATTITUDE- 
Interest INVENTORY Form A FOR THE 
Hypnosis AND ACTING GROUPS 


Condition Somnambule Nonsomnambule 
HIS+ —9.5 0.8 
HIS— 18.8 0.3 
WA+ 26.8 27.0 
WA— 15.8 25.2 
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the WA-+ group overacted. Independent 
checks showed that trance per se does not 
affect performance on the inventory; and 
overall the results held for individual subjects 
as well as for the group averages. 
Conclusions 

Negative hypnotic delusion was success- 
fully suggested to the somnambules as indi- 
cated by their testimony and by the appro- 
priate mean change in performance. The WA— 
actors were as successful in changing their test 
performance, but their reactions to interroga- 
tion differed from those of the hypnotized 
subjects. In the HIS+ condition, both som- 
nambules and nonsomnambules found the 
suggestion that they were of the opposite 
sex objectionable; the suggestion was resisted, 
the experimental conditions for the test of the 
credulous view were not fulfilled, and hence 
this part of the data must be set aside. It is 
of interest to note that deep trance does not 
guarantee acceptance of suggestion in all 
cases. Where the experimental conditions were 
met, the quantitative data do not clearly 
differentiate between the credulous and skepti- 
cal views; the qualitative data support the 
view that in a number of cases hypnotized 
subjects were deluded about their sex. 


GENERAL SUMMARY AND CONCLUSIONS 


Evidence from the three experiments shows 
that: 

1. It was not possible by hypnosis to produce 
sensory conditions identical in form with 
those produced by parallel real stimuli. The 
GSR to real shock occurred despite hypnotic 
“anesthesia,” and there was no GSR in the 
absence of shock despite hypnotic “pares- 
thesia.” Decrement in performance due to 
actual delay of speech feedback occurred 
despite hypnotic “negative hallucination,” 
and no decrement occurred in the absence of 
delay despite hypnotic “positive hallucina- 
tion.” ‘ 

2, There is equivocal support for the view 
that hypnotic behavior is like waking acting. 
The two were alike in that neither was success- 
ful in inhibiting the effects of shock or distrac- 
tion, and a further similarity was found with 
the negative delusion about sex. The similar- 
ity in the first two cases should not be empha- 
sized as it depends upon a failure of inhibition 
where real states of affairs are dominant. In 
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the positive condition where production 
rather than inhibition was called for, differ- 
ences between hypnosis and acting did occur. 
In the absence of shock hypnotic subjects 
reported subjective experiences of pain but 
gave no GSR; while the actors with no sub- 
jective experience of pain overtly acted as if 
shocked and produced GSR. 

3. With the exception of the HIS+ group 
in the third experiment, all hypnotized sub- 
jects were convinced of the actuality of sug- 
gested states of affairs. They asserted that 
they were shocked when in fact there was no 
shock, or they were not shocked when in fact 
they were; they could hear their own voices 
when they were not in fact speaking aloud, or 
they could not hear their own voices when 
in fact they were speaking aloud; they were 
not men (not women) when they were in fact 
men (women). This testimony was uniformly 
at variance with the evidence of the subjects’ 
bodily states. 

These findings justify the following conclu- 
sions: 

1. The credulous view that hypnotic sug- 
gestions are substitutable for real states of 
affairs as conditions of sensory experience 
must be rejected. 

2, The skeptical view that hypnotic be- 
havior is like waking acting is true in some 
cases. The difference between hypnosis and 
acting might best be sought elsewhere than in 
the overt behavior, which is sometimes similar 
and sometimes dissimilar for the two depend- 
ing upon the task and whether inhibition or 
production is called for. The qualitative evi- 
dence suggests that differences might more 
fruitfully be sought in terms of perception and 
belief. 

3. Crucial evidence is lacking to decide be- 
tween the two versions of the skeptical view 
concerning the subject’s perceptions: that the 
hypnotic subject does not misperceive the real 
situation but misreports it; or that the hyp- 
notic subject misperceives, that is to say is 
deluded about, the real situation. The latter 
view gets some support from the different 
reactions of hypnotic, control, and acting 
subjects to interrogation. 

4. The distinguishing feature of hypnosis 
appears to be the subjective state; and the 
main feature of this state is the hypnotized 
subject’s emotional conviction that the world 
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is as suggested by the hypnotist, rather than a 
pseudoperception of the suggested world. 

The role of the hypnotic relationship and its 
procedures in the communication of delusory 
conviction remains to be clarified. An approach 
which at the same time may throw some light 
on individual differences in hypnotizability 
would be to examine the authority relationship 
and the conditions of credence in their develop- 
mental aspects with emphasis upon the parent- 
child relation. 
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PERFORMANCE ON COMPLEX REASONING TASKS 
AS A FUNCTION OF ANXIETY! 


CHARLES VAN BUSKIRK? 
University of Chicago 


good deal of recent research has been 

done concerning the effects of anxiety 
considered as an enduring personality 

trait upon performance in learning and prob- 
lem solving. Considerable evidence has shown 
that high anxious (HA) groups are inferior to 
low anxious (LA) groups in their performance 
on complex tasks. The findings have suggested 
that a general relationship exists between the 
complexity of the task and the magnitude of 
the decrement produced by anxiety. Farber 
(1955) suggested the following proposition: 
“ ..the quality of performance in complex 
learning situations is inversely related to Ss’ 
degree of anxiety, as measured by the Taylor 
scale, and furthermore, that the advantage of 
the nonanxious over the anxious Ss is positively 
related to the probable number and strength 
of the competing responses elicited” (p. 323). 
If a test, such as Taylor’s (1953) Manifest 
Anxiety (MA) scale, is assumed to provide a 
measure of general drive level, the interaction 
between anxiety and certain task charac- 
teristics is predicted by Hull’s hypothesis that 
response strength is a multiplicative function 
of habit strength and drive level. Poorer per- 
formance by the HA group is expected, for 
example, in either paired associate or serial 
learning tasks in which pre-existing associative 
telationships among the task elements lead to 
wrong responses, since the subjects with higher 
drive will take longer to extinguish their 
stronger erroneous response tendencies (Farber 
& Spence, 1953; Ramond, 1953; Spence, Far- 
ber, & McFann, 1956; Taylor & Spence, 1952). 
Tt has been suggested that HA subjects make 
more irrelevant responses to a task—particu- 
larly a stressful or difficult task (Child, 1954). 
The drive theorist’s position is that, if irrele- 


1 This study is based upon a dissertation presented 
to the Department of Psychology of the University of 
Chicago in partial fulfillment of the requirements for 
the degree of Doctor of Philosophy. 

2 The author wishes to express his grateful apprecia- 
tion to Morris Stein, Donald Fiske, and Austin Riesen, 
who provided unfailing guidance and encouragement 
whenever it was needed. 

Now at the State University of Iowa. 
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vant responses are produced, they will compete 
with task appropriate responses and interfere 
with performance. Higher levels of drive will 
increase the number of irrelevant response 
tendencies above threshold at a given time and 
will decrease the rate at which any specific 
weak response can be selectively reinforced 
(Spence, 1958). A contrasting interpretation 
involves the direction of irrelevant responses. 
The essential point is made in Rosenzweig’s 
(1944) distinction between “need persistent” 
and “ego defensive” reactions to frustration. 
In a need persistent reaction, behavior is 
oriented toward attaining an objective goal, 
that is, toward interaction with environmental 
conditions that are relevant to the frustration, 
In an ego defensive reaction, behavior involves 
a general avoidant or counteractive orientation 
which is irrelevant to the objective task. The 
latter characteristics have been attributed to 
anxious behavior by Hanfmann (1950) in her 
review of research on anxiety. She points out 
that anxiety is most commonly observed in an 
individual who is experiencing a threat that he 
does not clearly perceive or understand, Irrele- 
vant ego defensive responses should be charac- 
teristic of HA subjects, since many of the items 
in the MA scale refer to a propensity to become 
anxious without regard for reality considera- 
tions. However, recent research has cast con- 
siderable doubt on the hypothesis that the HA 
subject produces more irrelevant responses 
than any other subject. On tasks which mini- 
mize associative interference, the performance 
of the high anxious group is frequently superior 
to the LA group (Spence et al., 1956; Taylor & 
Chapman, 1955). Even under some of the 
stress conditions which consistently produce 
performance decrements in subjects gen- 
erally—very possibly by evoking irrelevant 
responses (cf. Lazarus, Deese, & Osler, 1952), 
HA groups have performed at a superior level 
(Deese, Lazarus, & Keenan, 1953; Taylor, 
1958). ji 

Complexity can be defined in terms of the 
number of components or steps which are 
essential to the performance of a task or solu- 
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tion of a problem. Little research has been 
done on the interaction between complexity, 
defined in this way, and anxiety. Grice (1955) 
found that a high manifest anxiety group had a 
longer reaction time in a complex perceptual 
discrimination task, but the difference was 
accounted for by the lower intelligence scores 
of the high group. Using the Children’s Mani- 
fest Anxiety (CMA) scale LA elementary 
school children have been found superior to 
HA children on difficult, complex discrimina- 
tion learning tasks by Palermo, Castaneda, 
and McCandless (1956) and Castaneda, Pa- 
lermo, and McCandless (1956). Initial differ- 
ences in ability were not controlled in the 
latter studies; and none of these investigators 
systematically varied complexity as a function 
of the number of task components. As pointed 
out above, drive theory predicts more response 
competition and poorer performance for HA 
subjects as the number of components essential 
to a task are increased (Farber, 1955). The 
nondrive prediction would be of poorer per- 
formance by HA groups, if a given increase in 
the complexity or difficulty of a task is more 
frustrating or threatening to the HA subject 
and thus tends to reorient his behavior in an 
ego defensive direction (French, 1952; Saltz & 
Hoehn, 1957). 

The purpose of this study is to determine the 
effects of anxiety upon performance on complex 
reasoning problems. Reasoning, in this in- 
stance, means the task of abstracting a rela- 
tional principle from a model situation and 
applying it to a new situation. Tests of logical 
deduction and figure analogies are used. Com- 
plexity is varied by increasing the number of 
elements which have to be taken into account 

formulating the principle of the problem. 
thin each test there are four subtests—each 
ore complex than the one preceding. In order 
to isolate the effect of the complexity variable, 
mean scores on the complex subtests are ad- 
justed relative to mean scores on simpler sub- 
tests by means of analysis of covariance. Three 
measures of manifest anxiety, provided by 
three different personality tests, are used. In 
addition, a measure of the “fear-of-failure” 
variant of anxiety has been included. The 
hypotheses are as follows. 

Hypothesis 1. The performance of the high 

manifest anxiety group on complex problems 
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shows a greater decrement from their per- 
formance on simple problems than is shown by 
the low manifest anxiety group. 

Hypothesis 2. The performance of the high 
fear-of-failure group on complex problems 
shows a greater decrement from their per- 
formance on simple problems than is shown by 
low fear-of-failure group. 


METHOD 
Personality Variables 


In an attempt to control test-specific variance, three 
types of personality tests? which have previously 
proven useful were used—a Self-Description Inven- 
tory and two kinds of projectives, a Sentence Comple- 
tion test, and a Picture Interpretation test. Defining 
criteria were formulated to serve as the basis for rating 
the projectives and selecting the inventory items. 
The criteria were expressed in terms of the individual's 
typical reaction to frustration and attempt to cope with 
it. The definition of manifest anxiety includes any 
expression or implication of awareness of anxiety. 

The defining criteria for manifest anxiety cover three 
areas: Frustration is perceived as a threat to the 
individual’s basic security, Warning of the threat is 
given by an active state of emotional upset character- 
ized by tension, anxiety, and apprehension. The primary 
aim of the individual is to protect himself from misery, 
attack, punishment, etc. The individual expects his 
instrumental efforts to resolve frustration to be con- 
fused, futile, thwarted, and ineffectual. His outlook is 
pessimistic, and his lack of hope for a constructive 
resolution may turn his efforts toward finding a way of 
escaping the frustrating situation, In general, other 
persons are mistrusted. The individual feels misunder- 
stood and without a reliable person to whom he can 
turn for aid or reassurance. Authorities often reject or 
punish in spite of the individual’s good intentions. 
Peers are expected to be troublesome or indifferent. 

A modification of the Michigan Sentence Comple- 
tion Test (Kelly & Fiske, 1951) was used as one instru- 
ment. Over half the original items were dropped, and 
first person references were changed to the third person 
or to an impersonal phraseology. A few items were added 
which referred to success, failure, and goals. The 
instructions were to complete all of the sentences, to 
write down the first thing that came to mind, and to 
work quickly. The time limit was 15 minutes. 

The Picture Interpretation test was based upon 15 
pictures clipped from old magazines, The pictures 
depicted one or more persons in ambiguous circum- 
stances which could be interpreted as frustrating. The 


3 The instruments used in this study have been 
deposited with the American Documentation Institute. 
Order Document No. 6778 from ADI Auxiliary Publi- 
cations Project, Photoduplication Service, Library of 
Congress; Washington 25, D. C., remitting in ad- 
vance $2,00 for microfilm or $3.75 for photocopies. 
Make checks payable to: Chief, Photoduplication Serv- 
ice, Library of Congress, 
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were told not to describe the picture, but 
‘rather to give a brief, imaginative interpretation of it 
by suggesting the thoughts, wishes, fears, intentions, 
beliefs, and feelings of the people which would be 
underlying reasons for the situation being the way it 
was. Each picture in turn was held in front of a group 
‘of subjects, and 70 seconds were allowed for writing the 
interpretation. 

‘The following score categories were used for both 

jectives: a 

Score 2: if the themes and self-descriptive state- 
ments correspond to the criteria (in at least two of 
the main areas) so frequently and definitely that the 
variable (manifest anxiety) can be considered one 
of the most important characteristics of the in- 
dividual’s personality. 

‘Score 1: if there is some evidence that the variable 
is very important in the personality under considera- 
tion, but for some reason the evidence is not decisive. 
(Note—this score is not to be used excessively. Unless 
there are hints that the variable might be very im- 

t, the “0” score should be used.) 

Score 0: if the evidence indicates that the variable 
is not one of the primary characteristics of the indi- 
yidual’s personality. 

Indicate no confidence: in addition to one of the 
other scores when it is felt that there is so little 
evidence that the rating is apt to be invalid, 

Each projective was rated by the author and one of 
two other judges. The ratings of the Sentence Comple- 
tion and of the Picture Interpretation tests were made 
independently of each other in all cases. The judges had 
copies of the defining criteria for manifest anxiety as 
presented above. Each judge made several 
ratings and discussed them with the author until it was 


terpretation 

or fifths (in the case of the Sentence Completion test). 
Each section of a test was rated for all subjects before 
going on to another section. With knowledge of his 
ratings of the separate sections, the author reread each 
Protocol and made an overall rating on manifest 
anxiety. To save time for the other two judges, the 
protocols were divided only The 
ratings of the two sections for 
mechanically combined for a given judge. 

ratings were combined, either from a single judge 
from two judges, the arithmetic mean was used 
when the two ratings were a 1 and a 2, 
tion of the latter two ratings was taken 


far as that score was concerned, 
of 89 items to which a 
“undecided,” 
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the Fear-of-Failure scale do not appearin the MA scale, 
but several of them closely resemble items in the Test 
Anxiety scale of Mandler and Sarason (1952). 

For testing the experimental hypotheses with respect 
to the inventory scores, extreme groups were selected 
on the basis of cutting points established at salient 
discontinuities about one standard deviation above 
and below the mean on each inventory score. Subjects 
who marked “undecided” or who failed to respond on 
20 or more items were rejected. 

The intercorrelations among the four personality 
measures are shown in Table 1. A rank order coefficient 
was deemed appropriate because the projective scores 
included only a few gross intervals. Kendall’s tau, 
which could readily be corrected for the many tied 
ranks that occurred, was used. The correlations were 
computed for each school-sex sample separately, and 
all subjects providing usable data on all personality 
measures were included. The inventory manifest anxiety 
score was found to be significantly related to the 
sentence completion manifest anxiety score and to the 
inventory fear-of-failure score in four of the five sample 
groups. The picture interpretation manifest anxiety 
score was significantly correlated with the sentence 
completion manifest anxiety score in only two of the 
samples. No other correlations were significant. Posi- 
tive trends seemed apparent in the other relationships 
except for that between picture interpretation mani- 
fest anxiety and inventory fear-of-failure. There were 


TABLE 1 


Rank CORRELATIONS (tau) AMONG PERSONALITY 
MEASURES IN FIvE SAMPLES 


Manifest Anxiety Measure 
Personality Measure A k 5 
entene - 
Semin teen Inventory 
Picture Interpreta- 
tion Manifest 
sample: (0) 
le: 
AM Qa —.14 
BM (24) 02 
CM aD -41** 
AF (24 
BF (33) .33* 
Inventory Manifest 
le: 
AM 14 03 
BM .50** 17 
CM -A1** 24 
AF +39* —.08 
BF .34* .21 
Inventory Fear-of- 
Failure 
Sample: 
AM -29 —.24 = =.52** 
BM -07 S .34* 
CM 10 —.02 .44** 
AF s21 —.02 i55% 
BF BGS 04 23 


* Significant beyond the .05 level. 
** Significant beyond the .01 level. 
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no systematic differences in the correlations which 
could be attributed to school or sex. 

In all cases, the relatively low magnitudes of the 
correlations indicated that the scores could not be 
substituted for one another. Therefore, a separate test 
of the hypothesis was made for each score. In addition, 
the hypothesis concerning manifest anxiety was tested 
with extreme groups selected on the basis of a combina- 
tion of scores provided by the three instruments, Low 
groups of sufficient size were obtained by selecting those 
subjects who fell in the low group on all three instru- 
ments. This stringent criterion did not provide groups 
of sufficient size at the high end of the scale. Therefore, 
any subject who fell in the high group on two of the 
personality instruments was selected for the combined 
group. This procedure makes sense if one assumes that 
the instruments were rather insensitive and missed 
some high subjects." 


Reasoning Tasks 


The reasoning tasks were tests of figure analogies 
and logical deduction, each of which consisted of 60 
multiple-choice problems (cf. Thurstone, 1938). In 
either case, each problem consists of a set of logically 
interrelated components, the model, and a set of 
response alternatives, one and only one of which is 
logically consistent with the model set. Both tests are 
divided into four 15-problem subtests representing four 
levels of increasing complexity. Subtest I (simplest) 
was given first, and the level of complexity of the 
succeeding subtests was regularly increased so that 
Subtest IV (most complex) was given last. Several 
practice problems served as an introduction to Subtest 
I on each type of test. The four subtests of a given type 
are parallel except for complexity. Although a wide 
variety of elements and inference structure are used, 
each type of inference is represented by similar items 
in all four subtests. At all four levels, some of the 
problems are made more difficult by the presence of 
irrelevant information and nonsense terms. 

In the Figure Analogies test, the model set consists 
of a pair of figures (1 and 2) which differ from each 
other on one or more figural dimensions: size, shape, 
shading, number, and position. A third figure (X) is 
shown, and the subject is required to select from five 
alternative figures (A to E) the one which bears the 


5 The attempt was made to study need Achievement 
defined in a way similar to that used in defining mani- 
fest anxiety, The three need Achievement scores were 
found to be uncorrelated with one another and essen- 
tially uncorrelated with all anxiety variables. Further- 
more, none of the need Achievement variables singly or 
in combination were found to have consistent relation- 
ships with performance on the complex reasoning tasks. 
The results for the need Achievement variable are 
reported in Van Buskirk (1959). 

It had been hoped to define high and low fear-of- 
failure groups on the basis of extreme scores on both 
variables. However, this was impractical because of the 
small number of subjects who clearly fell into such 
patterns. Therefore, our only measure of the fear-of 
failure variant of anxiety is that provided by a special 
inventory scale. 
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same relationship to the Figure X as Figure 2 bears to 
Figure 1. The inference required may be summarized as: 
1is to 2 as X is to?. In the subtest at the simplest level 
(Level I), the two model figures differ in only one 
significant respect. In each succeeding subtest, the 
number of differences between the model figures is 
increased by one. In Subtest IV, four differences within 
the model pair of figures have to be taken into account. 

In the logical deduction Subtest I, there are two 
propositions in the model set and three alternative 
conclusions. The complexity of each succeeding subtest 
is increased by the addition of one more proposition 
to the model set at each level, and by the addition of 
one more conclusion at Levels III and IV. In order to 
solve a problem in Subtest IV, the subject has to deduce 
a conclusion from a chain of five propositions and to 
identify the conclusion among five response alternatives. 
The complexity of the problems in Subtest IV is so 
great that they are extremely difficult. 

Each subtest is timed separately with the time limit 
increased at each level. The times are short enough to 
prevent most subjects from completing any subtest. 
In order to reduce the effects of speed the scores have 
been corrected for guessing by subtracting the per- 
centage of wrong answers from each subject’s score 
which he would have gotten correct by chance. Each 
subtest provides a separate corrected score. 


Subjects and Procedure 


The subjects were undergraduate students from 
three different universities. Most of the students 
participated in this experiment in connection with some 
psychology course. As a result, they received formal or 
informal credit in that course for participation. Two- 
thirds of the studentsat School C responded to a notice 
in the dormitory offering $1.00 to participants in a 
psychological experiment. All prospective participants 
were told that five monetary prizes would be awarded 
for the five highest total scores on the two reasoning 
tests. First prize was $10.00, the second $5.00, the 
third $3.00, the fourth $2.00, and the fifth $1.00. The 
students were encouraged to participate but were 
essentially volunteers, At Schools A and B, both males 
(M) and females (F) served as subjects. At School C, 
only males participated in sufficient numbers to be 
included in the study. They ranged in age from 17-25. 
A few subjects who had ages of 27 or greater were 
excluded, as were those who failed to follow instruc- 
tions or did not complete the testing. 

Combining the characteristics of school and sex 
gave five sample groups: AM, AF, BM, BF, and CM. 
These five groups differed rather widely among them- 
selves on many of the scores used in this study; 
therefore, the data have been analyzed for each of the 
five groups separately. 

The tests were administered in group sessions. At 
School A, the testing was accomplished in a single 
2-hour session in which the tests were given in the 
following order: Picture Interpretations, Sentence 
Completion, Inventory, Figure Analogies, Logical 
Deduction, At School B, students were tested in two 
successive regular psychology class sessions. In the 
first session, they were given Picture Interpretation, 
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Sentence Completion, and Figure Analogies tests. In the 
second session, they took the Inventory and Logical 
Deduction tests. At School C, the paid subjects were 
tested in a single session and the others in two classes. 


RESULTS 


The hypotheses were tested by determining 
differences in performance in high and low 
groups on each of the personality variables. By 
use of the analysis of covariance technique, the 
means of the complex subtest were adjusted so 
that the variance associated with initial ability 
was partialed out; thus the proportion of the 
variance due to complexity was maximized. In 
the Figure Analogies test, Subtest I, for which 
the mean in all groups was considerably higher 
than the means of other subtests, provided a 
good measure of ability on simple items. There- 
fore, the Subtest I score was used as the con- 
trol variable in the analysis of covariance. The 
means for Subtests II and III differed only 
slightly; for economy’s sake, Subtest II means 
were not analyzed. In the Logical Deduction 
test, Subtest I, there was a large “warm-up” 
factor. The Subtest IZ mean was at least as 
great as the Subtest I mean in each group. It 
seemed advisable to reduce the proportion of 
variance due to warm-up in the measure of 
ability on simple items. Therefore, the sum of 
Subtests I and TI was used as the control varia- 
ble for the analysis of covariance of the logical 
deduction scores. The correlations between the 
two types of reasoning test were generally posi- 
tive but low, so the data were treated sepa- 
rately for the Figure Analogies and Logical 
Deduction tests. Differences associated with 
school or sex were controlled by computing 
statistical tests of significance for each school- 
sex sample separately and combining the 
resulting p values for an overall test of signifi- 
cance, using the chi square technique suggested 
by Jones and Fiske (1953). j 

The first hypothesis was that the high mani- 
fest anxiety group would show a greater decre- 
ment on complex problems than the low mani- 
fest anxiety group. In Table 2 are shown the 
differences between the adjusted complex 
reasoning test means of the low and high groups 
selected on the basis of the combination of the 
three manifest anxiety scores. The differences 
were computed by subtracting the high anxiety 
group’s mean from the low group’s mean so 
that a positive difference would be in accord 
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with the hypothesis. One-tailed tests of signifi- 
cance were made. The differences on the 
Figure Analogies subtests were clearly within 
the range of chance variation. On the Logical 
Deduction subtests, the differences were con- 
sistently in the negative direction and yielded 
overall p values of .99 for Subtest III and .95 
for Subtest IV. These results tended to con- 
tradict the hypothesis. In fact, the differences 
in the unexpected direction were large enough 
to demand that their statistical significance be 
tested by a two-tailed test. Within the theoreti- 


TABLE 2 
ADJUSTED DIFFERENCES BETWEEN REASONING 
Susrest Mrans oF Low AnD HIGH GROUPS 
ON THE COMBINED MANIFEST ANXIETY 


CLASSIFICATION 
(N = 52) 
Logical Deduction Figure Analogies 
(eckmple,g | Subtest III | Subtest IV [Subtest IIT] Subtest TV 
sex) — es 
i i iffer-| „ [Dif 

le alice 

AM —1.3}.99 |—0.8).84 |—1.4].94/—1.6].95 
BM —1.0).89 |—1.3).94 |+0.2).44/+-0.7).34 
CM +0.1|.46 |—0.3|.63 |+0.5/.31|—0.4|.63 
AF —1.7/.97 |—0.1).52 |—0.5|.78/+1.2|.12 
BF —2.71.99 |—2.01.98 |—0.2[.51|—0.5|.66 
Combined .99* +95>| 81 60 

$ 


2 A two-tailed test of the difference in the direction opposite 
to hypothesis is significant,  < .01. 

b A two-tailed test of the difference in the direction opposite 
to hypothesis is significant, p < .05. 
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cal context of this study, the possibility that 
the various samples showed significant differ- 
ences in different directions was of no rele- 
vance. The alternative case of considerable 
interest was the proposition that all five sample 
groups showed differences contrary to our 
original hypothesis. In order to maintain con- 
sistency of direction, a one-tailed test of the 
difference in the unexpected direction was ob- 
tained for the five sample groups. These p 
values were then pooled, and the resulting 
value was doubled. 

The two-tailed test showed that the high 
manifest anxiety group was superior on both 
complex subtests of logical deduction. The 
differences were more extreme on Subtest IIT 
for three of the sample groups and, in fact, 
reached the .01 level overall. On Subtest II, 
the variances in Samples AF and BF ap- 
proached heterogeneity (p < .10), but it is 
doubtful if this accounted for the overall re- 
sult. However, there was no clear demonstra- 
tion that the results were truly more extreme 
on one subtest than on the other subtest of 
logical deduction. 

In Table 3 are shown the median mean differ- 
ences and combined p values (resulting from 
combining results for the five samples) for the 
comparison between manifest anxiety groups 
selected on the basis of each separate per- 
sonality instrument. For each type of per- 
sonality instrument, at least one of the com- 
parisons on logical deduction was in the 


TABLE 3 


Summary or ADJUSTED REASONING MEAN DIFFERENCES BETWEEN Low AND HIGH Groups SELECTED ON THE 
Basis of THREE SEPARATE MANIFEST ANXIETY MEASURES 


Logical Deduction Figure Analogies 
Manifest Anxiety Measure N 
Subtest III Subtest IV Subtest IIT Subtest IV 

Picture Interpretation 60 

Median difference —0.8 —0.1 —0.8 —0.3 

Range differences —1.9 to +0.3 | —2.0 to +2.1 | —1.6 to +0.3 | —1.3 to +0.5 

Combined -06 = = = 
Sentence Completion® 49 

Median difference ; —1.0 —1.0 +0.6 +0.8 

Range differences —1.8 to +1.4 | —1.1 to +0.2 | —0.8 to +1.3 | —1.2 to +1.9 

Combined p -05 = — = 
Inventory 53 

Median difference —0.1 —1.2 —0.1 —0.2 

Range differences —2.2 to 0.0 | —2.5 to —0.1 | —2.5 to +1.4 | —1.1 to +0.4 

Combined p .05 .05 — = 


Note.—All significant differences are in the direction opposite to hypothesis. 
® Based on four samples only; sample AM had no “highs” on this measure. 
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TABLE 4 
ADJUSTED DIFFERENCES BETWEEN REASONING 
Suprest Means or Low AND HIGH GROUPS 
ON THE INVENTORY FEAR-OF-FAILURE 
SCALE 
(N = 41) 


Logical Deduction Figure Analogies 


Sample (school (Subtest I1I|Subtest IV|Subtest IIT Subtest IV 
and sex) 


Differ- Differ- 


ence b ence d ence ence 
AM —1.5].95|—2.0].97/—0.9}.81}+-0.5}.31 
BM —0.1}.52|—1.8].93]—0.9].72|+-0.6}.34 
CM —1.4|,93/-+1.7|.07|4+1.1).16)+1.9).08 
AF —1.0|.75|—1.2|.76|+0.6].34}-+1.0}.26 
BF +0.2|.43|+-0.4|.39|+-0.8].22)-+-2.3).04 
Combined p .95 63| 44) 05 


direction opposite to prediction at around the 
05 level. The differences tended to be more 
extreme on Subtest III for the groups selected 
on the basis of the projective instruments. In 
the case of the figure analogies comparisons, 
the differences were all nonsignificant. The only 
new feature was that the difference between 
the groups selected by the sentence completion 
score showed a positive trend. 

The differences between the adjusted com- 
plex reasoning test means of the low and high 
groups on the inventory fear-of-failure score 
are shown in Table 4. Again the high group’s 
mean was subtracted from the low group’s 
mean. We pointed out earlier that the fear-of- 
failure and manifest anxiety scores on the 
inventory are rather closely related. F. ifty-six 
percent of the subjects involved in the com- 
parisons on fear-of-failure were also involved 
in the comparisons on inventory manifest 
anxiety. Therefore, these results may not be 
considered independent of the findings on 
manifest anxiety. On the Logical Deduction 
test, the high fear-of-failure group tended to be 
higher on Subtest III, contrary to the original 
hypothesis. However, the two-tailed test of 
the difference in the unexpected direction was 
not significant. On Subtest IV of the Figure 
Analogies test, the high fear-of-failure group is 
inferior in all samples, and the combined p 
value is .05. 

Discussion 


For any of three methods of measuring mani- 
fest anxiety, the high group was superior to the 
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low group on some complex, verbal, logical 
deduction problems. Neither group was supe- 
rior on complex figure analogies problems. 
Although the results vaguely suggested that 
the differences on logical deduction would 
disappear if the task were complex enough, the 
superiority of the high group seems more 
likely to be related to the increased complexity 
of the problems than to anything else. These 
results show that an increased number of 
elements in a reasoning task does not interact 
with the higher drive of the high manifest 
anxiety subject to produce greater interference 
among intratask (or extratask) responses. The 
extensive evidence (cf. Spence, 1958) that 
manifest anxiety is a measure of drive is not 
necessarily challenged. We need a better theory 
of the way in which drive, habit, and response 
variables interact in a multiresponse task when 
no one response has great initial strength. 
Perhaps a crucial factor is that reasoning tasks 
call for each correct response to be used only 
once instead of being repeated in a certain order 
as in the learning tasks previously used. At any 
rate the significance of the nature of the task 
for predicting the effects of motivation is 
emphasized. 

Another theory that merits consideration is 
that differential motivational states in HA 
and LA subjects orient them in different direc- 
tions toward the experimental tasks. It is 
clearly not the case that an increase in the 
complexity or difficulty of a reasoning task is 
unusually discouraging or disorganizing to the 
high manifest anxiety subject. On the contrary, 
it would seem that the HA subject is trying 
harder. There is evidence that the HA subject 
is more responsive to experimental reward 
(Taffel, 1955) but more sensitive to an ego 
involving threat. In two experiments, Sarason 
(1956, 1957) found the HA group to be supe- 
tior to the LA group only when neutral instruc- 
tions were used. The task was a very difficult 
list of nonsense syllables on which no group 
achieved a perfect recitation, When ego in- 
volvement was introduced through a threat 
that the results on the experiment would affect 
future evaluations of the subjects intelligence, 
the LA group was significantly superior to the 
HA group. The persistence with which an 
individual works toward a goal is a joint func- 
tion of his need for the goal and his expectation 
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of obtaining it under given conditions (French, 
1952). When the experimental set is neutral or 
emphasizes service to the experimenter for a 
monetary incentive, as in the present experi- 
ment, the HA subject would have a greater 
commitment to performing well because of his 
generalized need for reassurance and approval. 
When the threat of a negative evaluation con- 
tingent upon successful performance is intro- 
duced, both the need and expectation of success 
of the LA subject should change in the direc- 
tion of greater commitment to successful per- 
formance. However, the HA subject’s great 
self-doubt, or expectation of frustration and 
punishment, should interfere with an objective, 
need persistent orientation. The performance 
of an HA group under ego involving instructions 
should show the characteristic decrements 
shown by all groups after an arbitrary report 
of failure (Lazarus, Deese, & Osler, 1952; 
Taylor, 1958). The more inferior performance 
of the high fear-of-failure group in the present 
experiment is consistent with this theory. The 
high fear-of-failure subject describes himself 
through the inventory as having a much 
greater expectation of failure—particularly in 
any testing situation. 

A final consideration is whether the anxiety 
variable is interacting with task subject matter. 
In a minority of the figure analogies problems, 
there were wrong response alternatives which 
looked more similar to the model set, from 
which the relationship is to be abstracted, 
than did the correct response alternative. The 
presence of such similarities introduces an 
element of prepotent error associations which 
have repeatedly posed difficulties for high 
manifest anxiety subjects. An ad hoc classifica- 
tion of the two types of items showed that they 
occurred about equally often at each level of 
complexity. A separate scoring of each type of 
item revealed no consistent tendency for the 
HA group to make more errors on the type with 
interfering wrong responses. A related finding 
has been reported by Himmelweit (1945). He 
compared the Raven Progressive Matrices 
scores and vocabulary scores of neurotic sub- 
jects. The Raven Progressive Matrices is a test 
of analogical thinking in which a concrete set 
usually leads systematically to the wrong 
answer. Himmelweit found a significant ten- 


CHARLES Van BUSKIRK 


dency for anxious and depressed patients to 
have higher vocabulary scores and for hysteri- 
cal patients (who deny anxiety) to have higher 
matrices scores. Palermo et al. (1956) and 
Castaneda et al. (1956) found differences favor- 
ing the LA group on complex, nonverbal, 
discrimination learning tasks. The question 
remains as to whether the general inferiority 
of anxious subjects on nonverbal tasks is a 
function of differential response to subject 
matter (perhaps threat evoked by novelty) or 
of the differential effects of intratask inter- 
ference. A study needs to be done utilizing two 
types of nonverbal task, one with competing 
false response alternatives and the other with 
noncompeting alternatives. 


SUMMARY 


This study attempted to determine the 
effects of anxiety upon performance on complex 
reasoning problems. Manifest anxiety was 
measured with three instruments: picture 
interpretation, sentence completion, and an 
inventory. Two reasoning tasks, logical deduc- 
tion and figure analogies, were each divided 
into four separate subtests of increasing degrees 
of complexity defined in terms of the number 
of components involved in the problem. It was 
found, contrary to hypothesis, that the high 
manifest anxiety groups performed at a supe 
rior level on logical deduction complex prob- 
lems than did the low manifest anxiety groups. 
The differences were found for extreme groups 
selected on the basis of each separate per- 
sonality instrument as well as on a combination 
of all three. There were no significant differ- 
ences on the figure analogies problems for the 
manifest anxiety groups. The high group on an 
inventory “fear-of-failure” score was signifi- 
cantly inferior on the most complex figure 
analogies subtest. The results do not support 
the hypothesis that the performance of the HA 
subject will show greater intratask interference, 
disorganization, or ego defensiveness when the 
complexity of a task is increased simply by 
increasing the number of components in the 
task. The results are consistent with the inter- 
pretation that the HA subject is more strongly 
committed to performing well as long as the 
experimental situation does not present 4 
serious, ego involving threat. 
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N an earlier paper (Berkowitz, 1959) the 
writer reported that anti-Semitic college 
students were more likely to displace 

hostility onto a previously neutral bystander 
when angered than were their less prejudiced 
peers. These findings not only tend to support 
the scapegoat theory of prejudice; they also 
raise the problem of explaining the negative 
results obtained in other, similar investigations. 
Lindzey (1950), for example, exposed groups 
of subjects, either very high or very low in 
minority group prejudice, to experimentally- 
induced frustrations and found that the preju- 
diced subjects did not differ significantly from 
the less prejudiced group on his measures of 
hostility displacement. 

Since the writer’s investigation had also used 
extreme groups selected on the basis of a very 
similar measure of anti-Semitism, it seems 
likely that the variation in results can best be 
accounted for in terms of the indices of dis- 
placement. Berkowitz used questionnaire rat- 
ings of the subjects level of friendliness toward 
a given peer, the experimenter’s confederate, 
while the other experimenters employed fan- 
tasy measures of aggression. This difference 
may be important because the ratings con- 
ceivably could have been more affected by 
judgmental processes than were the fantasy 
indices. Holmes and Berkowitz (1961) dem- 
onstrated that ratings of friendliness are 
subject to judgmental influences; conditions 
analogous to those producing judgmental con- 
trast effects in psychophysical experiments 
gave rise to contrast-like effects in these friend- 
liness ratings. However, granting the operation 
of judgmental processes, we also must assume 
that highly prejudiced people make different 
kinds of judgments when frustrated than do 
their less prejudiced fellows. 

A recent paper has shown how these differ- 
ences might operate (Berkowitz, 1960). As is 

1This study, conducted by the writer’s class in 
Experimental Social Psychology, is part of a research 
program supported by Research Grant M1540(C2) 
from the National Institute of Health, United States 
Public Health Service. 


210 


well known, when a person makes a judgment 
he compares the to-be-evaluated stimulus with 
some appropriate standard. A contrast effect 
arises in this process when there is a relatively 
great psychological distance between the 
stimulus and the standard, i.e., the distance 
appears greater than it “really” is (as defined, 
for example, by an observer not affected by the 
standard), while assimilation, in which this 
distance is minimized, occurs when the true 
distance is relatively small. Psychological 
distance can be coordinated, among other 
things, to the number of discrete categories the 
observer can establish between the to-be- 
evaluated stimulus and the standard on rele- 
vant dimensions. Generally, the greater the 
number of discriminations the observer can 
make among stimuli objectively located be- 
tween the critical stimulus and the standard on 
these dimensions, the greater the perceived 
distance between them. Persons who typically 
employ broad categories when evaluating 
stimulus objects should therefore be more 
likely to display judgmental assimilation than 
people who characteristically make finer dis- 
criminations.? 

In applying this analysis to the study of anti- 
Semitism and displacement, the writer has 
suggested that stress serves to broaden the 
categories established by the highly prejudiced 
and to narrow those set up by the less preju- 
diced. As a result, the former are likely to 
exhibit assimilation. They do not clearly differ- 
entiate between the immediate frustrator and 
the objectively neutral person so that the nega- 
tive feelings aroused by one can be generalized 
readily to the other. (This differentiation prob- 
ably would take place, however, when the 

* This statement can be tautological in terms of some 
operations. For example, “broad categories” can be 
taken to mean a relatively flat stimulus generalization 
gradient for stimuli in that class, The person responds 
the same way to all of the objects in the category be- 
cause they have similar stimulus qualities for him. 
Similarly, “assimilation” also involves stimulus general- 
ization. In this case the same judgmental responses are 
made to the objects because they have similar stimulus 
qualities. 
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stimulus objects are a “real” person, the experi- 
menter, and a “nonreal” picture on a TAT 
card.) The less prejudiced, on the other hand, 
apparently respond to the stress of the frustra- 
tion by differentiating the frustrator from the 
bystander more sharply, a tendency heighten- 
ing the contrast between these two objects and, 
according to the results, they become some- 
what friendlier to this bystander. 

Two studies were used to support this line 
of thought. In the first of these, Brown (1953) 
showed that highly authoritarian persons (as 
defined by the California F Scale) exhibit 
greater problem solving rigidity than less 
authoritarian subjects, but only when placed 
in the stress of an ego involving situation. 
When this finding is translated into the present 
theoretical framework, it suggests that the 
authoritarian’s categorizations generally be- 
come broader under at least mild stress. Once 
stress is created, he becomes less likely to 
differentiate into the various types of problems 
that he could discriminate among under condi- 
tions of low ego involvement. Apparently, as 
far as he is concerned, they all are in one 
category, and he responds in a similar manner 
to all of the problems. 

The second study, conducted by Pauline 
Schoenberg under the present writer’s supervi- 
sion, sought to test this interpretation employ- 
ing the format of a more typical investigation 
of judgmental processes. The subjects were 
asked to rate the “goodness” of 11 personality 
traits, with the first 7 traits, all clearly “good” 
or all clearly “bad,” as defined by the prior 
judgments of 68 other subjects, providing the 
comparison standard, and the remaining 4 
being relatively neutral. Other investigations 
employing these traits had shown that there 
generally is a contrast effect in the evaluation 
of the neutral traits; they are judged as rela- 
tively “bad” when preceded by the 7 good 
traits, and as relatively “good” when they 
follow the bad standard. In other words, most 
subjects seem to differentiate the neutral traits 
from the preceding traits so that by comparison 
with the latter standard they appear more 
different than they really are. Following this 
reasoning, we hypothesized that subjects high 
in F would be less likely to show a contrast 
effect in their evaluations of the neutral traits 
when placed in an ego involving condition 
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than when given less ego involving instructions, 
supposedly because the stress of the former 
condition caused them to make poorer dis- 
criminations between the standard and neutral 
traits. The earlier displacement experiment 
also suggested that these results might be re- 
versed in the case of less authoritarian sub- 
jects. In that study, it will be recalled, subjects 
who were low in prejudice became somewhat 
friendlier to the neutral peer when they were 
frustrated by the experimenter, as if they had 
sharpened their differentiation between these 
two. Extrapolating, we predicted that subjects 
low in F would be more likely to exhibit a con- 
trast effect when ego involved than when not 
given ego involving instructions. The results 
of this second study generally were consistent 
with our expectations (Berkowitz, 1960). 

However, there are difficulties in extending 
these findings to the relationship between anti- 
Semitism and displacement of hostility. The 
instructions given the subjects in the judgment 
experiment emphasized that the traits all 
referred to one person. Although the writer has 
argued against this possibility (Berkowitz, 
1960), the obtained results conceivably could 
stem from differences in adherence to the “all- 
one-person” instruction. The present study was 
designed to resolve this problem. The subjects 
selected primarily in terms of their scores on 
the anti-Semitism scale are asked to evaluate 
the same sets of traits employed in the earlier 
Schoenberg-Berkowitz study, but without 
being told that the traits pertain to one particu- 
lar person. Nevertheless, essentially the same 
results are expected: the highly anti-Semitic 
subjects should display grosser categorizations 
under the stressful situation than in the more 
secure, non-ego-involving situation, while the 
reverse should be true for the subjects low in 
anti-Semitism. 

METHOD 


Subjects 

Sixty-five female and 33 male college students 
participated in the two sessions of this experiment. 
They were recruited from a pool of approximately 
120 volunteers from an introductory psychology course 
who, in a preli i session, had completed a 111- 
item, five-alternative, Likert questionnaire supposedly 
assessing their “attitudes toward everyday matters. 
‘Twenty-eight of these items were taken from the 
California F Scale, while 17 items came from the 
measure of anti-Semitism (A-S) developed by the same 
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investigators (Adorno, Frenkel-Brunswik, Levinson, & 
Sanford, 1950), In both cases item scores varied from 
four points for a response indicating strong agreement 
withan item tozero forthe response “Strongly disagree.” 
The distribution of obtained scores on these measures 
was concentrated toward the low end of the possible 
range: 12 to 67, median = 44 for the F Scale, and 1 
to 49, median = 14 for the A-S scale, 

In the preliminary session the subjects also filled 
out a separate work and class schedule, giving their 
names and addresses. The writer coded these papers 
so that the 15 experimenters taking part in the study? 
would not know the subjects’ attitude scores and ran- 
domly assigned each subject to one of the eight experi- 
mental conditions, maintaining controls only on the cell 
frequencies and the ratio of men to women in each 
condition. Each experimenter then contacted one of the 
six or seven subjects assigned to him for the second 
session, when he administered the instructions ap- 
propriate to the condition for which the subject was 
scheduled. Knowing only the F and A-S scores of the 
subjects who had participated in the second session, 
but not the nature of their judgmental responses, the 
writer then selected those subjects to be included in 
the data analysis. The highly anti-Semitic subjects 
were those falling in the upper third of the A-S distribu- 
tion of the sample (a score of 17 or higher), and also 
above the median on the F Scale, while the less prej- 
udiced subjects were those who came from approxi- 
mately the lower third of the A-S distribution (a score 
of 12 or lower), and were below the median on the F 
Scale, As a result, there were 56 subjects divided 
evenly among the eight experimental conditions in a 
2 X 2 X 2 factorial design. 


Experimental Procedure 


In the second session the experimenter told each 
subject that the purpose of the study was to develop 
quantitative ways of evaluating personality descrip- 
tions, such as those commonly included in letters of 
reference. Supposedly the experimenter wanted to find 
out how “good” or how “bad” the adjectives were that 
were frequently used in these descriptions. To obtain 
this information, the experimenter would read a list of 
personality traits to the subject and the subject would 
indicate the “goodness” of each trait by calling off a 
score for each adjective, A score of 1 was to mean that 
the trait was a “very good” personal quality, while 
13 meant the trait was “very bad.” 

Half of the subjects were placed in a mildly stress- 
ful, ego involving situation by informing them that 
their ratings would constitute a test, involving compari- 
son with the evaluations made by experienced personnel 
directors in order to determine which subjects were 
good judges. (This supposedly was preliminary to 
isolating the personality characteristics of good judges.) 
A lower degree of ego involvement, and therefore, 
presumably of stress, was induced in the remaining 


3 Needless to say, the writer was pleasantly sur- 
ptised that there were any significant findings, much 
less that there was the close conformity to the results 
obtained in the earlier Schoenberg-Berkowitz study. 
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subjects by telling them that their judgments were ; 
a test of their personality, or of how good a judge t 
were. The experimenter said he only wanted to | 
how “people in general” felt about these traits, 
Eleven traits then were read to the subject, wii 
subject giving a score after each adjective. The - 
seven traits provided the comparison standard, 
reported earlier, 68 other undergraduates in a 
liminary investigation had rated a number of adjecti 
on a 13-point scale. Half of the present subjects 
ceived as this standard seven traits that the judges 
evaluated as “good,” while the others heard s 
adjectives the judges had evaluated as “bad,” 
mediately following the standard traits, and witho 
setting them off as different in any way, the expe 
menter read the same four traits, which had previou 
been defined by the judges as “neutral,”* to all subjects, 
The traits and their scale values are given in Table 
The standard traits were always given in the order 
listed in the table, but the order of presentation of tl 
neutral traits was systematically varied according 
prearranged schedule. } 


RESULTS 


The results of the Duncan multiple 
test (Duncan, 1955) of the condition me 
for both the seven standard and four neu 
traits are given in Table 2. A number of conclu 
sions are readily apparent. First, there are n 
significant differences among the A-S level 
stress level conditions in the mean judgment 
of the first seven traits within each of the stand 
ard conditions. Thus, all subjects getting eith 
the “good” or “bad” comparison traits app 
ently received psychologically similar ju 
mental-standard stimuli. The question now 
to what extent did the groups vary in thei 
differentiation of the neutral traits from thi 
traits? 

Generally, this differentiation apparentl 
was made, and there was a very strong con- 
trast effect; when all subjects are combin 
the neutral traits have a significantly poo: 
evaluation following the “good” comparis 
standard than after the “bad” traits (F 
146.26, p = .001). However, the lower half of 
Table 2 also shows that there are condi 
differences in the extent to which the new! 
traits were discriminated from the stant 
stimuli. This is indicated, for example, in 

4 There was relatively good agreement among tl 
68 judges as to the ratings given to the standard tra 
ie., they have a relatively small Q value in the 
stone sense. The “neutral” traits, on the other h 
have a relatively large Q value. Other studies 
out by the writer had indicated that ambiguous 
generally are more sensitive to judgment shifts. 
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judgmental differences between the “good” 
and “bad” standard groups within each of the 
four stress-level conditions. As expected, the 
subjects high in A-S exhibited a greater con- 
trast effect under low stress than in the stronger 
stress condition. That is, only in the low stress 
treatment were the neutral stimuli evaluated 
significantly less favorably after the “good” 
than after the “bad” traits. Also as predicted, 
this pattern is reversed for the Low A-S sub- 
jects who display the significant contrast effect 
under high rather than low stress. In other 
words, the prejudiced subjects seem to make 
poorer discriminations between the standard 
and subsequent stimuli under mild stress than 


TABLE 1 
Tue -SCALE VALUES OF THE STANDARD AND NEUTRAL 
‘TRAITS 


(1 = “very good”; 13 = “very bad”) 
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in the more neutral condition, while the reverse 
is true for the less prejudiced group. 

These discrimination differences do not occur 
with all of the stimuli, however. As can be 
seen in the table, the evaluations of the neutral 
traits by subjects high in A-S are affected relia- 
bly only when these adjectives follow the 
“good” comparison standard. In the case of the 
less prejudiced subjects, on the other hand, 
there is a near-significant difference (p = .07) 
between the high and low stress conditions 
only for the neutral traits following the “bad” 
standard. (Since this difference was significant 
in the earlier Schoenberg-Berkowitz study, the 
finding is probably reliable.) Obviously, we 
cannot say at this point whether these results 
severely restrict the applicability of the present 
hypotheses. The fact that the High A-S sub- 
jects tended to exhibit relatively gross cate- 
gorizations under stress only in the “good” 
conditions may be explained in many ways. 


“Good” Standard “Bad” Standard For example, other meaning dimensions, such 
jecti Scale ate Seale êS the “potency” or “activity” (Osgood, Suci, 
eadiective Value Adjective Value  & Tannenbaum, 1957) of the “bad” traits may 
have affected the extent to which the preju- 

F; 4 i 10.09 a een . % 
2 Ate te by ot te 10.45 diced individuals could categorize these adjec- 
3. Cheerful 1.57 | 3. Belligerent aa tives together with the subsequent neutral 
$ EE pee es Mates 11,59 adjectives; the two sets of traits may have been 
6. Diligent 2.39 | 6. Dull 11.75 too far apart on these dimensions, at least as 
AE 2.63 | 7. Obstructive | 10.88 far as the High A-S subjects were concerned, 
EA for them to be placed in the same broad cate- 
a gory. A similar account could be given of the 
Adjective Scale Value behavior of the Low A-S subjects. Neverthe- 
3 less, in both cases the present hypothesis seems 
8: Krah as generally to be supported: prejudiced subjects 
10. Shrewd 6.05 manifest a different kind of judgmental process: 
11. Nonchalant oy under stress than do their less prejudiced peers. 

TABLE 2 
MEAN JUDGMENTS IN THE EXPERIMENTAL CONDITIONS 
(N = 7 in each condition) 
z Low 
A-S Level: P High Low 
Siren Level: as ais PO ced Bad Good Bad Good Bad 
Mean judgments of 2.665 9.87, 3.31, 10.946 2.57, 10.016 2.56, 10.10, 
standard trait: 
Mean REC 5.6lne  5.23p 6-97a 5-29% TMa 54o 7.064 6.49 ca 


neutral traits 


Note.—Cells containing the same subscript are not significantly 
test, All of the cells in this table were included in the one analysis. 


were assumed to be reliably different, they could not in themselves consist 


traits. 


different from each other (at the .05 level) by Duncan multiple range 


Note that even if the relatively slight mean anchorage differences 
istently account for the differences in judgments of the neutral 
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Discussion 


The present argument obviously faces prob- 
lems in addition to the difficulties inherent in 
interpreting the judgmental results as stem- 
ming from differences in discrimination ten- 
dencies. In order to generalize the findings to 
the earlier study of hostility displacement it is 
necessary to equate the ego involving treat- 
ment in this experiment to the frustration 
produced by the experimenter in the other 
investigation. A recent paper by Easterbrook 
(1959) provides support for this perhaps un- 
attractive assumption—and also presents other 
difficulties. 

Easterbrook (1959) reviews a number of 
studies demonstrating “that emotional arousal 
acts consistently to reduce the range of cues 
that an organism uses” (p. 183). This reduction 
in “the range of cue utilization” is essentially 
identical to the present conception of “category 
broadening.” The range is said to have shrunk 
when the organism makes relatively little use 
of peripheral (partially relevant) cues. Failure 
to attend to peripheral cues also implies that 
the organism responds primarily to the gross 
aspects of the stimuli and not make fine dis- 
criminations among them. In the present 
terminology, the stimuli are more likely to be 
grouped together in the same category. Not 
making these sharp differentiations (not re- 
sponding to the fine details that differentiate 
the stimuli from each other), the organism acts 
as if the stimuli were largely equivalent. 

Easterbrook (1959) points out that reduction 
in the range of cue utilization apparently takes 
place following a great variety of instigating 
conditions, all of which may be regarded as 
emotion arousing. These conditions include 
food deprivation, threats to the self or to the 
body, and what is particularly important for 
this paper, the offering of a prize for effective 
performance, which seems essentially similar 
to the ego involving instruction of the present 
study. In both cases performance was made 
much more important to the individual. Pre- 
sumably, emotion might also have been 
aroused in these conditions, thus, equating 
these instigations with the frustration created 
by the experimenter in the hostility experi- 
ment, at least as far as the discrimination 
process is concerned, 

Applying this reasoning to the present study, 
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we also would have to contend that the appar- 
ently gross discriminations made by the High F 
subjects in the stress condition resulted from 
emotional arousal. Their behavior in the judg- 
mental task then becomes a special instance of 
a more general phenomenon: a reduction in the 
range of cue utilization (i.e., category broaden- 
ing) under emotion-producing conditions re- 
sulting in less adequate discrimination among 
the stimulus objects confronting them. As a 
consequence, responses (such as hostility) 
instigated by one of these objects could be 
generalized readily to other objects. 

Tt is the behavior of the subjects low in F 
that is inexplicable from this point of view, 
One possibility is that the phenomenon does 
not occur in such people under stress. It may 
also be that the Low F people were not made 
emotional by the present conditions. However, 
such an explanation cannot account for the 
presumably sharper differentiations made by 
these subjects both under stress and following 
the experimental frustration. 


SUMMARY AND CONCLUSIONS 


The present study was designed to test a 
theoretical argument used to explain the differ- 
ent results in various investigations of the 
scapegoat theory of prejudice. Berkowitz 
(1959) had obtained evidence supporting the 
scapegoat theory, while other experimenters ; 
(e.g., Lindzey, 1950) have obtained essentially 
negative results. Briefly, it is suggested that the 
index of hostility displacement employed by 
Berkowitz was more affected by judgmental 
processes than the indices of fantasy aggression 
used by the other investigators, and that 
highly prejudiced people make different kinds 
of judgments under stress than do people lower 
in prejudice. According to this judgmental 
analysis, the conceptual categories established 
by those high in prejudice tend to broaden 
under stress, i.e., they make poorer discrimina- 
tions, so that both the frustrator and a neutral 
bystander are more likely to be included in the 
same category and the hostility aroused by the 
former can readily be generalized to the latter. 
Less prejudiced people, on the other hand, 
apparently react to stress with a tendency to 
make finer differentiations. As a result, there 
was a heightened contrast between the frus- 
trator and the bystander for these people and 
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their friendship for the bystander tended to 
increase. 

To test these hypotheses, extreme groups on 
the Levinson anti-Semitism scale were placed 
either in a mildly stressful, ego involving condi- 
tion or in a less stressful, non-ego-involving 
condition, and were asked to evaluate the 
general “goodness” of 11 personality traits, 
The first 7 (comparison standard) traits objec- 
tively differed from the last 4 traits on this 
“goodness” dimension. As was found in a pre- 
liminary investigation employing the same 
procedure (Berkowitz, 1960), the results gen- 
erally are in accord with the hypotheses; the 
highly anti-Semitic subjects seemed to make 
grosser discriminations under stress than in the 
More neutral situation, while the reverse was 
true for the less prejudiced subjects. The find- 
ings suggest, furthermore, that these differ- 
ences stem from differences in perceptual 
discriminations rather than from differences in 
degree of consistency, i.e., differences in a tend- 
ency to give the same evaluative responses. All 
subjects apparently differentiated the last 4 
from the first 7 traits; their evaluations of these 
two sets of traits were different in the range of 
numbers used. Instead of differing in the con- 
sistency of their responses, the subjects seemed 
to vary in the magnitude of the perceived 
difference between the comparison and subse- 
quent stimuli. 

Tf this judgmental analysis is correct, it 
Taises the problem of whether the question- 
naire ratings employed in the writer’s earlier 
study of displacement really assessed the 
strength of the subjects hostility drive. Un- 
doubtedly, aggressive behavior and judgmental 
evaluations represent different response Classes. 
Nevertheless, there probably is a correlation 
between these two sets of responses, at least at 


215 


the extremes, Feelings of friendliness towards 
another not only result in more favorable 
evaluations of him than do feelings of neutral- 
ity or unfriendliness (Berkowitz, 1957), but 
the favorable evaluations may also weaken if 
not prevent the occurrence of aggressive acts. 
Moreover, with overt aggression generally 
frowned upon in civilized society, the expres- 
sion of unfavorable evaluations frequently is 
the only relatively permissible way that 
aggressive inclinations may be displayed. It 
certainly should not be concluded that un- 
favorable evaluations have nothing to do with 
aggression. 
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NE of the basic assumptions of most 
personality theories is that a person’s 
perception of others is influenced by 

his own personality characteristics. Further, if 
the possessed characteristic is negatively 
valued, the person’s ability to assess the pres- 
ence of the same trait in others is believed to 
be impaired. Insight into one’s own personality 
is thought to mitigate such distortion. Indeed, 
one of the goals of didactic therapy is to help 
the therapist-in-training to gain insight into 
his own actions and thus avoid projecting his 
own motives onto his patients. 

Despite the very general acceptance of these 
beliefs, they lack adequate experimental vali- 
dation. The present study was therefore de- 
signed to investigate how possession of a trait 
of low social value affects the objective evalua- 
tion of the strength of the trait in others. A 
second objective was to determine whether 
insight into one’s standing on a trait increases 
the objectivity with which a person assesses 
it in others. 

The trait dimension of “hostility-friendli- 
ness” was selected as the negative trait to be 
studied, partly for reasons extraneous to this 
study (Murstein, 1956). There are, however, 
good reasons for choosing this dimension when 
working with fraternity members who com- 
prised the subjects in this investigation. First, 
requests for judgments of this trait do not 
arouse undue opposition from the subjects 
because of reluctance to say anything unfavor- 
able about the other members of their group? 

1 I should like to express my thanks to the following 
persons for their aid in the collection of data: Martin 
Dworkin, Allen I. Laskin, Joan Laskin, George Lassen, 
Joseph Marin, Nelly K. Murstein, and Roberto Roth. 
Maureen Bendick helped with the computations. 
Martha Pareis aided greatly in editorial matters relat- 
ing to the preparation of the manuscript. Thanks are 
due to Charles A. Moss for kindly giving of his time 
in drawing the figure. 

2 In a previous “pilot study,” the trait of “hostility” 
had been used. The use of this trait, however, met with 


considerable resistance on the part of the subjects, 
who were reluctant to term any of their fraternity 
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Second, the trait can easily be ranked by sub- 
jects who are not professional psychologists. 

At first glance the assumption that the per- 
sons ranked as least friendly, as in the present 
study, are also perceived as most hostile may 
seem questionable. May not a person be aloof 
and unfriendly and yet not hostile? But the 
manner in which “least friendly” was defined 
by the investigator made it clear to the subjects 
that these persons were to be evaluated as 
hostile. Subjects were told that “most friendly” 
meant “that person who is the most coopera- 
tive, easiest to get along with, and least hostile 
in the fraternity.” Moreover, the magnitude 
of the average reliability for the group judg- 
ments of the 25 groups (.86) indicated that the 
rankers probably employed a unidimensional 
scale. 

In studying possession of the trait of hostil- 
ity-friendliness, the pooled-rank method was 
used. A prime advantage of this method is that 
each person uses the same distribution to 
evaluate the group, avoiding “central tend- 
encies,” “leniency errors,” and “halo effects” 
that reduce discrimination among the mem- 
bers. In the pooled-rank method, if the “forc- 
ing” distorts the data the reliability of judg- 
ments by the group should be low. A fairly 
high reliability indicates that little distortion 
has occurred. A second advantage is that the 
method is easily explainable to the subject, 
requires little time to administer, and thus 
permits the use of large groups. A third advan- 
tage is that the rankings may be easily con- 
verted to normalized scores. The justification 
for this conversion rests on the assumption that 
most personality traits are normally distrib- 
uted, a view championed by Guilford (1954) 
and Calvin and Holtzman (1953), among 
others. Although groups of fraternity students 
cannot be assumed to represent the population 


brothers as hostile. Consequently, the trait deemed at 
the other end of the continuum, “friendliness,” was 
employed, with the assumption that those subjects 
who were the least friendly were the most hostile. 
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at large for which this assumption is most 
justifiable, it seems unlikely that their distribu- 
tion for the trait of hostility-friendliness differs 
sufficiently from that of a normal population 
to offset the statistical gain in working with 
normalized ranks (interval data) rather than 
with the ranks untransformed. 

The purpose of the ranking was to select 
four extreme groups differing both in their 
possession of hostility and in the degree of 
insight into their possession or lack of posses- 
sion of hostility. The hypotheses advanced 
were: 

1. “Hostile” persons are less accurate in 
their perception of the trait of “hostility” in 
others than are “friendly” persons. 

2. “Insightful” persons are more accurate 
in their perception of “hostility” in others than 
are “noninsightful” persons. 


METHOD 


The subjects were drawn from 23 fraternities and 2 
men’s dormitories at the University of Texas. Prior to 
a regularly scheduled “house” meeting, sheets of paper 
were handed out to all members present who had been 
members of the group for at least 3 months. The 
familiarity of each subject with all others in his group 
was thus assured, The names of all the members present 
for the meeting were written down by each member on 
his sheet, The following instructions appeared at the 
top of the subject » Sheet: 

Put a 1 in the small column to the right of the 
name of the person whom you consider to be the 
most friendly of those taking the test with you. By 
“most friendly” is meant that person who is the 
most cooperative, easiest to get along with, and 
least hostile in the fraternity. Put a number 2 next 
to the second most friendly person, and so on, until 
you have a different number for each person currently 
taking the test. Remember to rate yourself and to 
make sure that your name is also at the top of the 
page, Also, do not rate any member who is not taking 
the test with you. 

The ranks were converted to normalized scores, 
using Hull’s table (Guilford, 1954, p. 182). 

From the 536 students participating in the study, 
80 subjects were to be selected who, on the basis of the 
group and self-judgments, could be assigned to one of 
four groups: hostile-insightful, hostile-noninsightful, 
friendly-insightful, friendly-noninsightful. To ensure 
that the subjects chosen for the four experimental 
groups were either very hostile or very friendly, cri- 
terion scores based on consideration of the standard 
error of measurement of each of the group judgments 
were established, such that the 20 most extreme cases 
exceeding these scores for each of the four experimental 
groups were selected. Split-half reliability coefficients 
were obtained by dividing members of each fraternity 
and dormitory group randomly into two halves and 
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correlating the means of the ranking by each half for 
each person in the group, then applying the Spearman- 
Brown correction. The scores for each fraternity and 
dormitory are henceforth referred to as G scores. 

The reliability of the self-scores (S scores) was 
obtained in another manner. Although a complete 
reranking would ideally have been required, time 
requirements of this procedure made it impracticable. 
Therefore, another method was adopted. A previous 
study by Calvin and Holtzman (1953) suggested that 
the reliability of S scores in pooled rankings is at 
least .90. As a check on this finding, a group was se- 
lected at random and requested to make a second set 
of rankings a month later. The correlation between 
initial and second ranking for the 15 members was .89. 
In view of the high reliability despite the passage of 
time, it seems conservative to assume a reliability of .90 
for self-rankings for the total population of subjects. To 
reduce the greater sampling fluctuation of the S score 
as compared to the G score, the standard deviation of 
self-rankings also was computed for the total popula- 
tion of subjects rather than for each individual group. 

The chance of selecting an extreme subject incor- 
rectly was to be no greater than .01, Hence, the cri- 
teria for selection of the hostile-insightful group were 
G > 2.58 (standard error of measurement of G) + 
mean of G, and S > 2.58 (standard error of measure- 
ment of S) + mean of S. For the friendly-insightful 
group a similar formula was used except that now the 
cutting score was below the mean. The criteria were, 
therefore, G < Mg — 2.58 SE, and S < M, — 2.58 SE, 
where SE and M refer to standard error and mean, 
respectively. Similar logic was followed in selecting the 
hostile-noninsightful and friendly-noninsightful groups. 
For the first of these groups, G was > M, + 2.58 SE, 
To insure noninsightfulness, the self-ranking had to be 
sufficiently below the G ranking so that the risk of error 
was tbe or kem, Hence, he GS darepany hd 
to be > 2.58 VSE? + SEP — 2ra SEg SEs + Mte- 
For the friendly-noninsightful group, G was <M, - 
2.58 SE,. The amount by which an individual’s S 
score had to exceed his G score for the individual to be 


noninsightful was > 2.58 / SE + SE? — 2ra Sle SE, 


Mes). 
3 Table 1 summarizes the cutting scores for all four 


oups. 

ie The average reliability of the G scores for the 25 
student groups was .86, the range being from 50 to 
94, The standard deviation (SD) range of G varied 
from .57 to 1.21, while the SEs ranged from .24 to .64, 
‘The reliability of the S scores was estimated to be .90, 
and the SD and SE values computed for the total 
population were 1.79 and .56, respectively. ‘These S 
values were used as constant S criteria for the selection 
of subjects from each of the 25 groups. p 

Since the objectivity of perception of a particular 
group member was measured in terms of the M of the 
group judgments for him, and since each experimental 
subject had contributed to the determination of the 
objectivity score, the G score was recomputed, omitting 
in turn each subject’s judgment score for each individual 
in his group. Then each subject’s judgment for each 
member of his fraternity was compared (without regard 
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TABLE 1 
CRITERIA FOR SELECTION OF Four PERSONALITY GROUPS 


Self-Rankings for Hostility 
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Low 


Hostile-noninsightful 


High 
ma Ř 
Hostile-insightful 
Group High | G > Mg + 2.58 SEg 
S> M, +2.58 SE, 
For 
Hostility | Low | G < Mg — 2.58 SE, 


(S—G) 22.38 VSE + SEP — 2r_sSEgSBy+M(g-n)| S < My — 2.58 SE, 


G > Mg + 2.58 SE, 


(G — S) 2 2.58 V SEP + SEP — PrgoSEgSEa HMen) 


G S Mg — 2.58 SE, 


to sign) to the M judgment for that individual by the 
remaining members to yield a discrepancy score. The M 
of these discrepancies for all individuals in the group 
represented the individual judge’s inaccuracy score. 
The higher the discrepancy the greater the inaccuracy 
score. 


RESULTS 


In correlating G and S scores for each of the 
25 groups, the range was found to be from 
—.33 to +.71. The average r obtained via 
Fisher’s z transformation was .18 (p > .05). 
For the group as a whole, therefore, the sub- 
jects self-perception scores bear little relation- 
ship to the peer judgments. 

To test Hypothesis 1 that “hostile” subjects 
would perceive others more inaccurately than 
would “friendly” subjects, a 2 X 2 factorial 
design was set up for the dimensions “group 
judgments” (hostile or friendly) and “self- 
Judgment” (hostile or friendly). Inspection of 
the plotted scores for each treatment combina- 
tion revealed no gross departure from nor- 
mality of distribution, and Bartlett’s test for 


homogeneity of variance indicated that the. 


four groups did not differ significantly with 
regard to size of variance. The results of the 
analysis of variance are shown in Table 2. 

In accord with the hypothesis, there is a 


TABLE 2 


ANALYSIS OF VARIANCE OF PERCEPTUAL Inaccuracy 
SCORES FOR THE Trarr or HostrurTy 


Source of Variance df | us F 
Between Group Judgments 1 | .327 | 7.27 
Between Self-Judgments 1] .015| .33 
Interaction Group X Self 1 | .101 | 2.24 
Within Groups 76 |. 


Note.—For 1 and 70 df: 7.01 = significant at .01 point, 3.98 = 
significant at .05 point. 


significant difference between the “hostile” 
and “friendly” groups in accuracy of percep- 
tion of hostility in others, with the “hostile” 
group being more inaccurate. Differences in 
accuracy were not significant with respect to 
self-judgments or the interaction of group and 
self-judgments. 

The mean score for each group is shown 
graphically in Figure 1. The differences be- 
tween the groups do not appear very large. It 
must be remembered, however, that in any 
comparison between the groups and single | 
judgments, there is a considerable regression 
on G towards the mean as a result of the num- 
ber of judgments comprising the G score. This 
is not true of the single judgment. It follows, 
therefore, that even for the most accurate | 
perceiver, a considerable disparity will exist 
between the G score and his own judgment. 
The question is then whether the perceptions 
of any of the groups are relatively more objec- 
tive. A 

Each group, therefore, was compared with 
every other group for differences between 
means (Table 3). Both the hostile-insightful i 
and hostile-noninsightful groups were signif- | 
cantly more inaccurate than the friendly- 
insightful group (p < .05 and p < .01, respec- 
tively). A comparison of the insightful group 
(hostile-insightful, friendly-insightful) „versus 
the noninsightful group (hostile-noninsightful, 
friendly-noninsightful) resulted in a signifi- 
cantly greater mean for inaccuracy on the part 
of the noninsightful group (p < .01). Hypoth- 
esis 2 is therefore supported. 7 

The generality of these results might y 
questioned if they could be shown to depend — 
on a few fraternities that contributed large 
numbers of subjects to the four groups. The 
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Fic, 1. Mean perceptual inaccuracy scores for the 
four extreme personality groups. 


fraternity origin and score of each subject was 
therefore examined. Twenty-three of the 25 
fraternity and dormitory groups supplied at 
least one subject. Four groups supplied six 
subjects; 2 groups, five subjects; 1 group, four 
subjects; 10 groups, three subjects; 5 groups, 
two subjects; and 1 group, one subject. Each 
fraternity group was then investigated to 
determine whether the general rank order of 
the means with regard to perceptual inaccu- 
racy, as listed in Figure 1, would be found. 
The mean perceptual inaccuracy score for each 
personality grouping was compared with that 
for each other group that was present within 
the fraternity. A total of 37 comparisons were 


TABLE 3 
t VALUES BETWEEN ALL PERSONALITY GROUPS AND 
BETWEEN INSIGHTFUL AND NONINSIGHTFUL 
Groups FoR ACCURACY OF PERCEPTION OF 


HosTILITY 
Groups t 

Hostile-insightful versus Hostile-nonin- -66 
sightful 

He NE see versus Friendly-insight- 231% 
ul 

Hostile-insightful versus Friendly-nonin- -85 
sightful 

Hostile-noninsightful versus Friendly- 2.97** 
insightful 

Hostile-noninsightful versus Friendly- 1.51 
noninsightful 

Friendly-insightful versus Friendly-non- 1.46 
insightful 

Insightful Group (Hostile-insightful, 3.02** 


Friendly-insightful) versus Noninsight- 
ful Group (Hostile-noninsightful, 
Friendly-noninsightful) 


* Significant beyond .05 point. 
** Significant beyond .01 point. 
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thus obtained from all of the fraternities. Ac- 
cording to chance, 18.5 of these should be in the 
right order according to the ranking of the 
means and 18.5 in the wrong order. Actually, 
27 were in the correct order and only 10 were 
incorrect. The likelihood of this result occurring 
by chance, as tested by x, was less than .01, 
lending considerable support to the generality 
of the findings for the fraternity groups investi- 
gated. 


Discussion 


The data seem to warrant the conclusion 
that, at least for the fraternity men studied in 
the sample, extreme possession of the trait of 
“hostility” significantly distorts the accuracy 
of perception of this trait in others. The evi- 
dence also indicates that insightful persons of 
extreme hostility or friendliness are signifi- 
cantly more objective in their perceptions than 
noninsightful persons. The effect of insight 
itself, however, probably is secondary to the 
absence of the trait of hostility in insuring 
objective perception. This statement is sup- 
ported by the results of Table 3 which show 
that the ¢ values for the hostile-noninsightful 
versus hostile-insightful groups and for the 
friendly-noninsightful versus friendly-insight- 
ful groups did not reach significance at the .05 
level though they suggest a trend indicating 
that insight does facilitate accuracy of percep- 
tion. It is only when both insightful groups are 
combined and compared with the combined 
noninsightful groups that the trend reaches 
significance. On the other hand, the subgroups 
compared for absence or presence of hostility 
with insight held constant yielded a significant 
t value (p <.05) for the hostile-insightful ver- 
sus friendly-insightful group, and a near signifi- 
cant value for the hostile-noninsightful versus 
friendly-noninsightful groups. 

The self-ranking apparently had no effect on 
the perceptual judgments, although the 
method of measuring the self-concept does not 
seem as valid (though it may be as reliable) as 
the measurement of the objective possession 
of the trait. When a large group of individuals 
reliably judge “A” to be 2.5 o above the M for 
“hostility,” and by ranking we eliminate 
“halo” and “leniency” effects, we may have 
some confidence that “A” is hostile at least in 
that behavior observed by his fraternity broth- 
ers. Such is not the case with self-judgments. 

One is less likely to be honest with others 
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and/or with oneself in estimating self-posses- 
sion of “hostility.” Consequently, the self- 
rankings should be heavily influenced by per- 
sonal needs. Some subjects may rank them- 
selves as very hostile according to the code of 
“see no evil in thy fraternity brothers.” Others 
may be very defensive about their possession 
of hostility and rank themselves as being low 
in hostility. The effect of varying “sets” should 
result in a comparatively large variance of self- 
judgments. The evidence bears this out in that 
the SD of S pooled for greater stability for the 
entire population was 1.79, as compared to the 
range of .57 to 1.21 for the SD of G for the 
different fraternities. It is apparent then that 
less confidence exists as to the validity of self- 
judgments as compared to group judgments, 


SUMMARY 


Four groups of 20 college men (hostile- 
insightful, hostile-noninsightful, friendly-in- 
sightful, friendly-noninsightful) were evaluated 
for accuracy of judgments of their fraternity 
brothers, using the pooled-rank method. The 
normalized mean ranking was used as the 
criterion and the mean discrepancy between a 
given individual’s judgments of his fraternity 
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brothers and the value assigned them by the 
rest of the group constituted the perceptual 
inaccuracy score. Results indicated that the 
groups differed significantly in accuracy. The 
rank order of the groups for accuracy of percep- | 
tion from most to least accurate was friendly- 
insightful, friendly-noninsightful, hostile-in- 
sightful, and hostile-noninsightful. Significant 
differences in accuracy of perception were 
found between hostile and friendly persons and 
between insightful and noninsightful ones. No 
significant differences were found between 
individuals who judged themselves to be 
friendly and those judging themselves as 
hostile, nor for the interaction between group 
and self-judgments. 
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following two questions: First, to what 

extent do young children accurately 
discriminate age differences? Second, what is 
the relationship between the child’s capacity 
to discriminate age and his preference and 
esthetic judgments of people varying in age? 
The first question is obviously concerned with 
perception and concept formation, the second 
with affective and attitudinal states in the 
child. 

Consider first the perceptual-cognitive as- 
pects of the problem. There is evidence that 
children at a very early age are capable of 
physiognomic discrimination. The well-known 
crying response to the stranger generally occurs 
well before the child is one year old. By con- 
trast, the development of various categories or 
concepts for identifying or classifying people 
occurs considerably later. It is one of the pur- 
poses of the present study to specify when in 
the life of the child the age concept is acquired, 
as defined operationally by the child’s capacity 
to make accurate age discriminations. 

The second aspect of the study concerns the 
esthetic and social consequences of age dis- 
criminability in young children. Consider first 
the case in which the child has little awareness 
of age differences. Since such children are 
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nevertheless able to distinguish between 
people, they can be expected to find certain 
people more attractive than others, and per- 
haps, as a consequence, like these people more 
than the others. It would be most unexpected, 
however, to find any basis in age for such 
differential preference and esthetic judgments. 
A systematic bias (positive or negative) with 
regard to a particular class of people seems 
unlikely before the child has an adequate con- 
cept of the class in question. 

Consider next those children who can dis- 
criminate age differences. When this stage of 
cognitive development is reached, children 
might well begin to exhibit certain preferential 
tendencies based on the age of people encoun- 
tered. If a bias against old people is manifested, 
the present study might have something to say 
about the origins of stereotypes often at- 
tributed to old people in American society 
(Tuckman & Lorge, 1953). 

The present investigation is evidently 
analogous in many respects to the recent work 
on the development of racial attitudes in 
children. Studies by Goodman (1952) and 
Landreth and Johnson (1953) have shown that 
accurate perceptual discrimination of Negroes 
and whites by young children is often accom- 
panied by @ decided tendency to prefer whites 
to Negroes. It is our impression that the em- 
phasis on the racial problem has been accom- 
panied by a corresponding neglect of other 
important attributes of people in studies of 
children’s social perceptions. We consider age 
to be one such important attribute and, hence, 
have undertaken to explore its impact on 
children in a photographic judgment context. 
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METHOD 

Subjects 

A total of 102 children, 53 boys and 49 girls ranging 
in age from 4 to 6, served as the subjects of the study. 
For the purposes of the present study, subjects aged 
40 through 4-12 were considered 4-year-olds. Corres- 
sponding age categorizations were employed for 5- and 
6-year-olds. Subjects were obtained from summer day 
camps and private schools in the Boston area,’ The 
fathers of the large majority of the subjects were in 
professional and managerial occupations. Hence, the 
Present sample can be designated as predominantly 
middle to upper middle class. 


Procedure 


The stimulus materials consisted of two sets of 
passport photographs chosen from a pool of more than 
100 photographs on the basis of similarity in emotional 
expressiveness and general “pleasantness” of appear- 
ance. One set contained photographs of nine females 
for whom mean age estimates made by a group of 20 
adult judges ranged from 9.4 to 75.7 years. The other 
set consisted of photographs of nine males whose mean 
ages, as estimated by the same adult judges, ranged from 
11.2 to 76.1 years. Age distances between photographs 
were of approximately equal magnitude in both sets. 
Agreement between the adult judges’ age estimations 
was very high. Kendall’s coefficient of concordance 
applied to the judges’ age rankings of the two photo- 
graph sets yielded Ws of .99 and .98 for male and female 
photographs, respectively. 

Subjects were seen individually by a female experi- 
menter. After the experimenter established some rap- 
port with the subject, she would spread one of the pic- 
ture sets before him (her). The child was then told to 
look the pictures over carefully and to separate them 
into two piles—those liked and those not liked. Within 
each pile the subject chose the picture liked best, next 
best, and so on, until each of the nine pictures had been 
assigned a rank. After this preference ranking was 
completed, the pictures were reshuffled, and the same 
procedure was repeated for the criterion of attractive- 
ness—which is the best looking or the prettiest, the 
next best, and so on? Finally the subjects ranked the 
pictures on the basis of age—which do you think is 
oldest, next oldest, and so on? 

The second set of photographs was then spread be- 
fore the child, and preference, attractiveness, and age 
rankings again obtained. The particular order of the 
three questions—affective judgments preceding age 
judgments—was intended to insure that most subjects 
could cope with the task at its inception. If age judg- 
ments preceded those of preference and attractiveness, 
subjects unfamiliar with the age concept might well 
Jose interest. 


SS See ee 

* We wish to thank the following persons for making 
subjects available: Herbert J. Callahan of Mt. Ida 
Day Camp; Louise Keller of Page School, Wellesley 
College; Judith A. Schoellkopf of the Harvard Preschool; 
Decker Schwannecke of Dorchester House; Mary V. 
Wright of the Lesley-Ellis School. 
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Half of the subjects received the set of male photo- 
graphs first; the other half first responded to the female 
set. Upon the subject’s completion of the task, a brief 


interview was conducted to provide additional in. | 


formation about the subject’s awareness of signs of age 
and feelings about age differences, The following 
questions were asked: How do you know when some- 
one is old? How do you know when someone is young? 
Whom do you know that you think is old? Are your 
pamat young or old? Are your grandparents young 
or old 


Scoring categories were derived for each question, 
Subjects’ responses were coded independently by 
two judges with agreements ranging from 91 to 96% for 
the five questions. Discrepancies were resolved by a 
third judge. 


REsuLtTs 


Since the basic data of the study are in the 
form of ranks, Spearman’s rho was used to 
determine the magnitude of the relations be- 
tween the variables. The mean age ranks of the 
20 adult judges constituted the standard 
against which the children’s age, preference, 
and attractiveness judgments were matched. 
The median correlations are shown separately 
for male and female subjects in Figures 1 and 
2. For each figure, the results have been plotted 
separately for the male and female photograph 
sets. Table 1 presents the quartile deviations 
associated with the median values. 


Age Discriminability 
The four age discriminability curves (the 
solid lines of Figures 1 and 2) are distinguished, 


in general, by a substantial increase in dis- 


criminability from ages 4 to 5, with little 
further increase noted from ages 5 to 6. All of 
the median values are of moderate to high 
magnitude. Note, further, that the quartile 
deviations shown in Table 1 for subjects’ age 
rankings are inversely related to subjects’ age. 
The relatively large quartile deviations for 4 
year-olds in relation to the moderately sized 
medians for that group indicate that some of 
the 4-year-olds are capable of performing the 
task as well as their age seniors while others in 
this age group are incapable of accurate age 
iminations. 

By the fifth year, the large majority of the 
present sample of children closely approximate 
adult performance in making relative age dis- 
criminations. Median values are quite high, 
and the quartile deviations (Table 1) are gen- 
erally of low magnitude. With the 5-year-olds 
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Fic. 1. Median correlations (Spearman rho’s) between adult age rankings and male children’s age, preference 


and attractiveness rankings. 


proving so very capable at the task, any sub- 
stantial further increase in age discriminability 
at the 6-year-old level is precluded. For the 
Present sample of children, then, 5 is the age at 
which ordinal age judgments begin to conform 
to the adult criterion. This does not imply, of 
course, that 5- and 6-year-old children approxi- 
mate adult performance in making absolute 
chronological age estimates. 

„Regarding sex differences in the capacity to 
discriminate age, the results indicate a slight 
superiority for the girls at the 4-year-old level 
for both sets of photos. These sex differences, 
however, do not achieve statistical significance 
(Mann-Whitney U test). 

Differences in age judgments between photo 
sets are shown in Table 2. These were obtained 
by subtracting each subject’s correlation for 
male photos from his (her) correlation for 
female photos. Since the distribution of these 
differences did not manifest any systematic 
positive or negative directional bias, the sign 
of the difference was ignored in obtaining 
median and Q values. Table 2 shows that the 
4-year-old female subjects yield the largest 
median and Q values. On the whole, however, 


these values are quite small given the theoreti- 
cally possible magnitude of intrasubject dis- 
crepancies. Apparently, age cues are of approxi- 
mately equal salience for male and female 
stimulus faces. From another perspective, these 
results may be considered indirect evidence of 
the high reliability of age judgments. 


Relation of Age Judgments to Preference and 
Attractiveness Judgments 

To determine whether a child’s preference 
and attractiveness judgments were biased 
toward the younger or older stimulus figures, 
the child’s rankings were correlated with the 
mean adult age rankings. This procedure was 
selected over the alternative of correlating the 
preference and attractiveness judgments with 
the child’s own age rankings, for the meaning 
of the latter correlations would depend upon 
the degree to which the child accurately dis- 
criminated age. The adult standard, on the 
other hand, serves as an appropriate common 
baseline for all subjects. 

‘The median rho’s for preference and attrac- 
tiveness rankings are shown by age of child in 
the dashed and dotted lines, respectively, of 
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Median Correlation (Rho) with Adult Standard 
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Fic. 2. Median correlations (Spearman rho’s) between adult age rankings and female children’s age, preference, 
rankings. 


and attractiveness 


Figures 1 and 2. A positive correlation indicates 
a stronger preference for and more attractive- 
hess attributed to the younger stimulus person. 
It will be noted that the preference and attrac- 
tiveness curves are reasonably close to one 
another for both sex of the subject and photo- 
graph, and, to varying degrees, below the age 
discriminability curves. These data strongly 
suggest, then, that an age biased affective 
response to the photographs does not precede 
the stage of cognitive development required for 
accurate age discriminations. 

Variability (quartile deviations) of the in- 
dividual preference and attractiveness correla- 
tions are reported in Table 1. In contrast to the 
values for age reported in the same table, the 
variability of the preference and attractiveness 
tho’s, in general, shows a less steep decline with 


(W=17) 
Age of Child (Female) 
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(N=16)  (N=16) (N=17) 


increasing age of the subject. There is con- 
siderably more agreement, then, regarding who 
is younger and older than there is with respect 
to judgments of liking and attractiveness of the 
same photographs. This finding is hardly sur- 
prising, of course, for while judgments of agè | 
are quite stimulus-bound, preference ba 
attractiveness judgments involve attitudina 
and personality factors to a considerably 
greater extent. ? d 
Returning to Figures 1 and 2, it will be note 
that affective responses to the female photo- 
graph set reveal, for both male and female 
subjects, an increasing bias in favor of the 
younger stimulus figures over the 4- to or 
age span. (The preference curve for fema e 
subjects reaches its peak at age 5.) Subject se* 
differences are more pronounced for the male 
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TABLE 1 
QUARTILE DEVIATIONS OF DISTRIBUTIONS OF 
CoRRELATIONS BETWEEN THE SUBJECTS’ 
RANKINGS AND ADULT STANDARD 


Female Photos Male Photos 
Age and Sex 
ji N At- At- 
peSHbect Age | Pret- | trac- | Age | Pref- | trac- 
8e | erence| tive- | “S° |erence| tive- 
ness ness 


X .34 | .30 | .24 | .31 | .35 
16 | .07 | .18 | .19 | .06 | .34 | .25 
17 | .05 | .17 | .17 | .07 | .30| .30 


TABLE 2 
MEDIANS AND QUARTILE DEVIATIONS FOR ABSOLUTE 
DISCREPANCIES BETWEEN MALE AND FEMALE 
PHOTOS IN CORRELATIONS OF THE SUBJECTS’ 
AGE RANKINGS WITH ADULT AGE RANKINGS 


Aana Median o 
Male 4 08 ll 
5 10 08 

6 04 -08 

Female 4 .19 .21 
5 .04 07 

6 10 06 


photograph set. The difference is largely one 
of curve slope, female subjects showing a sub- 
stantial increase from age 4 to 6 in their prefer- 
ence for the younger stimulus persons, while 
males manifest a less marked increase in prefer- 
ence, and, indeed, show a slight decline at the 
6-year-old level. Note further that the attrac- 
tiveness curves deviate more from the prefer- 
ence curves for the male as opposed to the 
female photos. This effect very probably re- 
flects the child’s increasing awareness of cul- 
tural values which link appearance and desira- 
bility more strongly for females than for males. 

Consider next whether the subjects manifest 
consistent preference or attractiveness biases 
for male or female photos. To provide an an- 
swer to this question, each subject’s preference 
and attractiveness correlations for male photos 
were subtracted from his (her) corresponding 
correlations for female photos. Thus, a positive 
discrepancy indicated the subject’s preference 
and attractiveness judgments were more 
strongly related to the adult age baseline for 
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the female photos, while a negative discrepancy 
indicated that these relationships were stronger 
for the male photos. In every case (4-, 5-, and 
6-year-old male and female subjects on prefer- 
ence and attractiveness—12 comparisons in 
all), the sum of the ranked positive discrep- 
ancies exceeded the sum of the ranked negative 
discrepancies. In other words, preference and 
attractiveness judgments were uniformly more 
likely to favor the younger female stimulus 
faces than the corresponding male faces. 

A final question asked of the photograph 
judgment data was the extent to which the 
capacity to discriminate age influenced prefer- 
ence and attractiveness judgments of stimulus 
persons varying in age. To answer this ques- 
tion, the sample was arbitrarily divided into 
discriminators and nondiscriminators using as 
the cutting point a rho between the subject’s 
age and adult age rankings significant at the .05 
level or beyond (a rho of .59, one-tailed, for 
n = 9 stimuli). Given this criterion for assign- 
ment, 81 subjects (79.5% of the sample) were 
classified as discriminators of the male photos, 
and 78 subjects (76.5% of the sample) were so 
classified for the set of female photographs. 

Statistical significance at the .05 level was 
also chosen as the cutting point for an age bias 
in preference and attractiveness judgments. 
In the present case, however, a two-tailed 
criterion was employed, since liking may be 
associated with either end of the age con- 
tinuum. Accordingly, those subjects yielding 
Spearman rho’s exceeding -.68 were con- 
sidered to be manifesting age-biased preference 
and attractiveness judgments. 

The relationships between age discrimina- 


TABLE 3 
CENTAGE OF THE SUBJECTS FAVORING YOUNGER 
Tix OLDER Srncoxus Persons (Significant, Positive, 
and Negative rho’s for Preference and Attractiveness 


with Adult Age) BY ee TO DISCRIMINATE 
GE 


Attractiveness 


Preference 


are IDEELE 
Yes (N = 18) 
No (N = 24) 
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bility, on the one hand, and preference and 
attractiveness, on the other, are shown in Table 
3. For both sets of photographs, a very small 
percentage of the nondiscriminating subjects 
shows an affective bias in favor of either 
younger or older faces. On the other hand, the 
subjects who accurately discriminate age are 
much more favorably disposed toward the 
younger stimulus persons. Of those subjects 
who do not favor the youngest faces in their 
preference and attractiveness judgments, only 
a small percentage are inclined in favor of the 
oldest faces in the photograph sets. This finding 
suggests that the remaining subjects may show 
a systematic bias toward faces in the middle- 
age range, or what is more likely, the judg- 
ments of these subjects may be based on at- 
tributes of the stimulus pictures other than age. 


Responses to Interview Questions 

Are there any relationships between the 
subject’s responses to the interview questions 
and his (her) capacity to discriminate age? 
Before examining this problem, further con- 
sideration must be given to the division of the 
sample in terms of age discriminability. The 
distinction between discriminators and non- 
discriminators shown in Table 3 was reported 
separately for the male and female photograph 
sets. If we wish to characterize a subject in 
terms of the accuracy of his age judgments on 
both photo sets, the sample must be divided in 
a different way. In brief, a total of 72 subjects 
satisfied the age discriminability criterion 
(tho = .59) on both photo sets. Of the remain- 
ing 30 subjects, 15 subjects met this criterion 
on either the male or female photo set, and 15 
subjects on neither set. In the presentation of 
the interview results, the distinction between 
the latter two groups of subjects will be main- 
tained. The subjects who accurately discrimi- 
nated age on only one set of pictures will be 
designated as “partial discriminators.” Those 
subjects who fell short of the criterion on both 
photo sets will be called “nondiscriminators.” 
At first blush, it might appear that the neces- 
sity for the above distinction is inconsistent 
with the data reported in Table 2. The use of 
any arbitrary criterion to divide a distribution, 
however, often results in some scores relatively 
close to one another being relegated to different 
classes. 

We turn now to the interview data. The 
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TABLE 4 
Responses (in Percentages*) or DISCRIMINATORS, | 
PARTIAL DISCRIMINATORS, AND NONDISCRIMINATORS 
TO THE Question: “How Do You Know WHEN 
Someone Is OLD?” 


Discrimi- Partial Nondls- 
Response Category nators pardon ue 
(N = 72) (N = 15) (N = 48) 
A. Old-age-related phys- 62.5 20.0 0.0 
jognomic charac- 
teristics (e.g., 
wrinkles) 
B. Other old-age-related 33.3 6.7 -S 
characteristics (e.g., 
walk slow) 
C. Bodily size (e.g., big, 18.1 40.0 40.0 
large, tall) 
D. Other physical char- 16.7 33.3 6.7 
acteristics unre- 
lated to old age 
(e.g., wear glasses, 
use make-up) 
E. Response associated 2.8 13.3 0.0 
with death 
F. Undesirable person- 4.2 0.0 6.1 
ality qualities and/ 
or behavior 
G. Unable to specify 9.7 20.0 46.6 


* The total percentage in each column exceeds 100.0 since the 
total number of responses exceeded the number of subjects. There 
were also slight variations in the over-all number of characteristics 
cited in each group. Statistical tests were not attempted for thes 
reasons, 


subjects’ answers to the question—“How do 
you know when someone is old?”—have been 
categorized in Table 4 for the subjects who 
meet, partially meet, and do not meet out 
criterion of age discrimination. Not surpris- 
ingly, the discriminators exceed the partial and a 
nondiscriminators in reference to physiognomie i 
and other old-age-related characteristics. The 
partial and nondiscriminators, on the other 
hand, are more likely to mention body size of 
in the case of the partial discriminators, other 
physical characteristics unrelated to old age. | 
Note, finally, that approximately one-half of 
the nondiscriminators are unable to specify any 
basis for judging age. 

Responses‘ to the question “How do you 
know when someone is young?” were consistent 

‘Four tables giving the response categories and 
response percentages for discriminators, partial dis- 
ctiminators, and nondiscriminators on Interview 
Questions 2, 3, 4, and 5 have been deposited with E 
American Documentation Institute. Order Documeni 
No. 6746 from ADI Auxiliary Publications Project, 
Photoduplication Service, Library of Congress; Wash- 
ington 25, D.C., remitting in advance $1.25 for micro- 
film or $1.25 for photocopies. Make checks payable to! : 
Chief, Photoduplication Service, Library of Congress- 
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with these results, discriminators most fre- 
quently mentioning the absence of the phys- 
iognomic qualities attributed to the “old” in 
the previous question, partial and nondis- 
criminators referring primarily to bodily size or 
indicating that they do not know. Differences, 
however, were not as marked in the present 
case, for 28% of the discriminators, as opposed 
to 40% of partial and nondiscriminators, cite 
bodily size with reference to someone being 
“young.” The concept “young,” for some 
subjects, seems to connote a distinction be- 
tween children and adults. In such cases, 
reliance upon physiognomic cues alone would 
obviously not be required for accurate dis- 
crimination between those “young” and “not 
young.” 

Consider next the question: “Whom do you 
know that you think is old?” Grandparents or 
persons of that generation were mentioned by 
62.5, 33.3, and 13.3% of the subjects in the 
discrimination, partial discrimination, and 
nondiscrimination groups, respectively. Fur- 
ther, 46.7% of the partial and nondiscrimi- 
nators were unable to specify anyone thought 
to be “old,” as opposed to 18.1% for the dis- 
criminators. Of the various people who occupy 
the world of the young child, it is the grand- 
parents who are likely to be the oldest. Hence, 
the between-group differences observed in the 
use of the “grandparent” category provide 
further validating evidence for the photo- 
graphic judgment findings. 
if Finally, when asked the direct questions— 

Are your parents young or old?” and “Are 
your grandparents young or old?”—subjects 
generally stated that parents are “young” and 
grandparents are “old.” There were no sub- 
stantial differences between groups in this 
regard, though such responses were most fre- 
quent for the age discriminators. Forty percent 
of the nondiscriminating subjects considered 
one parent “young,” the other “old,” but few 
subjects in the other two groups made this 
distinction. It appears most unlikely that the 
chronological age discrepancies between the 
parents of nondiscriminators substantially 
exceeded those of parents of the subjects in 
the other two groups. Accordingly, these re- 
sults point to the confusion of the nondis- 
ctiminating subjects regarding the age dimen- 
sion and suggest that many of these subjects 
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make age distinctions on highly irrelevant 
grounds. 

A few of the discriminators and partial dis- 
criminators (maximum of 15.3%) spontane- 
ously categorized one or more of the parents 
or grandparents as “in between young and 
old.” None of the nondiscriminators em- 
ployed the above categories. It is entirely pos- 
sible, of course, that the dichotomous quality 
of the two questions discouraged more fre- 
quent use of such a “middle-aged” response. 

As for the partial discriminators, they re- 
semble the discriminators on some of the cate- 
gories, and the nondiscriminators on others. 
On still other categories, the partial dis- 
criminators manifest response frequencies 
about midway between those of the other 
two groups. There appears, then, to be a 
transitional phase through which some, if 
not all, children pass on their way to accurate 
age discriminability. At this stage, subjects 
seem to have an incipient awareness of age- 
relevant physiognomic cues, but have not as 
yet defined the age concept so as to disregard 
irrelevant stimulus information. 


Discussion AND CONCLUSIONS 


The results of the present study indicate 
quite clearly that children as young as 4 years 
of age are capable of fairly accurate age dis- 
criminations solely on the basis of physiog- 
nomic information. Indeed, since a substantial 
proportion of the 4-year-old group was equal 
to the task, it is entirely possible that a small 
percentage of children in the late 3-year-old 
range would satisfy the discrimination cri- 
terion employed in the present experiment. It 
should be noted, however, that the present 
sample of children overrepresents the higher 
Jevels of socioeconomic status, and, very pos- 
sibly, of intelligence, and the absolute level of 
age discriminability observed in the present 
sample may therefore well exceed the level 
that would be found in a truly representative » 
sample of 4- to 6-year-old children. 

Forty percent of the children who did not 
make accurate age discriminations on one or 
both photo sets, when asked in the interview 
“How do you know when someone 18 old?’ 
responded in terms of the relative sizes of 

Je. This finding can probably be traced 
to the fact that the child’s first exposure to 
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the problem of aging and age differences cen- 
ters around birthdays and the parents’ em- 
phasis on the child’s being 2 or 3 years old, 
for example, accompanied by the assertion 
that the child is “getting to be a big boy or 
big girl now.” Awareness of size differences 
occurs very early in the child’s perceptual de- 
velopment, and, hence, age differences prob- 
ably become linked to size in the eyes of 
many preschool children. There may, thus, 
be a stage of development in which children 
might be able to distinguish other children 
from adults without being able to discriminate 
adequately within the adult range. Children 
at this particular stage would be seriously 
handicapped, of course, in judging age from 
photographic stimuli. In time, the child can 
be expected to make finer differentiations 
within the adult category. When this stage is 
reached—by the fifth year for most of the 
children in the present sample—the child’s 
perception is accurate in the sense that his 
categories of youth, middle, and old age match 
physiognomic and other cues associated with 
those particular age groupings. 

It will be recalled that for the large majority 
of the nondiscriminating subjects, preference 
and attractiveness judgments were not system- 
atically related to the ages of the stimulus 
figures. Of those subjects discriminating age 
accurately, on the other hand, approximately 
one-half manifest a clear preference for the 
youngest faces in the photograph sets. Corre- 
spondingly, the oldest stimulus persons are 
liked least. Attractiveness judgments tend to 
parallel preference judgments, suggesting that 
these are aspects of a global affective evalua- 
tion of the photographs. 

While sex of the subject and of the stimulus 
person were of minor consequence in judg- 
ments of age, preference and attractiveness 
judgments, by contrast, were influenced to 
some extent by both sex of the subject and of 
the photograph. When comparing each sub- 
ject’s affective judgments of the male and 
female photos, it was found that judgments of 
the female photos were more directly age- 
linked, with the younger female faces viewed 
more positively. It is possible that a major 
determinant is the differential experience with 
adult males and females on the part of pre- 
school children. Nursery schools and kinder- 
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gartens almost exclusively employ female 
teachers, who quite naturally will vary in age. 
Whether older teachers are likely to be more 
“disciplinarian” in their educational practicesë 
or simply are more remote in age from the 
subjects’ mothers, the observed effect seems 
to be one of increasing negative affect with in- 
creasing age of the female stimulus person. 
This explanation cannot, of course, account 
for the differential steepness of the male and 
female preference and attractiveness curves 
for the male photos. It will be recalled that 
the female subjects exhibited a sharp con- 
sistent increase with age in their positive affec- 
tive response to the younger male photos. The 
curves for male subjects, on the other hand, 
showed a much more gradual increase in favor 
of the younger male stimulus faces, and, in- 
deed, a slight decline between ages 5 and 6. 
We are uncertain as to how to interpret these 
data. One possible explanation would stress 
the growing awareness in the young female of 
emotional attachments to the opposite sex, 
an awareness constantly reinforced by the 
mass media. From this standpoint the young- 
est male photos would show an increase in 
the extent to which they are endowed with 
positive affect by female subjects. For male 
children, on the other hand, identification 
processes would naturally assume priority over 
object choice when male photos are the stim- 
uli. Such a conceptualization would lead to the 
expectation that male subjects would prefer 
pictures of young and middle-aged adults to 
those of children and adolescents. Such a pref- 
erence pattern would tend to reduce correla- 
tions between age and affective judgments. 
This interpretation does not fully account, 
however, for the unusually low median correla- 
tions for affective judgments of the younger fe- 
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male subjects on the male photos. It is our opin- 


ion, on the basis of the subjects’ spontaneous 
comments, that the response to the youngest 
member of the stimulus series is the critic 

factor in the present case. In the female photo 
set, the youngest face was considered a poten- 
tial playmate by most of the youngest male 
and female subjects. The youngest face in the 


male stimulus series, on the other hand, met ; 


5 In a subsequent questionnaire study of 34 children 
aged 4 to 6, 25 (73.5%) expressed a preference for & 
“young” rather than an “old” teacher. 


‘hate 
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with a more equivocal reaction from the 
4-year-old males, and with outright rejection 
from many of the youngest females. Possibly, 
the differential response is based on the rela- 
tive ages of the two stimulus persons. It will 
be recalled that the adult judges, on the aver- 
age, estimated the youngest male photos about 
2 years older than the corresponding youngest 
female photos. The youngest male may, thus, 
have fallen outside the category of playmate. 
For the youngest female subjects, in addition, 
the male photo in question probably called 
forth an undesirable image of “aggressive mas- 
culine play.” As we have seen, such an image, 
if present, is not maintained, for the 5- and 
6-year-old female subjects. 

In conclusion, we should like to consider 
some of the more general social forces that 
might contribute to the emergence in young 


€ children of negative sentiments toward old 


y 


people. First, however, it should be recalled 
that the study employed ranking as its basic 
procedure. While ranked judgments have the 
decided merit of comprehensibility for young 
children, they do have the disadvantage of 
failing to provide any absolute yardstick in 
regard to affective responses to the photo- 
graphs. Low rankings may simply be reflec- 
tions of indifference or, in the rare case, of 
mild positive feelings. 

Considerations such as these are especially 
relevant if one resorts to a “role” interpreta- 
tion of the results. Thus, if the youngest stim- 
ulus persons are perceived relative to their 
role as playmates, a highly valued role for 4- 
to 6-year-old children, the youngest figures 
can be expected to receive the most positive 
endorsements. Faces in the young adult and 
middle age range may be perceived as fulfilling 
parental and teacher roles, both not nearly so 
pervasively positive as the playmate role. 
Faces in this age range, then, can be expected 
to receive a lower ranking. Finally, there are 
the oldest stimulus persons in the photograph 
series, for whom a clear role is entirely lacking- 
These faces perhaps tend to receive the least 
positive affective judgments by default. 

But how about the grandparents of these 
children? The interview data indicated that 
grandparents are generally identifed as 
“old.” Insofar as grandparents constitute an 
indulgent rewarding figure for children, one 
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might expect some generalization to the oldest 
faces, and, hence, more favorable judgments of 
old people. It is evident, however, that the 
relation between grandparents and the young 
child in our society is a peripheral one, rela- 
tive to the frequent intensive contact with 
parents and peers. Nevertheless, a positive 
emotional relationship with one or more 
grandparents might serve to mitigate negative 
feelings toward old people, and, in the present 
study, might partially account for the fact 
that a substantial number of discriminating 
subjects did not assign their lowest preference 
ranks to the oldest stimulus figures. 

Some consideration should be given to fac- 
tors other than “role” that might contribute 
to a better understanding of the findings. Some 
children, for example, may develop negative 
attitudes toward old people as a consequence 
of the explicit or implicit transmission of 
stereotypes (Tuckman & Lorge, 1953) by 
parents or other adults. Such transmission 
may take the form of stories read to children. 
Elderly characters in classic children’s tales 
are often cast in a stereotyped role, or are 
unsympathetically portrayed. 

Anthony (1940) noted that 4- and 5-year- 
old children sometimes explicitly link death 
and old age. In the present study, only a few 
children spontaneously associated death and 
old age in the interview (Table 4). It is con- 
ceivable, however, that a large number of 
children were influenced by this connection, 
but failed to mention it in the absence of a 
direct question. While an association of this 
kind may apply to a limited number of chil- 
dren, it would serve to bias the affective, Te- 
sponse to the older faces in a negative direction. 

Note, in conclusion, that only about half 
of the older children in the sample gave the 
oldest stimulus persons the lowest preference 
and attractiveness rankings. Why, then, did 
the other half not do so? We have suggested 
that a positive relationship with one or more 
grandparents might exert an influence. Are 
there any other special circumstances likely to 
mitigate the development of negative feelings 
in these subjects? Or, do children simply vary 
in the length of time intervening between 
age discrimination learning and the develop- 
ment of age-related affective predispositions? 
Clearly, these questions can be answered only 
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in terms of extending the present research to 
older children and inquiring further into fac- 
tors responsible for individual differences in 
children’s perception and evaluation of the age 
dimension. 


SUMMARY 


A sample of 102 children (53 males and 49 
females) ranging in age from 4 to 6 was studied 
with regard to capacity to discriminate age 
differences and affective responses to stimulus 
persons varying in age. Each subject was 
presented with two sets of stimulus faces of 
heterogeneous ages with instructions to rank 
the photographs in terms of age, preference, 
and attractiveness, The stimuli, a male and 
female set each containing nine photos, had 
been ranked for age by a group of adult judges. 
This ranking served as a baseline against 
which the children’s rankings were compared. 

Progressive increases in age discriminability 
were noted from ages 4 to 5, setting a ceiling 
that permitted little further increase from 5 to 
6. Affective judgments with increasing age 
were in the direction of a more positive re- 
sponse to the younger faces. Preference and 
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attractiveness judgments were influenced to i 
some extent by the sex of the subject and of 
the photograph set. 

When the sample was divided into age dis- 
criminators and nondiscriminators, it was 
found that consistent preference and attrac- 
tiveness judgments in favor of the younger 
faces were found only in the former subgroup. 
Evidence is reported indicating that partial 
and nondiscriminators are more likely to 
associate age with body size, while discrimi- 
nators tend to use physiognomic cues. 

Some possible interpretations of the findings 
are presented in the body of the paper. 
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examine wherein breadth of categorizing 

reflects the manner in which people deal 
with the risk of errors of judgment—specifi- 
cally, the risk of saying that things are similar 
when they might be different, or that they are 
different when they might be similar. 

Breadth of category—or, as it has sometimes 
been called, equivalence range—refers to the 
range of stimuli that are placed in the same 
class or category and share a common label. 
The criteria for placing objects in a common 
category may be many or few, explicit or 
implicit. In psychophysical experiments using 
unidimensional continua, such as length, 
breadth of category defines the limits beyond 
which stimuli are no longer classified as X, or 
“Jong,” but either as non-X, “not long,” or as 
Y, “short.” In more complex arrays of stimuli, 
there is no such unidimensional simplicity, and 
the criteria for classification are often combined 
relationally or disjunctively in the process of 
deciding whether something does or does not 
belong in a class—as, for example, in classifying 
men as “competent” or “eligible to vote.” 

In recent years, individual differences in 
categorizing, particularly in breadth of cat- 
egorizing, have been thought to reflect im- 
portant general differences in cognitive func- 
tioning. Four trends of research on this problem 
can be distinguished: the investigation of 
changes in breadth of category as a function of 
changes in categorizing conditions, the study of 
individual consistencies in breadth of cat- 
egorizing in different judging situations, the 
search for relationships between various emo- 
tional and personality factors and the indi- 
vidual’s general preference for broad or narrow 
classifications, and the exploration of relation- 
ships between breadth of categorizing and 
various forms of abnormal mental functioning. 

A word about each of these trends as they 
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relate to the present study. It is apparent that 
when one alters the conditions of decision 
breadth of category changes. Thus, a sentry, 
deciding whether approaching figures in the 
twilight are friends or foes will have a broader 
inclusion category of “foes” than will an 
inspector judging machine parts as defective or 
not. For the consequences obviously vary in 
the two instances and one can control the 
severity of these consequences by altering 
breadth of category (see Bruner, Goodnow, & 
Austin, 1956). In general, it can be said that 
when the perceived consequences of over- 
inclusion are more severe than the conse- 
quences of overexclusion, inclusion categories 
narrow. When overexclusion leads to more 
severe consequences, inclusion categories 
broaden, 

We mention this line of research for two 
reasons. In the first place, it underlines the 
risk-regulating character of categorizing. But 
beyond that, it points to the possible meaning 
of consistencies in categorizing breadth that 
appear to be associated with different per- 
sonality characteristics—the subject matter of 
the last three of the four lines of inquiry 
mentioned above. Thus, Gardner (1953), 
Hamilton (1957), Klein (e.g, 1958), and 
others remark that obsessive, anxious doubters 
tend to prefer narrow inclusion categories in 
their sensory judgments. Perhaps they have a 
tendency to minimize risk of error by the nay- 
saying route, preferring the consequences of 
error that come from avoiding contact with 
threatening objects. It has been shown, too, 
that the ambivalent, ethnocentric personalities 
studied by Frenkel-Brunswik (1949) and 
Rokeach (1951) show a tendency to narrow 
categorizing. Yet, on the other hand, Arnhoff 
(1956) has noted among ethnocentrics a 
tendency to overgeneralization—which is 
indeed what ethnocentrism implies. Inter- 
estingly enough, Arnhoff (1957) obtained 
completely negative results in an attempt to 
replicate his study. This negative replication, 
while one must be cautious in interpreting it, 
may be revealing in the sense that there may 
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be two opposing tendencies operative in the 
ethnocentric personality: a tendency toward 
overgeneralization under certain conditions, 
and toward narrow categorizing under others. 
Zaslow (1950) suggests, and perhaps his sug- 
gestion is relevant in this connection, that 
normal subjects, in contrast to schizophrenics, 
tend to use middling wide categories, whereas 
schizophrenics veer either toward overnar- 
rowness or overgeneralization. 

In any case, it seems patent that experiments 
seeking to relate categorizing breadth either 
to conditions of judgment or to personality 
cannot get very far unless and until we under- 
stand much more about the intimate texture of 
the act of categorizing itself. This is not to say 
that we do not recognize the degree to which 
categorizing is affected by these two sets of 
determinants. Rather, we feel that an examina- 
tion of the underlying judgmental processes 
involved in categorizing is n for a 
fuller understanding of the role played by 
motivational and personality variables. 

We turn now to the series of experiments 
with which the present paper is concerned. Our 
central interest is in the specific behavior of 
broad and narrow categorizers in a highly 
simplified judging task. Two questions concern 
us. The first has to do with consistency of 
preference for broad and narrow categorizing 
in a stable stimulus situation. But consistency 
is not as simple as it might at first appear. 
There is an equally important question con- 
cerning reactions to a changing stimulus 
environment. Is there some consistent manner 
in which broad and narrow categorizers alter 
their judgments in the face of changes in the 
stimulus situations with which they must 
cope? 

With respect to the second question, some 
predictions can be made. If we assume that a 
narrow categorizer is one who is compelled to 
attend to differences in stimuli, whereas a 
broad categorizer tends to overlook differences, 
then it should follow that narrow categorizers 
will be more likely to alter the breadth of their 
categorizing to conform to changes in the 
stimulus world. Thus, narrow categorizers 
should show more susceptibility to anchoring 
effects in psychophysical judgment than do 
broad categorizers by virtue of being more 
sensitive or reactive to change and contrast. 
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Indeed, we would say, it is precisely this 
sensitivity to change that constitutes the 
hedge against risk that characterizes the nar- 
row categorizer. For to the narrow categorizer, 
differences and change are highly relevant, 
and, as Tajfel shows (1959), these relevent 
differences tend to be accentuated. And if this 
is the case, not only should the narrow cate- 
gorizer alter his breadth of category more 
readily in response to stimulus changes, but he 
should also adapt more quickly when the 
change persists and becomes a new steady state 
of stimulation. 

There is another way of stating our predic- 
tion, one taken from the language of anchoring 
experiments. When a subject has been judging 
a range of stimuli in terms of categories of 
absolute magnitude, the introduction of new 
stimuli larger than the range has either a 
“contrast” or an “assimilation” effect. A 
contrast effect is responding strongly to the 
newly-introduced stimulus—say a stimulus 
larger than any previously encountered—by 
judging previous stimuli as contrastingly 
smaller. Assimilation, both in common sense 
terms and technically, is “not heeding” the 
new anchoring stimulus, i.e., judging it as no 
larger than the stimuli previously encountered. 
We might well predict that narrow subjects, 
sensitive to differences, will be subject to 
contrast effects. Broad subjects, on the other 
hand, should by virture of their less acute re- 
activeness to change, show less of the contrast 
effect and may indeed assimilate the stimuli of 
the extended range to the psychophysical 
scale established with the smaller range of 
stimuli. 

With these predictions in mind, we may turn 
now to the experiments proper. 


Experiment I 
Subjects and Procedure 


Forty-eight fifth grade school children served as sub- 
jects in this experiment. They were tested individually. 

Stimuli consisted of projected slides containing 
clusters of dots, cast on a bended screen at an exposure 
time of .5 second per cluster. The subject, facing the 
screen 12 feet away, was seated next to the table on 
which the 35 millimeter Bell and Howell projector was 
placed. At the beginning of the session, each subject 
was told that his task would be to decide whether each 
of the clusters to be flashed on the screen contained 20 
dots or did not contain 20 dots. If he thought that there 
were 20 dots in the cluster he was to say “yes”; if he 
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thought that the cluster did not contain 20 dots, he was 
to say “no.” The subject was then told that 20 would 
be the smallest number of dots in any of the clusters 
that he would see, and was shown two sample clusters 
of 20 dots each, as demonstrations. 

The presentation of the experimental series followed 
immediately. In order to avoid the possibility that the 
decision about the number of dots might be determined 
by the recognition of the shape of a cluster once seen, 
four clusters with different dispositions of dots in them 
were used for each number of dots, and, in addition, 
each of these slides was shown in different positions 
at its successive presentations. 

The experimental presentations were divided into 
two parts. First, a series of clusters containing from 
20-26 dots was presented eight times successively—a 
total of 56 presentations. The presentations were ran- 
domized in such a way that each cluster of the series 
appeared once before the whole series was repeated in 
a different random order. Following this, in the second 
part of a session, four series of clusters containing from 
20-30 dots were presented in varying random orders— 
a total of 44 presentations. 

The session was interrupted by two periods of rest 
of approximately 2 minutes, thefirstafter three presenta- 
tions of the series; the next, three presentations later. 

The scoring was based on the percentage of yes re- 
sponses, representing for each subject the proportion 
of the stimuli which for him fell into the category “20.” 
These percentages were taken for each subject for three 
separate phases of the presentations of stimuli: for the 
initial range of 20-26 dots in its eight consecutive 
presentations (Phase I), for the stimuli 20-26 of the 
extended range in its first two presentations (Phase II), 
for the stimuli 20-26 of the extended range in its last 
two presentations (Phase III). Breadth of category was 
defined as the proportion of stimuli assigned by a 
subject to the inclusion category 20 in Phases I, 0, 
and II, and a comparison of these phases provides us 
with comparative data for testing our hypotheses. 


Results 


Before the results are reported and discussed, 
the hypothesis stated in general terms earlier 
should be stated specifically in the context of 
the procedure just described. It was assumed 
that during the initial eight presentations of 
the stimuli in the original range, each subject 
would settle to his characteristic pattern of 
assigning a certain proportion of stimuli to the 
inclusion category of 20. The subjects could, 
thus, be divided, by means of ranking, into 
three groups: a group of broad categorizers 
with the largest proportions of number of calls 
of 20; a middle group, and a group of narrow 
categorizers with the smallest number of calls 
of 20. 

Extending the series of stimuli from clusters 
ranging from 20-26 dots to clusters ranging 


233 


from 20-30 dots was conceived as a form of 
anchoring, introducing a larger difference be- 
tween the two extremes of the range. According 
to the argument outlined previously, this 
change should have been more salient and ef- 
fective for the narrow than for the broad cate- 
gorizers. In other words, after the introduction 
of the additional larger stimuli, the narrow 
categorizers should, in their responses to this 
part of the new range which constituted the 
original range (clusters from 20-26 dots), in- 
crease the proportion of clusters called 20 
more than the broad categorizers. The narrow 
categorizers should, moreover, adapt back to 
their old narrow pattern with the continued 
experience of the new range provided by Phase 
HI; 
1. Initial breadih of category. The range in 
breadth of category for the original series of 
stimuli was wide: from 89.5%-30.3%, with 
the median proportion of calls of 20 being about 
55%, with six subjects calling 40% or fewer 
20, and eight calling more than 70% of the 
stimuli 20. 

The division of subjects into three groups 
was made at “cut-off” points nearest to 27% 
of the distribution (Kelly, 1939) for the upper 
and lower ends of the distribution. This re- 
sulted in 12 subjects with the largest per- 
centages of yes responses being assigned to the 
broad group, (with breadths ranging from 
64.3 %-89.5%), and 13 subjects with the lowest 
percentages of yes responses being assigned to 
the narrow group (with breadths ranging 
from 30.3%-44.6%). The difference between 
the number of subjects in the groups is due to a 
run of ties at the upper end of the distribution. 
Assigning all the remaining 23 subjects to the 
middle group seemed the least arbitrary 
procedure. 

2. Change in breadth of category as a function 
of extending the range. As a first test of the 
general hypothesis, a rank correlation was run 
between breadth of category in Phase I and 
the shift from Phase I to Phase II in the per- 
centage of calls of 20. This correlation was of 
the order of rho = —.433; with 46 df this is 
significant at p < .005. Thus, the larger the 
number of stimuli initially assigned to the 
category 20 by a subject, the less did he tend 
to increase this number for stimuli of 20-26 
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dots in the first two presentations of the ex- 
tended range. 

As Hamilton (1957) rightly points out, how- 
ever, “there are few forms of behavior, espe- 
cially in the cognitive sphere, which cannot be 
shown to vary with intelligence. In fact, it 
might be thought that intelligence is maxi- 
mally involved in any process requiring 
perceptual judgments” (p. 203). In pursuit of 
this point, a rank correlation between IQ and 
breadth of category in Phase I was obtained. 
It was of the order of —.53. The initial breadth 
of category appears then to be determined, in 
part at least by the subjects’ intelligence— 
more intelligent subjects tended to use nar- 
rower categories. The mean IQ of the broad 
group was 107.4; of the middle group, 113.6; 
and of the narrow group, 122.7. The dif- 
ference between the broad and narrow groups 
is significant at p < .002. 

The testing of predictions outlined above 
depended, however, on the relationships found 
between the breadth of category shown by the 
subjects in Phase I and the amount of shift 
from Phase I to Phase II. A rank correlation 
between the amount of shift and IQ was ob- 
tained, and found to be .27, reflecting the fact 
that narrow subjects shifted more. However, a 
partial correlation between IQ and shift, 
holding breadth of category constant, reduced 
this figure to .05. A complementary finding is 
that the partial correlation between breadth of 
category and amount of shift, this time holding 
IQ constant, remained as high as —.35. Thus, 
it would seem that, though breadth of category 
is consistently related to intelligence, the 
amount of judgmental shift that occurs when 
the environment changes is related not to 
intelligence but to breadth of category. 

The Wilcoxon matched-pairs signed-ranks 
test (Siegel, 1956) was applied to assess the 
significance of the increase in the percentage of 
yes responses to stimuli 20-26 after the ex- 
tension of the range for each of the groups. The 
results for the broad group were not significant. 
Out of 12 subjects in this group, 6 actually 
shifted in the direction of decreasing the pro- 
portion of their yes responses after the ex- 
tension of the range. In the middle group the 
shift was significant at p < .005 (19 out of 23 
subjects increased the proportion of their yes 
responses). In the narrow group, all 13 sub- 
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jects increased their proportion of yes re- 
sponses. The mean increase in percentages of 
yes responses from Phase I to II for broad 
categorizers was +.45, the range being from 
—21.4-+-21.4, The comparable increase for 
the narrow categorizers was +20.1, with a 
range from +3.6-+-37.5. The middle cate- 
gorizers were in between: mean increase in 


percentage of yes responses being 12.3 witha 


range from —12.5—+-30.4. Mann-Whitney U 
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tests were then applied to compare the amount | 


of shift found in the different groups. All of 


them were found to be significantly different | 
from each other (broad vs. narrow, p < .01; — 


broad vs. middle, p < .02; narrow vs. middle, 
p = .05). 

It might also be said parenthetically that 
the broad and narrow groups behaved con- 
sistently in their placement of dot clusters 
ranging in number from 27-30 in Phase II. 
Some 40.6% of these were called 20 by the 
broad categorizers in contrast to 20.2% called 
20 by the narrow categorizers, a difference 
significant at p < .025 by the U test. 

3. Changes in breadth of category as a function 
of adaptation to the extended range of stimuli. 
Let us now compare the change from Phase II 
to Phase III—the subjects now having had 
experience with two presentations of the ex- 
tended range and confronting the two addi- 
tional series. In the narrow group, out of 12 
subjects (data for this shift for one of the 
subjects were not available), 8 subjects shifted 
back towards fewer yes responses for the 20-26 
part of the range, 1 did not shift, and 3 shifted 
towards more yes responses. The comparable 
figures for the broad group are interesting. 
Only 2 subjects in this group showed a decrease 
in yes responses in Phase III in comparison 
with Phase II; 7 subjects increased their yes 
responses and the remaining 3 subjects showed 
no shift at all. A comparison of means tells the 
same story. On the average, subjects in the 
broad group increased their yes responses by 
7.0% between Phase II and III. Narrow 
categorizers decreased theirs 9.7%. The dif- 
ference between the two groups tested by the 
Mann-Whitney U test is significant at a p < 
.025. 

Here again there is a correlation of .37 be- 
tween the initial breadth of category and the 
amount of shift from Phase II to Phase III, 
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indicating that broad categorizers tend to 
broaden their range more as they get more 
exposure to the changed stimulus situation, 
the narrow categorizers having done most of 
their changing when the stimulus situation was 
first altered. A correlation of —.29 was found 
to hold between IQ and the amount of shift, an 
obvious consequence of the fact that the more 
intelligent subjects had already done their 
shifting in Phase II. A partial correlation was 
computed between IQ and the amount of 
shift, from IL to IM, holding breadth of 
category constant. It was found to be —.12. 
But a partial correlation between breadth of 
category and amount of shift, holding IQ 
constant this time, is .27. It can be argued then, 
that the amount of shift from Phase II to 
Phase III, reflecting adaptation to the new 
range, is related to the initial breadth of 
category but not to intelligence, though 
intelligence and the initial breadth of category 
are related to each other. 

The data for all subjects in the three phases 
of experiment are summarized in Table 1 in the 
form of percentage of yes responses given. 

Figure 1 summarizes the findings. The main 
feature of the results is the relative stability of 
performance in the broad group and the sharp 
changes from phase to phase in the narrow 
group. This is congruent with the general con- 
ception of broad and narrow categorizers as 
outlined in this paper and in others (eg, 
Gardner, 1953). The broad categorizers do not 
react to change markedly or rapidly; the small 
and insignificant amount of shift that they 
show after the extension of the range main- 
tains a slow and steady increase as the new 
range continues to be presented. Narrow cate- 
gorizers react to change strongly, but then 
they seem to revert rapidly to their preferred 


mode of response as soon as there is habitua- 


tion to the new situation. 

Our main hypothesis—that the narrow 
categorizers are more sensitive to change than 
broad ones—is confirmed. 

There is one other point to be made in con- 
nection with cognitive risk. One can conceive 
of risk regulation in terms of a cost-for-gain 
reckoning. That is to say, for a gain in ac- 
curacy or security or some other “good,” one 
pays a cost. In the case of the broad cate- 
gorizers in Experiment I, we see that they do 
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in fact call more of the stimuli containing 20 
dots correctly. The cost of this is calling many 
more of the other dot clusters incorrectly. 
The narrow categorizers show a considerably 
sharper slope in their discrimination function. 
The price they pay for not being wrong about 
the excluded items is missing some of those 


TABLE 1 
PERCENTAGE OF YES RESPONSES GIVEN BY SUBJECTS 
IN THE THREE PHASES OF EXPERIMENT I AND 
IN Exeerment III 


Experiment I 
Experi- 
Group | phase | Phase | Phase | Phase | Phase | "Tt 
I m | D m | D 
(20-26) | (20-26) | (20-26) (27-30) | (27-30) 
Broad 
1 89.5 | 71.4 | 78.6 | 62.5 | 62.5 | 58.9 
2 85.7 |100.0 |100.0 | 37.5 | 75.0 | 64.3 
3 83.9 | 78.6 |100.0 | 0 37.5 | 69.5 
4 80.3 | 64.3 | 85.7 | 37.5 | 50.0 | 71.4 
5 80.3 | 92.8 |100.0 |100.0 100.0 |100.0 
6 75.0 | 71.4 | 85.7 | 25.0 | 37.5 | 75.0 
7 71.4 | 50.0 | 78.6 | 62.5 | 50.0 | 94.6 
8 71.4 | 78.6 | 50.0 | 62.5 | 75.0 | 58.9 
9a 69.6 64.3 
10 67.8 | 85.7 | 70.1 | 37.5 | 37.5 | 83.9 
11 67.8 | 50.0 | 78.6 | 25.0 | 75.0 | 58.9 
128 64.3 62.5 
13° 64.3 62.5 
14 64.3 | 78.6 | 78.6 | 25.0 | 25.0 58.9 
15 64.3 | 85.7 | 85.7 | 12.5 | 50.0 | 58.9 
Me 75.1 | 75.6 | 82.6 | 40.6 | 56.2 
Mè 73.3 69.5 
Middle 
1 62.5 | 64.3 | 71.4 | 25.0] 0 55.4 
2 62.5 | 64.3 | 85.7 | 62.5 | 25.0 57.1 
3 62.5 | 50.0 | 64.3 | 37.5 | 0 58.9 
4 60.7 | 78.6 | 85.7 | 25.0 | 25.0 53.6 
5 58.9 | 78.6 | 57.1 | 12.5 | 25.0 | 51.8 
6 57.1 | 85.7 | 78.6 | 25.0 25.0 | 41.1 
7 57.1 | 71.4 | 57.1 | 12.5 | 50.0 50.0 
8 57.1 | 64.3 | 57.1] 37.5 | 25.0 92.9 
9 57.1 | 85.7 | 71.4 | 12.5 12.5 | 44.6 
10 57.1 | 71.4 | 78.6 | 62.5 50.0 | 66.1 
11 57.1 | 85.7 | 78.6 | 50.0 | 25.0 60.7 
12 55.3 | 42.8 | 35.7 | 37.5 50.0 | 55.4 
13 53.6 | 71.4 | 50.0 | 37.5 | 12.5 73.2 
14° 53,6 | 78.6 | 71.4 | 37.5 12.5 
158 51.8 57.1 
16 51.8 | 50.0 | 50.0 37.5 | 12.5 | 48.3 
17 51.8 | 64.3 | 85.7 | 25.0 62.5 | 41.1 
18 51.8 | 71.4 | 64.3 | 0 12.5 | 57.1 
19 50.0 | 64.3 | 78.6 | 37.5 12.5 | 50.0 
20° | 50.0 50.0 
21 48,2 | 50.0 | 57.1 12.5 | 25.0 | 39.3 
22 48.2 | 78.6 | 64.3 | 0 0 55.4 
23° 46.4 | 57.1 | 35.7 | 0 37.5 | 57.1 
24° 46.4 | 42.9 | 50.0 | 25.0 12.5 
25° 46.4 | 64.3 | 57.1 62.5 | 50.0 | 44.6 
Me 54.5 | 66.8 | 64.6 | 29.4 24,5 
Mè 55.3 55.2 
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TABLE 1—Concluded 


Experiment I 
Experi- 
Group | phase | Phase | Phase | Phase | Phase | "Rt 
tT | a | im | a. | ia 
(20-26) | (20-26) | (20-26) | (27-30) | (27-30) 
Narrow 
1 44.6 | 78.6 12.5 51.8 
2 42.8 | 57.1 | 71.4] 0 12.5 | 48.3 
3 41.1 | 71.4 | 50.0 | 12.5 | 25.0 | 51.8 
4 41.1 | 50.0 | 42.9 | 25.0 | 12.5 | 64.3 
b 41.1 | 64.3 | 50.0 | 0 0 60.7 
6 41.1 | 78.6 | 78.6 | 87.5 | 87.5 
7 41.1 | 57.1 | 21.4 | 0 0 62.5 
8 39.3 | 64.3 | 78.6 | 25.0 | 25.0 | 39.3 
9 39.3 | 42.9 | 35.7| 0 0 
10 35.7 | 50.0 | 14.3 | 87.5 | 87.5 | 64.3 
11 33.9 | 57.1 | 64.3 | 0 25.0 | 46.4 
12 32.1 | 57.1 | 35.7 | 12.5 | 0 35.7 
13 30.3 | 35.7 | 28.5 | 0 0 46.4 
M> 38.7 | 58.8 | 47.6 | 20.2 | 22.9 
MA 39.7 51.8 


^ Subjects not included in the analysis of Experiment I, and 
added subsequently for comparison of their performance in Experi- 
ments I and IIT. 

> Means for Experiment I, not including subjects marked a. 

Subjects taking part in Experiment I, and not available for 
Experiment III. 

d Means for comparison of performance in Experiments I and 
TI, not including subjects marked c. 

® Subjects falling in the lower part of the distribution (narrow 
group) for comparison between Experiments I and III, and in- 
cluded in means for narrow group (d), 


that should be included among the 20s. 
Broad subjects, in short, pay their price for 
being right; narrow subjects seem to pay 
theirs for not being wrong. 

100 
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But before these, or any other conclusions 
are finally drawn from the results, some very 
obvious artifactual possibilities must be dealt 
with. The results may have been due not to 
the narrow group’s more abrupt reaction 
to change, but to the fact that by definition the 
narrow categorizers have more room to expand 
their proportion of yes responses. A broad 
categorizer who started out with 80% of yes 
responses cannot, of course, achieve an increase 
of 25%. This “ceiling effect” may account for 
the broad group showing less increase in yes 
responses than the narrow group. Yet, the 
reversal of trends in shift from Phase II to 
Phase III argues in some measure against this 
artifactual explanation; so does the fact that 
half of the broad subjects decreased their 
proportion of yes responses from Phase I to 
Phase II, some of them quite considerably. 
The results, at least the shift from Phase I to 
Phase II it might also be argued, could have 
been due to regression effects. Experiment II 
was conducted to test further the validity of 
interpretation given to the results of Experi- 
ment I, particularly with respect to ceiling 
effects and regression effects. 

Another artifact, possibly lurking in the 
results, is that the narrow group is “narrow” 
not because of its preferred mode of taking 
cognitive risks, but simply because, for oné 
reason or another (such as, for example, in- 
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PHASES OF EXPERIMENT I 
Fic. 1. Mean percentage of stimuli in range 20-26 classified as 20 (yes responses) by broad and narrow cate- 


gorizers in the three phases of Experiment I. 
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terest in the task), the capacity of narrow sub- 
jects to discriminate was better than that of the 
broad group. The relationship between intel- 
ligence and breadth of category, previously 
reported, increases the likelihood of some such 
interpretation. Under the conditions of Experi- 
ment I and with the small number of yes 
responses that we obtained, the narrow group 
could by definition get closer to the actual 
leptokurtic distribution of the “correct” 
stimuli than the broad group. Thus, in Phase I, 
the proportion of clusters containing 20 dots 
was 14.3%, and decreased to 9.1% in Phases II 
and III. But note, however, that 7 out of 13 
subjects in the narrow group increased their 
proportion of yes responses to the new range 
presented to them in Phase II, while only 3 out 
of 12 subjects in the broad group did so, a dif- 
ference that is significant at the .02 level as 
estimated by a U test. There remains the pos- 
sibility, however, that there is some intrinsic 
relationship between intelligence test scores 
and narrow categorizing based on the possible 
bias of intelligence tests in favor of the precise 
or narrow categorizers. Yet this is not borne out 
by the indifferent partial correlation between 
IQ and shift from Phase II to Phase III. But 
such indirect evidence is not sufficient. Con- 
sequently, Experiment III was conducted in 
order to explore further the question of accu- 
racy. 


ExPERMENT II 


In Experiment I, change was introduced by 
extending the series beyond its initial range. 
Using the same series of stimuli, it should be 
possible to shift judgments in the opposite 
direction by biasing the distribution of stimuli 
through cutting off a part of the range. The 
order of presentation used in Experiment I was 
reversed in Experiment II: The subjects were 
first confronted with the range of 20-30 dots, 
then the larger end of the series was eliminated 
and only the 20-26 range presented. If it is 
true that change has a greater effect on narrow 
than on broad categorizers, then in Experiment 
II the narrow categorizers should decrease their 
proportion of yes responses in the 20-26 range 
from the original to the reduced series more 
than the broad categorizers. 

Twenty-four new subjects were used in this 
experiment, this time drawn from the fifth 
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grade. The initial series of 20-30 dot clusters 
was run through randomly eight times, (a total 
of 88 presentations), and then followed by four 
presentations of the reduced range of 20-26 
dots for a total of 28 presentations. Apart from 
this difference, the procedure was the same as 
in Experiment I. The subjects were ranked in 
the order of their percentage of yes responses to 
the 20-30 initial range to establish broad, 
middle, and narrow groups. 


Results 


The range in breadth of category for Phase 
I (now 20-30 dots) was from 95.4% to 21.6%. 
The 27% cut-off point resulted in six subjects 
being assigned to each of the two extreme 
groups: the broad and the narrow. The range 
in breadth of category for the broad group was 
95.4%-59%; for the narrow group, 36.4%- 
21.6%. 

This time, as one shifts from Phase I to 
Phase II, one finds the narrow subjects de- 
creasing their calls of 20 in response to the 
narrowing of the range—all of them doing so 
with a mean shift in percentage of yes responses 
of —13.7 (Table 2). The broad group again 
hardly reacts to change: three of the subjects 
decrease their proportion of yes responses, one 
does not change, and two increase the number 
of yes responses. The mean percentage decrease 
of yes responses for the broad group is —1.4. 
The difference, as measured by a U test, is 
significant at the .03 level. 

The shift from Phase I to II goes counter to 
what would be expected by virtue of “ceiling” 
and “floor” effects, or, indeed, according to the 
effect of regression. Those who have been 
calling few of the clusters 20 decrease the 
number of their calls much more markedly 
than those who have been calling many 
clusters 20. While these results provide an 
effective answer to the query raised in the last 
section about the possible contribution of 
artifacts to the results of Experiment I, they 
also raise some interesting questions of inter- 
pretation. k 

Take first the narrow categorizers. When the 
range of stimuli to which they are exposed 
broadens, their categorizing broadens. It is, if 
you will, now safer to take positive fliers on 
doubtful stimuli, and more dot clusters are 
labeled as 20. When the universe of stimuli 
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TABLE 2 
PERCENTAGE OF YES RESPONSES GIVEN BY SUBJECTS 
IN THE THREE PHASES OF EXPERMENT II 


Group Phase I | Phase I | Phase I |Phase II |Phase II 
(20-30) | (27-30) | (20-26) | (20-26) | (20-26) 
Broad 
1 95.4 | 90.6 | 98.2 | 100.0 | 100.0 
2 80.7 | 53.1 | 96.4 | 85.7 | 64.3 
3 67.0 | 40.6 | 82.1 | 78.6 | 71.4 
4 62.5 | 31.2 | 80.4 | 64.3] 57.1 
5 62.5 | 71.9 | 57.1 | 57.1] 85.7 
6 59.0 | 34.4 | 73.2 | 92.9] 71.4 
M 71.2 | 53.6 | 81.2 | 79.8 | 75.0 
Middle 
1 56.8 | 37.5 | 67.9 | 57.1 | 50.0 
2 55.6 | 37.5 | 66.1 | 78.6| 85.7 
3 54.4 | 34.4 | 66.1 | 71.4 | 42.9 
4 52.3 | 28.1 | 66.1 | 42.9 | 42.9 
5 50.0 | 25.0 | 64.3 | 42.9 | 57.1 
6 50.0 | 37.5 | 57.1 | 78.6 | 57.1 
7 47.7 | 12.5 | 67.9 | 64.3 | 42.9 
8 46.6 | 15.6 | 64.3 | 57.1 | 57.1 
9 46.6 | 50.0 | 44.6 | 35.7 | 57.1 
10 42.0 | 28.1 | 50.0 | 42.9 | 57.1 
11 40.9 | 40.6 | 41.1 | 57.1 | 57.1 
12 37.5 9.4 | 53.6 | 57.1] 57.1 
M 48.4 | 29.7 | 59.1 | 57.1 | 55.3 
Narrow 
1 36.4 | 12.5 | 50.0 | 28.6 | 50.0 
2 35.2 3.1 | 53.6 | 42.9 | 21.4 
3 32.9 | 34.4 | 32.1 | 21.4 | 21.4 
4 32.9 9.4 | 46.4 | 42.9 | 42.9 
5 31.8 | 34.4 | 30.4 | 14.3 | 14.3 
6 21.6 | 12.5 | 26.8 7.1) 14.3 
M 31.8 | 17.7 | 39.9 | 26.2| 27.4 


narrows in range, narrow categorizers show a 
response of caution, and the bounds of their 
inclusion class are narrowed. Interestingly 
enough, both reactions go counter to the 
“reality” situation. That is to say, in Experi- 
ment I the proportion of actual clusters of 20 
dots decreases from Phase I to Phase If—yet 
the yes responses of narrow subjects increases. 
In Experiment II it is the reverse: the propor- 
tion of clusters of 20 dots increases from Phase 
I to Phase II, but the calls of 20 decrease for 
narrow subjects. It seems plain then, that 
while these subjects are responding to change 
in the environment, they are doing so in terms 
of internal requirements, not external ones. 
What is also interesting is that whereas in 
Experiment I narrow subjects reverted to 
narrow categorizing after they had habituated 
to the extended range, in Experiment IT they 
do not change in Phase III after they have 
narrowed their categorizing in response to a 
narrowed range of stimuli. There is no change 
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in the percentage of yes responses from Phase 
II to Phase III (Table 2). In short, a change in 
categorizing in the preferred direction sticks, a 
change in the nonpreferred direction (toward 
greater breadth) habituates out. 

Now consider the broad categorizers. In 
Experiment I we show that they did not 
respond on the average to an extension of the 
range of stimuli to which they were exposed in 
shifting from Phase I to Phase I. But in 
Phase III, habituation to the extended series, 
or possibly an insignificant and delayed reac- 
tion to the change of stimulation, led to a 
moderate increase in the breadth of their 20 
category. They, too, respond to increased 
diversity in environmental stimulation, but 
more slowly and more modestly. In Experiment 
Il, the picture is very similar: as before, and in 
contrast to narrow categorizers, their response 
to change is slow and insignificant. 

One last check on possible artifacts in Ex- 
periment I is provided by the design of Experi- 
ment Il—a check on regression effects. In 
Experiment I, the transition from the initial to 
the extended range occurred after 8 successive 
presentations of the 7 stimuli of the initial 
range—a total of 56 presentations. This was 
followed by Phase II which consisted of a 
series of 11 stimuli repeated twice—22 presen- 
tations. As Phase I in Experiment II consisted 
of 11 stimuli repeated 8 times, it was possible 
to ascertain whether any regression effect 
(which would consist of an increase in the 
proportion of yes responses for the narrow 


group and of a decrease for the broad group) í 


had occurred at the point of presentations 
nearest to the transition point between Phases 
I and II in Experiment I. This was done by 
comparing the percentages of yes responses for 
the first 5 presentations of the series (a total 
55) with the same percentages for the 2 fol- 
lowing presentations (a total of 22). There was 
no evidence of any trend towards increase for 
the narrow categorizers, or towards decreas 
for the broad ones. 


Experment II 


This experiment was conducted in order to 
ascertain whether the results of Experiment $ 
were due to consistent differences between the 
broad and the narrow categorizers in thei 
preferred mode of dealing with an uncertain 
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cognitive situation, rather than to a tendency 
of the narrow group to be more accurate, 
presumably under any conditions. A subject 
who is in the group of narrow categorizers in 
Experiment I because he might be intent to 
perform as accurately as possible should be- 
come a broad categorizer in a situation where 
the distribution of stimuli is such that most 
stimuli fall into the inclusion or target category. 
The converse is, of course, not necessarily true: 
if a broad categorizer performed as he did 
because he was bored and was paying very 
little attention to the task, he might still 
perform in a similar manner when confronted 
with any distribution of stimuli—assuming 
that yes responses are easier for maintaining 
an attitude of nondiscriminating boredom. 

Though we have spoken of “accuracy,” there 
is one general question concerning it that is 
difficult to answer in these experiments. In 
most cognitive situations flexibility and avoid- 
ance of extreme response tendencies probably 
lead in the end to most “adaptive” responses— 
that is, to responses which keep in closest touch 
with the normally distributed and continuously 
changing environment. There may be situa- 
tions, however, in which either a very narrow 
or a very broad type of response is the most 
useful. This, as Bruner, Goodnow, and Austin 
(1956) have shown, is a function of the “payoff 
matrix” in the situation, of the respective 
consequences that errors of overinclusion or of 
overexclusion may have. It is, therefore, im- 
possible to state a priori which of the two 
extreme forms of categorizing is the “better” 
or the most useful save in a restricted way 
applying to specific situations. 

The arrangement of stimuli in Experiment I 
favored the narrow categorizer in the sense 
that few stimuli were in fact 20s. We decided, 
therefore, to run another experiment in which a 
broad style of categorizing would lead to per- 
formance corresponding more closely to the 
actual distribution of stimuli. This time, the 
great majority of stimuli would belong in the 
inclusion class and the preferred response 
distribution of broad categorizers would con- 
form to the distribution of “correct” or in- 
clusion stimuli. 


Subjects and Procedure 


Experiment III was conducted with 45 subjects 
who took part in Experiment I plus 4 subjects who were 
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added subsequently. The latter were first subjected to 
the procedure of Experiment I, and then, a few days 
later, took part in Experiment IIT. Stimuli consisted of 
straight lines drawn on rectangles of white cardboard, 
11” X 14” in size. The inclinations and positions of 
lines on the rectangles were varied randomly. There 
were 56 rectangles: on 48 of them (6 out of 7) lines 5” 
long were drawn, one to a rectangle. The remaining 8 
rectangles contained lines varying in length from 514" 
to 53". The subjects, tested individually, were 
seated at one end of a table, the experimenter at the 
other, separated by a distance of 3 feet. The lines were 
shown one by one, the order of presentation so ar- 
ranged that one line longer than 5” appeared in each 
successive series of six 5’’ lines, the serial position of 
the longer line varying at random. Subjects were told 
to decide whether a line was 5’’ long or not. If they 
thought that it was, they should say “yes”; if they 
thought not, “no.” They were duly informed that some 
of the lines would be 5’’ long, some slightly longer. 
Two 5” lines were shown first, and the subjects told 
what they were. The presentation of the experimental 
series followed immediately. Time of exposure was not 
limited, the cardboard rectangle being held up until the 
subject made his decision. 

Scores again consist of the percentages of yes re- 
sponses given to the total of 56 stimuli presented—a 
number equal to the number of clusters of dots pre- 
sented in Phase I of Experiment I. The prediction, of 
course, is that if the subjects’ performance in Experi- 
ment I reflected their preferred mode of categorizing, 
they should perform in a similar manner in Experi- 
ment III, despite the radical change in the frequency of 
stimuli belonging to the inclusion class—that is, broad 
categorizers should respond more broadly in both 
experiments, 


Results 


The range of percentage of yes responses was 
from 100% to 35.7% with a median of 57.1%. 
Table 1 sets out the distribution of the sub- 
jects according to the percentage of their yes 
responses. 

1. Comparison of breadth of category in 
Experiment I and Experiment III. The sub- 
jects were ranked according to their breadth of 
category in both experiments. A rank correla- 
tion was run between their performances in 
the two experiments. This was found to be of 
the order of rho = .572, significant at p < 
.0005. 

A further assessment of the similarity of 
performance in the two tasks was made by 
selecting the subjects who in Experiment I 
belonged to the broad and narrow groups, and 
comparing the difference in the performance of 
these two groups in Experiment III. The cut- 
off point for the groups in Experiment I was 
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slightly different from the one used in the pre- 
vious analysis of results for this experiment, as 
some subjects dropped out and some were 
added. Taking ties into account, the nearest 
point to 27% now included 15 subjects in the 
broad group, the narrow group consisting as 
previously of 13 subjects. The difference, as 
assessed by a U test, is highly significant, the 
confidence level being less than .001. The mean 
percentage of yes responses in the previously 
defined broad group is 69.5. The percentage of 
yes responses in Experiment III of the previ- 
ously defined narrow group is 51.8. 

2. Comparison of accuracy of performance in 
Experiment I and Experiment III. The group of 
narrow categorizers performed more ac- 
curately in Phase I of Experiment I than the 
group of broad categorizers. Accuracy of 
performance was assessed by assigning to each 
subject a score that consisted of the proportion 
of his number of yes responses assigned by him 
to clusters containing 20 and 21 dots. The per- 
centage of accurate calls for the narrow group 
was 45.9, as compared with 33.4 for the broad 
group, a difference significant at the .001 level. 

The same measure of accuracy could not be 
applied in Experiment III, as almost all sub- 
jects gave all correct no responses to the eight 
lines that were longer than 5”. Therefore, the 
assessment of accuracy here was made by 
summating all incorrect responses made by a 
subject: no responses given to 5” lines, and yes 
responses given to the longer lines. Of the 25 
subjects belonging to the broad and narrow 
groups in Experiment I whose accuracy of 
performance in that experiment had been 
originally compared, 23 were still available in 
Experiment III. Their accuracy of performance 
in the new task was compared and the broad 
subjects were found to have 81.5% of their 
calls accurate, in contrast to 71.4% for the 
narrow subjects—a difference significant at 
the .01 level. 

To sum up Experiment III, then, subjects 
maintain their relative positions with respect to 
breadth of categorizing when they shift from a 
situation where few stimuli fit the specifications 
of an inclusion class to one where many stimuli 
fit that specification: ones who are narrow in 
the first situation tend to be narrow in the 
second, and so, too, for those who are at the 
broad end of the distribution. One finds, more- 
over, that whereas in a situation where few 
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stimuli fit the specifications for inclusion in the 
target class, narrow categorizers are more ac- 
curate, whereas when many stimuli fit, the 
broad categorizer turns out more accurate. 
Accuracy, in brief, seems more a function of a 
happy fit between response preference and 
stimulus properties than of discrimination 
capacity, much as in an experiment of Tagiuri, 
Bruner, and Blake (1958), where accuracy of 
social perception was a systematic resultant of 
two response patterns that had nothing to do 
with discriminative finesse. It would seem, then, 
that not only does discriminative capacity not 
determine categorizing breadth, but that, in a 
paradoxical way, it is the other way round. 


CONCLUSIONS 


Several things appear quite plain. To begin 
with, there appear to be consistencies in 
breadth of categorizing. We find, moreover, 
that narrow categorizers tend to be more sensi- 
tive to changes in the stimulus environment. 
They show more alteration in their categorizing 
breadth in response to change and show it more 
swiftly—though in a way that is not necessarily 
“rational.” That is to say, when the range of 
stimuli to which they must respond broadens, 
they broaden their equivalence range for 
stimuli of a target class, although the specifica- 
tions of the target class have not changed, and 
although their response may lead to more er- 
rors. It is as if they were “exploring” the new 
and wider range. Broad categorizers, on the 
other hand, do not change as much in response 
to an increase in the range of stimuli to which 
they must respond. They appear to be “hold- 
ing on” to old methods in the face of a new 
situation. When the changed stimulus situation 
persists and becomes a steady state, the nar- 
row categorizer reverts to his more precise and 
narrow mode of categorizing and reduces 
error. The broad categorizer, slower to react 
in the first place, begins to respond to change 
only when the change becomes more perma- 
nent. 

An analogous picture holds when the stimu- 
lus world to which broad and narrow cate- 
gorizers are exposed is constricted and becomes 
less varied. The narrow categorizer “follows” 
the change more swiftly by narrowing his 
categorizing range for the target class. Again 
even though logically it may lead to more 
error, for the specification of the target class 
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has not changed. This time his response seems 
to be in the direction of closer analysis of a 
narrower range. The broad categorizer re- 
sponds with no discernible change, as if he were 
riding out the narrowing of the stimulus world 
that runs counter to his preference for breadth 
rather than changing with it. 

It should not be surprising then—although 
we were taken unawares—that breadth of 
category is associated with intelligence, the 
narrow categorizers having a higher average 
IQ than the broad ones. Interestingly enough, 
though, shift in categorizing in the face of 
stimulus change is found to be related to 
breadth of categorizing but not to intelligence. 
Regardless of intelligence, the narrow cate- 
gorizer handles change by shifting, by reacting 
to contrast; the broad categorizer shifts much 
less and later. In sum, reaction to change ap- 
pears to be a strategy of dealing with the con- 
sequences of error. The narrow categorizer 
appears to prefer the risk of reacting and pos- 
sibly being wrong. The broad categorizer 
prefers the risk of mot reacting to change and 
possibly being wrong. Now we are in a position 
to ask about personality. What personality 
characteristics produce such preferences? 
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ore than 20 years after it was first 
published, a monumentally difficult, 
subtle, and as yet untranslated 
German philosophic work—Sein und Zeit by 
Martin Heidegger—has become the bible of 
many psychotherapists in this country. 
Heidegger has nothing to say explicitly about 
therapy; and even those of his ideas which 
seem relevant have been accessible to English- 
speaking readers only in some of his shorter 
pieces, through expository writings in various 
textbooks of philosophy and theology, or in 
one excellent essay by Brock (Heidegger, 1949, 
pp. 24-131). Therefore Heidegger’s influence 
has had to be exerted indirectly, through one 
or another of the practicing psychiatrists who 
claim to speak for him or in his name. In this 
way, often at one or more removes, a new ap- 
proach to psychotherapy has recently come 
into prominence and a new vocabulary has 
become fashionable: encounter, authentic and 
unauthentic, care, being, engagement, finitude, 
dread, thrownness, and commitment, among 
many other terms. 

Most striking is the fact that although a 
supposedly novel and radically different ap- 
proach to the practice of psychotherapy has 
lately appeared in America—bearing the name 
of existential analysis as an adaptation of a 
significant trend in Europe—its guiding ideas 
are either guesses as to what the founding 
figures might have advocated or suggestions 
culled from scattered works which were not 
meant originally as texts in psychotherapy. As 
a full-scale exposition of theory and practice, 
there is next to nothing from any of the major 
European figures in this movement, with the 
sole exception of Frankl (1958), who has been 
careful to identify his own approach as Logo- 
therapy and to insist that the name for this 
type of existential analysis is not Daseinsanaly- 
sis, as with Binswanger, but Existenzanalysis 
(1959, p. 157). 

Existential analysis appears to be the most 
important movement in the therapeutic field 
since the rise of the neo-Freudian schools of 
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Sullivan and Horney. Indeed, in the eyes of 
its wide range of proponents, the historic mis- 
sion of the new approach is to go beyond all 
forms of psychoanalysis both in doctrine and 
in practice. In a report on the recent Inter- 
national Congress of Psychotherapy at Barce- 
lona, Lefebre (1959) noted that, to judge from 
the papers at this meeting, the existential 
analytic approach enables psychotherapists to 
“rescue psychoanalysis from Freud and the 
Freudians.”” Yet in spite of the enthusiastic 
support it has enlisted, there remains the 
strong possibility that the new movement may 
be not yet a fact but only a hope, or that on 
examination it may be revealed as largely a 
fiction. It is the purpose of the present paper 
to review a selection of claims made for exis- 
tential psychotherapy and to attempt an 
assessment of its present status and future 
prospects. 
DEFINITION 

The first question that may be raised is 
what is meant by existential analysis or exis- 
tential psychotherapy. Surprisingly, however, 
there seems to be little agreement on a defini- 
tion of the term. Its history may therefore be 
of interest in pointing to some of the termino- 
logical and conceptual confusion in this area. 
According to Straus (1959), the term which 
has been translated as Existential Analysis was 
introduced by Binswanger (1958a) in his “Case 
of Ellen West.” However, he meant it in quite 
a special sense, and he explicitly distinguished 
between existential analysis (Daseinsanalysé, 
as he called it) and Heidegger’s analytic 
existence (Analytik des Daseins). “The similat- 
ity of the expression,” Binswanger noted, “is 
justified by the fact that the anthropological 
or existential analysis relies throughout on that 
structure of existence as being-in-the-world 
which was first worked out by the analytic of 
existence” (1958a, p. 271). A great part of 
Heidegger’s Sein und Zeit is devoted to an 
elaboration of the “existentials’! which make 

1 This term (Evistentialen), referring to aspects of 
the structure of Dasein, is used in place of “categories”; 
for technical reasons the preference is not arbitrary- 
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up the structure of Dasein; and it was this 
elaboration, the term for which is usually trans- 
lated as analytic of existence, which provided 
the starting point for Binswanger’s (1957) 
historic series of clinical studies. However, the 
clinical work of the pupil differs quite deliber- 
ately from the ontological investigations of the 
teacher. Binswanger’s approach rests upon and 
arises from Heidegger’s studies and often 
makes use of somewhat similar terminology, 
but it is meant essentially as a translation into 
concrete instances of the guiding conceptions 
which were originally worked out for the meta- 
factual realm of Dasein. 

If only this were the case, it would be 
stretching the meaning of the term to reapply 
Heidegger’s teachings under the name of exis- 
tential analysis as though they constituted a 
form of therapeutic doctrine. But to complicate 
the matter further and to make even less 
seemly the present trend toward an indiscrimi- 
nate use of Heidegger’s conceptions, Binswan- 
ger’s (1958b) term has itself undergone a subtle 
alteration, At no point did the latter represent 
it as a method or approach in therapy; rather, 
he offered it as “an anthropological type of 
scientific investigation” (p. 191). As a prac- 
ticing clinician, he is well aware of the essential 
difference in attitude between treatment in 
therapy and investigation in science. At best, 
he maintains, the success of his method of in- 
vestigation might aid the clinician to “comply 
with a therapeutic demand,” but in this respect 
existential analysis is no more a method of 
therapy than is an investigational technique 
such as the Rorschach. More recently, in his 
paper for the Barcelona congress, Binswanger 
re-emphasized this point. Existential analysis, 
he said, refers to a scientific, not a therapeutic 
endeavor; psychoanalysis is still indispensable. 

These statements may be contrasted with 
the varied definitions for existential analysis 
that have been offered since the approach has 
become widespread. It has been defined as 
“the application of existentialism to psycho- 
therapy” and as “an exploration of the onto- 
logical ground of man” (Van Dusen, 1959a, 
pp. 101, 104). According to one reviewer 
(Ellenberger, 1958), it is “simply the applica- 
tion of certain existentialist concepts to psy- 
chotherapy, without regard to phenomenology 
or existential analysis” (p. 119); yet another 
writer (May, 1958a) defines the existential 
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analyst as a phenomenological and existential 
psychiatrist or psychologist. Finally, existen- 
tial analysis has been called “a clinical prac- 
tice, outwardly similar to other forms of psy- 
choanalysis” (Van Dusen, 1957, p. 310). The 
lack of agreement here suggests a serious con- 
fusion at just the point where there ought to 
be the most clarity. 

What, in actual fact, does existential analy- 
sis as described by its proponents consist of? 
Five aspects may be discussed in terms of 
the extent to which each appears to be a 
sound contribution to clinical theory: a de- 
scription based upon a certain view of modern 
culture, a set of working procedures, a new 
terminology for current therapeutic concepts 
and practices, an attack on some excesses and 
limitations of orthodox Freudian theory, and 
an attempt to popularize the thinking of 
Heidegger. 


A DESCRIPTION BASED UPON A 
View OF CULTURE 


In one of the early papers in English on this 
subject, Edith Weigert (1949) noted that 
existentialism does not lend itself to a concern 
for individual psychopathology but only to a 
description of “certain basic irreducible dialec- 
tic trends like One and Self, Care and Love” 
(p. 409). “Even if the existentialist may not 
have much to contribute to a greater under- 
standing of individual psychopathology or 
therapy, he affords us a profound insight into 
the pathology of the culture as a whole” 
(p. 412). This line of reasoning, to the effect 
that a method for understanding what is 
sometimes called the sickness of our age is 
helpful to the therapist and perhaps even to 
his patient, forms the basis for the essays on 
modern man with which existential analysts 
frequently preface their clinical studies. The 
lengthy historical survey by May (1958a) is a 
case in point. Writers in this area, often show- 
ing more concern for man in general than for 
men in particular, show a strong tendency to 
turn their study of persons into a self-conscious 
sociology. 

One wonders, however, how relevant the 
theme of cultural history is to the practice of 
psychotherapy. It is undeniable that at every 
point in his therapeutic progress the patient 
is at a moment in human history as well as 
at a moment in his own history. But are these 
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two moments equally pertinent to the process 
of therapy? The patient’s insight, “I am es- 
tranged from my own true self,” may not be 
the same kind of datum as the insight—on the 
part of the therapist, or the patient in his 
own living room, or any other adult—that 
modern man in general is estranged. Indeed, 
it is difficult to see how attaining the latter 
insight could help one to achieve the former, 
especially since it is also probable that it 
would act as an obstacle rather than an aid. 


A Ser or PROCEDURES 


In the flood of recent commentaries on 
existential analysis it is not easy to find a 
thorough, precise statement of its procedures, 
particularly as they may differ from current 
practices among competent therapists of all 
persuasions. May (1958a) refers in general 
terms to a “variability” of techniques, the 
specific ones depending on the patient and the 
phase of treatment; but he also quotes a 
personal communication from Roland Kuhn, 
a Swiss psychiatrist who has worked closely 
with Binswanger: “Since existential analysis 
is a relatively new discipline, it has not yet 
had time to work out its therapeutic applica- 
tion in detail” (p. 76). Van Dusen (1959a) is 
somewhat more specific, though he too begins 
with the general statement that “other than 
some central elements, the existential analysts 
have not yet developed a dogma of practice” 
(p. 108). He sees the following as the central 
elements of practice: a variety of techniques 
and approaches, a centering on the critical 
here and now, an emphasis on the patient’s 
capacity to choose rather than on directive 
methods, an avoidance of categories and the 
historical past (1957, p. 318), and “a Weltan- 
schauung which sees a being here and now 
struggling to manifest more perfectly” (1959a, 
p- 108). The essential characteristic, however, 
“lie(s) in the way the analyst understands the 
process” (1957, p. 319). Boss (1958), who is 
one of the European leaders in the new move- 
ment, has stated frankly that “existential 
analysis as such is indifferent both to psycho- 
therapeutic techniques and to practical conse- 
quences and aims” (p. 119). Other writers, 
however, express a greater optimism, although 
they are seldom able to be more specific. 
Ellenberger (1958) notes hopefully that “phe- 
nomenology opens the path to a new type of 
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psychotherapy which is still in its early stages 
of development” (p. 113), and Straus (1959) 
offers an optimistic summary in regard to 
therapeutic practice: “Although Daseinanalyse 
can be applied to psychiatric cases, at its roots 
it is not a specific psychiatric method; still 
less is it primarily a psychotherapeutic tech- 
nique. However, the concept of being-in-the- 
world, as the basic structure of Dasein, the 
distinction of authentic and unauthentic 
modes of existence, the emphasis placed on 
finiteness, Nothingness, and on the relation 
to possibilities and to the future—all these 
lead to definite psychotherapeutic conse- 
quences” (p. 161). In any case, there remains 
the difficult question of whether, if the prac- 
tice of existential analysis does not differ from 
other therapeutic methods, we are not all 
really existentialists without knowing it; or if 
we are not, whether the existential analysts 
are implying that methods and procedures 
are in the long run less important than the 
therapist’s theoretical constructions and 
modes of understanding. 

It may be noted that there is a discrepancy 
between two of the claims made for existential 
analysis. On the one hand, its expositors in- 
sist on their fundamental respect for the con- 
crete phenomenological datum—although they 
usually use the term “phenomenological” in 
the limited and nontechnical sense of “subjec- 
tive’—which is obtainable, in the view of 
some, only by an act of empathic apprehension 
of the structure of the patient’s private world. 
Yet on the other hand, they emphasize their 
concern for general and historically condi- 
tioned human attitudes and for constructions 
which are intellectually satisfying to the 
therapist. The discrepant claims usually seem 
to be resolved in favor of a guarantee of 
growth on the part of the therapist, whether 
or not there is also change in the patient. The 
nature of this resolution—best exemplified in 
Binswanger’s (1958a) “Ellen West,” which 
has perhaps served as the model—may throw 
some light on the fact that on close examina- 
tion existential analysis turns out to have 
very little to offer in regard to the practicali- 
ties of everyday clinical work. It is of some 
interest that Frankl (1958), who has publicly 
dissociated himself from the main stream of 
existential analysis, is the only one of the 
European psychotherapists who has made 
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specific contributions to therapeutic practice, 
for example, his technique of the “paradoxical 
intention.” At the same time, however, he is 
careful to avoid any claims to establishing a 
new school or method of therapy: “Logo- 
therapy is no substitute for psychotherapy 
but its complement” (p. 36). 


TERMINOLOGY FOR THERAPEUTIC 
CONCEPTS AND PRACTICES 


When the conceptions and practices offered 
as existential analysis are summed up and 
critically examined, they turn out to bear a 
close resemblance to many which have been 
adopted independently by psychotherapists 
trained in widely varying disciplines and ap- 
proaches. Consider, for example, this descrip- 
tion by Sonneman (1958) of the essence of 
the therapy process: “The daseinanalytic 
therapist-patient exchanges are therefore quite 
frontal discussions on a basis of absolute and 
equal freedom—free from any and even the 
most hidden implications of a mere status- 
derived or situation-based authority of the 
therapist. They are shared engagements of 
both partners at once, and what the two en- 
gage in is never the psyche of the patient in a 
fictitious isolation but always his world, always 
that which matters to the patient, and exactly 
on the terms on which it does” (p. 145). It is 
true that the practice of therapy does not 
always achieve this laudable goal, for reasons 
attributable many times to the narrowness or 
incompetence of the therapist himself, and it 
is welcome indeed to have the case for an 
optimal relationship stated so movingly. But 
it is equally true that no competent therapist 
would deny these words, or even fail to under- 
stand them immediately and be able to apply 
them to his own practice, to his successes as 
well as his failures. The basic question, which 
is usually bypassed, is whether the admitted 
validity of Sonneman’s ideas bears any legiti- 
mate relation to Binswanger rather than, say, 
to Sullivan, to the best of Heidegger rather 
than the best of Freud or Adler.? 

Still clearer examples may be found in other 
writers. May (1958a), after stating that “the 


2 There is no intent here to dismiss some worthy 
contributions by the method of discovering similar 
statements in earlier writings by others. The issue is not 
who deserves credit for saying it first but rather who 
should be credited at all. 


achieving of the sense of being is a goal of all 
therapy” (p. 44), spells out his prescription, 
which is essentially a recommendation for 
genuineness on the part of the patient. Simi- 
larly, Hora (1959b) substitutes a new term 
“unauthenticity,” for the old term “duplic- 
ity” and defines it as what occurs when the 
patients gratify their own ‘personal needs 
for tension-reduction even though they ap- 
pear to be talking to each other and seem- 
ingly are interested in each other” (p. 85). 
He makes an excellent plea for such estab- 
lished goals in therapy as sound values, in- 
sight, self-awareness, and the constructive 
use of anxiety, goals with which none would 
quarrel. He insists, however, on presenting 
them in a new wrapping. But in so doing, 
does he not run the risk of contradicting 
the very goals that he advocates for his pa- 
tients? 


ATTACK ON ORTHODOX 
FREUDIAN THEORY 


Something like the existential analytic 
movement was, without doubt, long overdue 
in this country. It had already been an im- 
portant influence in European clinical practice 
for more than a decade. Although a number of 
important movements had developed before 
this to challenge the orthodox Freudian posi- 
tion, they all shared the weakness that they 
were in significant measure reactions against 
Freud and therefore at their origins presuma- 
bly joined in a common system, In order to 
avoid this dilemma, an approach was needed 
which started from completely different 
sources. A psychotherapy based on existen- 
tialist tenets seemed to satisfy the require- 
ment and perhaps for this reason was quickly 
seized upon by those who saw psychoanalysis, 
in any of its forms, as too restricting because 
it was bound to outmoded conceptions which 
told only part of the story about the patient 
in midcentury. An admirable summary of the 
necessity for this development is given by 
Franz Alexander (1959): “One must welcome 
the existentialist emphasis as a counterbalance 
to the prevailing trend of psychoanalytic prac- 
tices and teaching—namely, the trend toward 
losing sight of the gap between theoretical 
generalizations and the individual patient, 
whose uniqueness of necessity requires a 
flexible application of general principles, 
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toward replacing the individual approach to 
each specific person with rigid technical rules. 
This trend eventually leads to a deification of 
technique as an aim in itself, as the essence of 
psychoanalysis instead of a servant of its 
goals” (p. 94). 

The essence of the existential analytic 
quarrel with Freud, as formulated by Boss 
(1958), is that the new approach states a 
human norm which differs from that of psy- 
choanalysis. His own goal, he says, is that his 
patients “see themselves, so to speak, as a 
light emanating from the mystery of existence, 
in whose rays all things and fellow-beings are 
allowed to appear and to develop according 
to their own nature” (p. 119). In, thus, 
repeating the ideas and even some of the phras- 
ing of Heidegger, Boss states clearly a con- 
ception of cure, of health, and of psychic func- 
tioning which bears little resemblance to the 
nineteenth century associationism of Freud’s 
theoretical system. Similarly, Binswanger 
(1958a) devotes an entire section of his longest 
case study, “Ellen West,” to a comparison of 
his own approach with that of classical psycho- 
analysis, concluding that the latter is a ‘‘one- 
sided distortion of the human image in the 
frame of a scientific theory of man” (p. 328). 
Whether the existential analytic approach 
will be able to develop its own systematic 
psychological theory to substitute for the 
psychology so elaborately worked out by 
Freud is a question as yet unanswered. Thus 
far, there is very little evidence in either 
European or American psychology of such a 
theory in the making,’ and the only systematic 
contributions of merit to have reached a large 
audience, in the works of Sartre (1948a, 1948b, 
1953), have been conspicuously ignored by 
spokesmen of the existential analytic persua- 
sion. 

In the light of the deliberate attempts within 
existential analysis to break through the 
limitations of psychoanalytic theory, it is of 
particular interest that adherents of the new 
approach have been unable to come to a satis- 
factory position in regard to Freud’s clinical 


3 Mention should, however, be made of the hopeful 
indications onesees in the writings of the Adlerian group 
in this country, in the clinical and methodological 
contributions of Erwin Straus, and in some of the 
studies by von Gebsattel, Minkowski, and Sarro in 
Europe. 
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contributions. Lefebre’s (1959) comments con- 
cerning attempts at the Barcelona congress to 
rescue psychoanalysis from Freud and the 
Freudians seem to imply a general agreement 
among existential analysts that the psycho- 
analytic practice of therapy is worth taki 

over into a new system. But Boss (1958), in 
particular, has some difficulty on this score, 


After noting that “practical indications” in 


therapy should usually be consistent with 
Freudian teachings, he adds that Freud “in- 
tuitively” developed technical rules which 
have a “deeper... existential meaning.” In 
regard to his own technical rules, Freud was 
“intuitively” an existential analyst; only in 
his theory did he go astray. It would appear 
from this passage that Boss is trying to do 
more than save psychoanalysis from Freud; 
he is trying to save Freud from himself, As a 
former pupil of Binswanger, who in turn was 
a lifelong personal friend of Freud, and now 
as an intimate of Heidegger, Boss expresses 
in its clearest form that uncomfortable strad- 


dling of specific issues which besets the new 


existential analytic movement. 


POPULARIZING THE THINKING 
OF HEDEGGER 


The fountainhead of such theory of existen- — 


tial therapy as exists today is to be found in 
Heidegger’s writings, although in their con- 
cern for the historical setting its expounders 
also trace the theory back to St. Augustine, 
Pascal, and Kierkegaard, or link it to more 
modern thinkers such as Buber, Berdyaev, and 
even the Zen masters. The immediate reason 


for the predominant influence of Heidegger is — 


probably not his ideas as such but the fact 
that both of the important recent influences 
on the new movement, Binswanger and Boss, 
have been his disciples. Just about all the 
recent writings in this country on existentialist 
conceptions as applied to therapy are b: 

on Binswanger and Boss.4 Thus, Heidegger 
has been twice interpreted, and the conse- 
quences are far reaching for the ideas under 
discussion here. 

The most striking discrepancy in the con- 
temporary development of a school of existen- 
tialist thinking is that Heidegger should have 

“One exception to this may be Rollo May, who has 
also been influenced by the thinking of the theologian 
Paul Tillich. 
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become the thinker to be followed by psycho- 
therapists. Binswanger (1958a) once pointed 
to the paradox that Freud developed his image 
of man from experience with his neurotic pa- 
tients, thereby completely ignoring many 
exemplary aspects of his own life pattern. In 
direct contrast with this, the followers of 


Heidegger have constructed an optimistic 


and ennobling scheme on the basis of his 
philosophy, apparently ignoring both his 
harsh ethic and the more questionable aspects 
of his professional career. It is well known that 
his role, particularly in relation to his teacher 
and friend, Husserl, left much to be desired 
at a time of national crisis in Germany when 
Hitler came to power, and that in his pattern 
of living since the end of the war he has exem- 
plified the lonely, isolated thinker who is quite 
removed from those concerns which form the 
substance of therapeutic problems and prac- 
tices. More important, his views on the funda- 
mental aims of human striving are difficult to 
reconcile with familiar goals in therapy. As 
Weigert (1949) points out, in the end he offers 
no counsel more noble than to advocate the 
heroic acceptance of “relative freedom of de- 
cision” against the ground of Sorge® in an in- 
different universe. She concludes that “a 


_ psychiatrist may feel that there is something 
unconvincing in this heroic power operation, 


the isolated transcendence of fundamental 
human helplessness” (p. 409). 

The attempt to make of these views a palat- 
able concoction for the therapist and his pa- 
tients has resulted in some odd twists which 
range from errors of emphasis to serious dis- 
tortions. To take one example, in a discussion 
of Heidegger’s concept of thrownness, Hora 
(1959b) considers man’s being “thrown” into 
the world and having to fit himself into it, 
his consequent “existential dilemma,” and 
finally the possible ways he might find to 
extricate himself from the dilemma. There is 
some practical value in stating the matter 
in this way, but it misses a fundamental con- 
ception, one which is expressed by Sartre in 
terms of man’s being thrust into decision and 
by Kierkegaard in terms of the ineluctable 
terror of freedom, to the effect that thrown- 
ness is beyond the person’s apprehension or 

5 Sorge (literally “anxious care”) is a basic term in 
Heidegger’s work. It is the name for the whole structure 
of Dasein, ontologically, that is, for the Being of 
Dasein itself. 
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use. One’s existence is constituted of it; one 
can only fashion projects in its light. A dis- 
cussion of thrownness which poses it merely as 
a problem, structurally similar to a neurosis— 
that is, as a distressing affection which with 
effort and skilled help may be overcome— 
rests on the untenable assumption that it can 
somehow be encompassed by man’s knowledge 
and earnest endeavors. Indeed, not only is it 
not reducible to the level of “getting along”; 
there may ultimately be no getting along, not 
in this stern view of man and his fate.® 

Other aspects of the twice-removed inter- 
pretation of Heidegger contain misconceptions 
which if logically pursued would deny his aim 
while professing to expound his views. As is 
now well known, Heidegger’s self-assigned 
task in Sein und Zeit was “the quest for 
Being,” to revive through ontology the ancient 
problem of Being which had almost disap- 
peared as a matter of concern in contemporary 
European philosophy. An abstruse and funda- 
mental study, ontology is concerned, not with 
entities, such as persons or objects or living 
things, which manifest Being, but rather with 
that which they manifest. However one wishes 
to phrase it—as inner structure, guiding idea, 
or underlying phenomenon—Being as a sub- 
ject of study is not resolvable into what can 
be classified or described as an entity, but 
must be understood as that which infuses the 
entity and finally makes classification and 
describing possible. 

In the course of his analysis, Heidegger 
maintains one distinction without which his 
quest could not proceed—that between the 
ontological, which concerns Being, and what 
he terms the ontic, which refers to that set of 
given and classifiable objects known as par- 
ticular beings. The distinction is not identical 
with that between the abstract and the real, 
for in that case it would point to nothing but 
a problem in logic; nor is it between the ideal 


© It is perhaps less than accurate to refer to Heideg- 
ger’s view in this connection as stern, for as an ontologi- 
cal study it assumes no stand in regard to values. 
Ontology as a discipline proposes to explore the very 
ground for human existence; it must therefore precede 
rather than follow on such aspects of human living as 
values. In softening the implication of Heidegger's 
teachings by adding the mode of Love to the mode of 
Care, Binswanger would seem to have fallen into the 
error of reversing this logical sequence of ontology and 


values. 
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and the material of metaphysics. Rather, it 
refers on the one hand to Being as fundamen- 
tally manifest in facts and entities present to 
us and on the other to the concrete and fac- 
tual, through which we have our dealings with 
Being. This is a distinction which is usually 
clear and justifiable, except in the special case 


of man when he is made an object of study. . 


For a particular, individual man is an ontic 
phenomenon, yet his particularity is never 
quite that of the single entity. He is unique as 
well as individual, creator as well as victim of 
his own special fate, and he demonstrates his 
nature in the very process of being aware of 
this. Within him and through him there abides 
a special relation to Being. The totally new 
conception which Heidegger now develops to 
refer to the phenomenon which sustains this 
relation is Dasein. Although he appears to be 
talking about human beings and their prob- 
lems, in his use of this term he does not mean 
either man, or person, or particular being, or 
(as the word Dasein might ordinarily be trans- 
lated) existence in the sense of mode of living,’ 
In what amounts to his prolegomenon to a 
science of psychology, he presents his analytic 
of Dasein, particularly in respect to its basic 
structure of being-in-the-world. The term 
world is here understood, as one commentator 
(Macquarrie, 1955) puts it, as “an instru- 
mental system founded in man’s practical 
concern” (p. 49), as a nexus of relations rather 
than a locus or a set of activities. In all of 
this, it should be noted, man as an observable 
entity who is studied by the various sciences 
does not appear, nor are concrete human cir- 
cumstances the substance of the author’s con- 
cern. 

It would seem unlikely that the analytic of 


Dasein could be applied directly to specific . 


instances such as the symptoms of particular 
patients. Yet this bold move is what Bins- 
wanger (1958b) attempted in his development 
of the existential analytic school of thought, 
or Daseinsanalyse. However, he has not been 
unaware of the significance of Heidegger’s 
distinction between the ontic and the ontologi- 
cal, as his disclaimer clearly shows: ‘‘Existen- 
tial analysis does not propose an ontological 

7 The hypostasis of Thomistic metaphysics, which is 
the term of highest dignity for the subsistence of a 
person of the Trinity, furnishes a precedent sufficiently 
similar to be of more than passing interest to the 
student of the history of ideas. 
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thesis about an essential condition determini 
existence, but makes ontic statements—that 
is, statements of factual findings about ac- 
tually appearing forms and configurations 
existence” (p. 192). Recently, in fact, he stated 
that his entire attempt at an ontic “transl, 
tion” may have been an error but that it w: 
a productive error. 

In the course of reinterpreting Binswanger’s 
interpretation, however, recent writers on 
existential analysis appear to have overlooked 
this distinction between ontic and ontological, 
changed the meaning of ontology almost be- 
yond recognition, and engaged in a wholesal 
application of phenomenological terms 
clinical concepts which are only verbally simi 
lar. Examples are numerous in the receni 
journal literature: ‘“Being-there”’ refers 


ways of dealing and communicating with th 
environment” (Hora, 1959b, p. 83); and 
again, “... mode of being-in-the-world is the 
individual’s unique way of encountering and 
dealing with the world” (Van Dusen, 1959, 
p. 105). The “existentials” of Heidegger are 
equivalent to “inborn qualities” of the person, 
something like instincts (Weigert, 1949). The 
“ontological basis” of the psychological in- 
sights achieved by existential analysis is “the 
study of man as the being who has these par- 
ticular problems” (May, 1958b, p. 36). “Self 
transcendence”? means that one transcends 
one’s immediate and selfish interests (Hora, 
1959b). “In everyday terms, being embraces 
non-being in the sense that we can be awat 
of death ... can by self-awareness encompass 
death” (May, 1958a, p. 49). A style of life, 
in Adler’s sense, and the striving for superior 
ity as well, are “ontologically grounded,” for 
Adler isolated “ontologically fundamental 
aspects of man...showing their dynamics 
(Van Dusen, 1959a, p. 105). Existential psy; 
chotherapy “unites science and ontology : 
(May, 1958b, p. 36). “Man’s mode of being: 
in-this-world depends very much on his episte- 
mological point of view, that is . . . the natur? 
of his knowledge...” (Hora, 1959a, p. 109 
Being is synonymous with “what exists, a 
ontology “when applied in psychology - ++” 
® This hyphenated term is a literal rendering of t° 
German Da-sein. — f in Heideg 
9 Temporality is of central importance in 
ger’s thinking, so much so that he has coined the te" 
Zeitigung to express the “engendering” of time 
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the study of man’s being or nature” (Van 
Dusen, 1959b, p. 143). 

These statements, which constitute a fair 
sampling of what has been offered in the re- 
cent literature as the philosophic basis for 
existential analysis, have no legitimate connec- 
tion with Heidegger and manifest only the 


barest resemblance to Binswanger. As ideas, 


they may be useful in themselves. For the 
demonstration of this, one may only wait and 
see. But to claim more for them at the mo- 
ment would seem to bespeak at best a neglect 
of the sources from which they are supposed 
to have been learned and at worst an ill- 
founded attempt to make use of the prestige 
of those sources. 

Clearly, existentialism as a movement in the 
history of ideas has profound and perhaps en- 
during contributions to make to every signifi- 
cant human endeavor. Some aspects of the 
vision which inspires its protagonists may be 
found, for example, in Lefebre’s (1959) con- 
clusion: “‘... the conception of man as one 
who seeks for his own truth and who falls ill 
if he cannot bring his truth into living reality 
is what distinguishes existential analysis from 
other psychotherapeutic schools of thought” 
(p. 118). Binswanger’s conception of the “vo- 
cation” of the therapist, as described by 
Weigert (1949), also reaches this level: “... 
open to visions of togetherness, totality, to 
existence as love, to the healing potentialities 


of the creative Eros” (p. 408). But there is 


still the possibility that this inspired and 
historically necessary expansion of the healing 
mission of the therapist will be dissipated in 
hollow claims. Existential psychotherapy to- 
day is more gaudy fiction than fact; yet it re- 
mains a hope. 
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NE of the most commonly observed 
and widely studied phenomena in 
interpersonal relations is that the 

degree of similarity among persons is a power- 
ful factor in their reactions to one another 
(Asch, 1952; Back, 1951; Raven, 1959; Schac- 
ter, 1951; Sherif, 1936). The present paper is 
concerned with the development of similarity 
among persons when one person models him- 
self after another. Our general hypothesis is 
that the perception of similarity among per- 
sons in limited respects leads to the perception 
of similarity among them in other respects. 
This tendency to generalize similarities is as- 
sumed to be one form of the phenomenon 
often described as identification. 

In keeping with the cognitive emphasis of 
the study, we employ the self-concept as a 
central explanatory concept. A person’s self- 
concept is assumed to include a number of 
attributes. The individual also develops con- 
cepts of other persons. He may then perceive 
that one or more of the attributes of his self- 
concept is similar to one or more attributes of 
his concept of a particular other person, 
termed the model. This similarity is designated 
as a first similar attribute (FSA). Our key 
proposition holds that the presence of an FSA 
creates a tendency for the person to conceive 
of himself as similar to the model on other 
attributes, provided that there is no clear 
evidence to the contrary. These additional 
similar attributes are termed derived similar 
attributes (DSAs). It is further assumed that, 
the more FSAs there are between a person 
and a model, the more DSAs will develop. 

The process of generalizing similarities is 
assumed to take either of two forms, introjec- 

1This research was financed by Grant-in-Aid 
M-701 (C) from the National Institute of Mental 
Health, United States Public Health Service to the 
Research Center for Group Dynamics, University of 
Michigan, Alvin Zander, principal investigator. 
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tion or projection. Introjection is the cognitive 
process whereby an attribute previously pres- 
ent only in a person’s concept of the model 
now becomes an attribute in his self-concept. 
Projection is the process whereby an individual 
changes his concept of a model to include at- 
tributes previously present only in his self- 
concept. These two processes are assumed to 
be types of identification. 

The study concerns two hypotheses con- 
cerning introjection and projection: 

Hypothesis 1: The greater the number of 
FSAs in a person’s perceptions of himself and 
another individual, the greater is the tendency 
for the perceiver to introject DSAs into his 
concept of himself. 

Hypothesis 2: The greater the number of 
FSAs in a person’s perceptions of himself and 
another individual, the greater is the tendency 
for the perceiver to project DSAs into his con- 
cept of the model. 

The validity of one additional assumption 
is examined: that each DSA introjected into 
the self-concept can serve as an FSA insofar 
as future introjections and projections are 
concerned, thus, increasing the probability 
that additional DSAs will be introjected. The 
introjective process, in short, can be cumula- 
tive with each DSA making it more probable | 
that there will be further DSAs. | 


METHOD 


Each subject was provided an opportunity to dis- 
cover that she had attributes in herself that were 
similar in some degree to attributes in another. She v 
next made aware of additional attributes of the model l 
in order to see if she would introject these f 
attributes (DSAs) into her concept of herself. Then she 
was made aware of other attributes in her concepti 
of herself and given the opportunity to project these f 
into her concept of the model. Finally, the subject 
was queried in respect to matters relevant to intro- 
jection and projection. l 

The subjects were 70 undergraduate women. re 
cruited from elementary biology courses at the Uni- 
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versity of Michigan on the pretext of an experiment on 
the nature and causes of personal tastes and preferences. 

Procedure. Subjects were brought into the labora- 
tory one by one so as to put each in a private cubicle 
without allowing her to know who else was participat- 
ing in the study. Subjects were provided with earphones 
and a microphone “so they could hear the experimenter 
and the other subjects, and talk to these persons.” In 
reality, however, everything the subjects heard through 


the earphones had been previously transcribed and the 


microphones were dead. 

Each subject was informed that she would work with 
two other persons. The other two were labeled A and B 
and the subject was told that she was Person C. She 
was to respond in her microphone, she was told, in 
answer to questions asked by the experimenter and to 
write her responses, and those of the others subjects, on 
forms prepared for this purpose. Because comments 
by Models A and B were on a tape, any number of 
subjects could be used, each one responding as though 
she was a member of a three-person group. 

After the subjects were seated, the experimenter 
began playing the transcription. They heard the experi- 
menter’s voice explaining the purposes and procedure. 

Phase 1. In order to create the required experi- 
mental conditions each subject had first to be made to 
sense that she had attributes in her concept of herself 
that were similar to attributes in her concept of a model. 
Each subject was therefore led to believe that her 
preferences among seven brief musical compositions 
were highly similar to the choices of one of the models, 
and highly dissimilar to the choices of the other model? 
The subjects heard the experimenter ask them to decide 
which of two melodies they preferred in each of seven 
pairs, They listened to each melody twice, made their 
choice, and recorded it in writing before they heard the 
other participants’ voices reporting their preferences. 
Periods of silence at appropriate points in the tape 
made it possible for the subjects to state their choices 
into the microphone when their turns arrived. The 
choices made by the Models A and B were also recorded 
in writing by the subjects, to make them aware of their 
own musical preference in relation to the preferences 
of the other two persons. 

To ensure that the subjects would be likely to ob- 
serve by the end of Phase 1 that they were more similar 
to one of the models than to the other, melodies un- 
familiar to all subjects were composed for the purpose. 
All tunes were pleasant and fairly simple in form. By the 
use of pretests, seven pairs of melodies were created in 
which one tune in each pair was very likely to be prefer- 
able over the other. One of the models consistently 
stated preferences from among the more popular tunes 
while the other spoke for the less commonly liked ones. 

In order to rule out preference for one voice over the 
other, Model A gave the more popular choices for half 
the subjects and Model B did this for the other half. To 
avoid suspicion, A and B gave the same preferences on 
two of the seven pairs. 

Phase 2. The purpose of the next step was to make 


2 The authors wish to express their appreciation to 
Edwin J. Thomas and Albert Geibler for their assist- 
ance in playing and preparing the music. 
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it possible for the subjects now to introject other at- 
tributes of a model into their own concepts of them- 
selves. Nonsense syllables with low association value 
(Glaze, 1928) were used in this instance, to avoid strong 
preferences based on association. The task was similar 
to that for the melodies; namely, that all were to state 
which syllable they preferred in 10 pairs. 

The procedure was different from the one used with 
the melodies in one important respect. Each pair of 
syllables was typed on a different card and each subject 
was instructed not to look at the next pair of syllables 
“until the experimenter asked her what her preference 
was. Since all subjects were in Position C, they heard 
the choices of A and B, stated as “first syllable” or 
“second syllable,” and recorded these on the appro- 
priate form, before they saw the pair of syllables to 
which these preferences referred. This procedure was 
justified to the subjects by explaining that since their 
first impressions were wanted they were not to know 
the nature of the syllables until the last possible mo- 
ment before making their choices. 

Each subject, thus, came to know the attributes of A 
sand B without having the attribute present in herself, 
When it was the subject’s turn to respond, she turned 
the card, read the syllables, and made her own choice, 
In making this choice she could introject an attribute 
from the model into her concept of herself. 

Each subject had to be made to feel free to make her 
choices without any concern about how the other would 
react to them, so that she might not hesitate about 
disagreeing consistently with one of the models if she 
wished to do so. This problem was not present in the 
case of the preferences for music since each subject 
made her private selection before she heard the choices 
of the others. In order to avoid possible concern over 
the model’s reactions, the experimenter made a fictitious 
discovery as this phase of the experiment began, that 
C’s microphone had gone dead. She would still be able 

to hear A and B, but the latter two would not be able 
to hear her. After some apparent hesitation, the experi- 
menter “decided” to go ahead, cautioning C to make a 
careful record of her own choice when her turn came 
since the others would not be able to do so for her. 
Each C, then, made her choice after knowing what the 
choices of the other participants were, but made it in 
private and with the knowledge that the other persons 
would not know which syllable she had selected. 

Phase 3. The objective for Phase 3 of the experiment 
was to make it possible for each subject to project 
attributes of herself into her concept of the model. It 
was assumed that subjects would do this more often to 
a person with whom they had many similar attributes 
than to a person with whom they had few similarities, 
The subjects therefore had to be made aware of some 
attribute in themselves that had not previously been 
present in their concepts of the model, and to provide 
the possibility for them to assign it to a model. 

Each subject was told that she was to pick from pairs 
of girls’ names, recited aloud by the experimenter, the 
one she liked best. This step was intended to make her 
aware of a particular attribute in her concept of her- 
self, Next, she was to predict which names A and B 
would pick, permitting her to project her own prefer- 
ences onto the models. The experimenter read each 
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pair of names twice and allowed a 30-second period for 
the choice and predictions. In all cases these choices 
were written and were not “announced” on the com- 
munication system by the subjects, 

Phase 4. At this point questionnaires were dis- 
tributed. The subjects were queried about their strength 
of preference for the various materials on which they 
had made their choices, their judgment about which 
pair of subjects were most similar and least similar, and 
their perceptions of themselves and the other partici- 
pants. 

After the experiment was over, the subjects were 
given a full explanation and all their questions were 
answered, 


REsuLTs 
Main Hypotheses 


According to the first hypothesis, the greater 
the number of FSAs, the greater the number 
of DSAs introjected. Since there were two 
potential models for each subject, the number, 
of agreements with each on preferences for 
melodies was counted. Then the smaller of 
these figures was subtracted from the larger. 
This difference score was the measure of the 
FSAs with the model with whom the subject 
agreed more. Since the models stated the 
same preference on two of the seven pairs, 
the difference scores could be 5, 3, or 1. To 
obtain the DSAs, the number of agreements 
with each model on nonsense syllables was 
computed, and the smaller subtracted from 
the larger, obtaining a difference score of 
8, 6, 4, or 0. A positive DSA meant more agree- 
ments with the model who had been agreed 
with more on melodies; a negative DSA meant 
more agreements on nonsense syllables with 
the other model. 

To test the first hypothesis, a tau correlation 
was calculated between the FSAs and the 
DSAs (considering sign). The correlation 
(16) was in the predicted direction, but not 
significantly so (p = .09). Perhaps this re- 
sulted from the narrow range of scores on 
FSAs. 

Another test of the hypothesis was per- 
formed by dividing the subjects into three 
categories. The first, called introjectors, were 
those whose preferences on nonsense syllables 
were in five or more instances the same as the 
ones made by the person with whom they 
had agreed most often in musical choices. The 
second category, nonintrojectors, contained 
those whose choices of nonsense syllables 
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agreed with one model four times and with 
the other model four times. The third category, 
negative introjectors, included those whose 
preferences on the nonsense syllables were in 
five or more instances different from the ones 
chosen by the person with whom they had 
agreed most often when making choices of 
preferred melodies. Table 1 shows that more 
Persons were in the first category of intro- 
jectors than would be expected by chance, 
thus, supporting Hypothesis 1. 

A further consideration of the results rele- 
vant to Hypothesis 1 concerns an unexpected 
manifestation of social support for the FSAs. 
Since both models made the same choices on 
two of the seven pairs of melodies, any subject 
could observe that her musical preferences 
were identical with those of both the models 
on 2, 1, or O occasions. The subjects were 
divided into those whose musical preferences 
corresponded to the choices of both models on 
both possible occasions and all others, and 
the number of introjectors, nonintrojectors, 
and negative introjectors in each of these two 
groups was computed. In Table 2, although 
the interaction x? is not significant, the data 


TABLE 1 
NUMBER or Persons WHO DEMONSTRATED DIFFERENT 
DEGREES or INTROJECTION 


A Non- Negative + 
Introjectors = introjectors Introjectors N xt Sdf $ 


36 (25.07)* 10 (18.86) 69 9.09 2 .02 
“Figures in parentheses indicate expected frequencies. For 
nonintrojectors this was the proportion of subjects expected „to 
have similar choices four times with one model and four times with 
the other out of a total of eight choices. The other two expected 
frequencies: remaining proportion of subjects, halved, and multi- 
plied by the total N. Subsequent tables have expected frequencies 
calculated in the same way except when stated otherwise. 


TABLE 2 
EFFECT OF UNANIMITY AMONG MUSICAL PREFERENCES 
OF SUBJECTS AND MODELS ON NUMBER OF 
Persons WHO INTROJECTED 


23 (25.07) 


any of jec-| „ Non- | Negative 
o Gidea | Ptole | introjec- |Inejece | xt |as] 2 
Pairs of Tunes tors tors 
2 10 (5.09) | 1 (3.82) | 3 (5.09) | 7.68 | 2 |.025 
lord 26 (19.99)] 9 (15.09)|20 (19.99)| 4.22 | 2 |.25 
Total x? 11.90 | 4 |.04 
Pooled x? 136 (25.07)|10 (18.86)|23 (25.07)| 9.09 | 2 |.02 
Interaction x? 2.81 | 2 |.25 


ee a a= 
pm eee 
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~ suggest that. those who received more social 
support for their FSAs are more likely to be 
introjectors than those who received less. 

Since the melody pairs were made as alike 
as possible, and neither tune was distinctly 
unpleasant, an unknown degree of uncertainty 
was created for the subjects, which may have 
affected the degree of introjection. In the 
cases in which the subjects found themselves 
in agreement with both the models on the two 
pairs of melodies, however, their preferences 
may have gained in social reality (Schacter, 
1951). The greater certainty that probably 
resulted from the social reality may then have 
increased the tendency to introject. 

Another test of this reasoning is provided 
by an examination of the differences in the 
number of DSAs made by introjectors with 
high and low degrees of social support for 
their preferences on the melodies. In order to 
test the last ad hoc hypothesis, it is first neces- 
sary to show that the two groups did not differ 
in the mean number of FSAs. A test of the 
differences in the mean number of FSAs 
yielded a ¢ of only .15. The introjectors with 
high social support had a mean DSA score of 
3.80, which was significantly higher than the 
mean of 2.96 for the other introjectors (¢ = 
1.56, p < .05). The results support the ad hoc 
hypothesis that greater social support gener- 
ates more introjection. 

The second hypothesis asserted that the 
greater the number of FSAs, the more DSAs 
the individual projects into his concept of the 
model. In this case, the FSAs were computed 
by first calculating the total number of simi- 
larities in preferences with each model for 
both melodies and nonsense syllables and 
then subtracting the smaller from the larger 
figure. The degree of projection was measured 
by the number of times a subject assigned to 
a model the same preferences in names as 
her own. The first test of the second hypothe- 
sis was a tau correlation between the FSAs 
and the degree of projection on to the more 
similar model. This correlation was .22 (p < 
.02, one-tailed), supporting the hypothesis. 

Another test of this hypothesis was made by 
dividing the subjects into three categories. 
The first, called projectors, were those whose 
preferences for names were granted more often 
to the model with whom they had been more 
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similar in choices of melodies and nonsense 
syllables. Nonprojectors included subjects 
whose favored names were assigned to one 
model and to the other to an equal degree, 
whereas negative projectors were those whose 
preferences for names were more often assigned 
to models with whom they had least similarity 
on the music and nonsense syllables. These 
data are presented in Table 3. 

The results show that the number of per- 
sons in the projector’s category and in the 
other two categories were unlikely to occur 
by chance, again supporting the second hy- 
pothesis. 

Considering FSAs based on the number of 
similar choices on melodies and on nonsense 
syllables separately, a tau correlation coeffi- 
cient of the degree of similarity on musical 
preferences and the amount of projection on 
names was .22 (p < .02), but the correlation 
between the degree of similarity on nonsense 
syllables and the amount of projection on 
names was near zero and not significant. The 
degree of similarity in the choices on music, 
thus, accounted for almost all of the relation- 
ship between the first similar attribute and 
the amount of projection. The assumption that 
a DSA may function as FSA in subsequent 
projection is not supported. 


Strength of Preferences 

Well established FSAs may be more likely 
to lead to introjection than less well estab- 
lished ones. Persons who strongly preferred 
the tunes that were the basis of the FSAs 
might therefore be more likely to be intro- 


TABLE 3 
NUMBER oF PERSONS DEMONSTRATING DIFFERENT 
DEGREES OF PROJECTION 


Non- | Negative | y 


injectors | Projectors x | of) $ 


16.57 | 2 | .001 


Projectors 


42 (27.19)" | 12 (11.62) | 12 (27.19) 66 


â These expected frequencies were calculated by determining 
the number of times out of 66 that one would expect by chance to 
get two series of 10 binary choices (predictions of subject for each 
model) which have an equal number of matches to a given series of 
10 choices (own preferences). This gives the expected frequencies 
for the nonprojectors and the other two are calculated from the 
remainder. 

b The total of subjects is 66 because 2 misunderstood the in- 
structions and made no predictions. Two others had zero scores 
on the number of similarities on music plus nonsense syllables. 
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jectors than those who had weak preferences. 
The subjects rated the degree that they pre- 
ferred the tunes they chose on a five-point 
scale in the final questionnaire. The number of 
subjects who had various degrees of introjec- 
tion was determined for those subjects who 
stated that they had strong and weak prefer- 
ences. Table 4 shows that a larger proportion 
of the persons who had strong preferences were 
in the introjection category, whereas a greater 
proportion of those with weak preferences 
were in the negative introjection category. 
The value of the interaction chi square indi- 
cates that the distribution of the persons in 
the groups with strong and weak preferences 
over the three introjector categories is signifi- 
cantly different. Thus, strongly based FSAs 
are more likely than weaker ones to lead to 
introjection. 

Parallel results obtain for projection. In 
Table 5, it can be seen that those with strong 
preferences were projectors significantly more 
often than nonprojectors or negative pro- 
jectors. Those with weak preferences were 
not projectors significantly more often, but 
the interaction x? was not significant. In gen- 
eral, there is a tendency for those with strong 
FSAs to introject and/or project. 


Perceived Similarity 
To measure the subjects’ awareness of their 
similarity to the models in the postquestion- 
naire, the subjects were given a triangle with 
the letters A, B, and C at the vertices (the 
subject was always C), and asked to place an 
“M” on the line connecting the most similar 
pair and an “L” on the line between the least 
similar. The first line of Table 6 shows that 
subjects had a strong tendency to perceive 
themselves as similar to the model with whom 
they agreed more on the melodies. When sub- 
jects with strong and weak preferences on 
music are compared, Table 6 shows that those 
with strong preferences tended to perceive 
themselves consciously as similar to the model 
with whom they agreed more on melodies. 
Those with weak preferences did not show a 
parallel trend significantly. (However, an in- 
teraction x? was computed and not found to 
be significant.) 
Do persons who are consistently similar to a 
given model on all types of stimuli perceive 


TABLE 4 


EFFECT OF STRENGTH OF PREFERENCE FOR MELopy 
on NUMBER OF Persons INTROJECTING 


Introjector Categories 


Si of 
Musical x? |df| p 
Preferences | Introjec- | ; NOn- | Negative 
tors DA Introjectors 
Strong 29 (17.8) | 9 (13.4) | 11 (17.8) [11.09 | 2 |.005 


Weak 7 (6.9) | 1 (5.2) | 11 (6.9) 5.88 | 2 |.02 
Total x? 16.97 | 4 |.02 
Pooled x? 136 (24.82)|10 (18.36)| 22 (24.82)| 9.17 | 2 |.02 
Interaction x? 7.80 | 2 |,02 


* One subject did not respond to the questionnaire items, This 
accounts for the reduction in frequency from 23 to 22 and the al- 
teration of the theoretical frequencies from those previously used. 


TABLE 5 


EFFECT or STRENGTH OF MUSICAL PREFERENCE ON 
NUMBER or PERSONS SHOWING PROJECTION 


Projection Categories (Based 

Strength of Only on Musical Similarities) 

posie x |df| p 
ferences 

ii Nonpro- | Negative 

Projectors! jectors Projectors 
Strong 31 (19.78)|10 (8.44) | 7 (19.78)|14.91 | 2 |.001 
Weak 7 (7.42) | 3 (3.16) | 8 (7.42) | .08 | 2 |975 
Total x? 14.99 | 4 |.02 
Pooled x? 38 (27.20)/13 (11,60)/15 (27.20)| 9.93 | 2 |.01 
Interaction x? 5.06 | 2 |.10 


eh a ee N A 


this similarity more than those whose simi- 
larity is not consistent? The subjects were di- 
vided into three groups. One, containing 36 
subjects, was composed of those who had, on 
all three types of choice materials (melodies, 
nonsense syllables, and names) preferences 
that were similar to those of the same model. 
Of these 36 persons, 27 judged themselves as 
more similar to this model (x? = 9.0, p< 
.05). The second group contained those sub- 
jects who were more similar to the same model 
on two out of the three types of choice mate- 
tials. There were 23 subjects in this group, of 
which 21 indicated that they saw themselves 
as most similar to this model 6? = 15.7, p < 
-001). The remaining 8 subjects (3 did not 
give sufficient data on the triad question for 
inclusion here) had similarities that were en- 
tirely inconsistent, in that they tended to 
match those of one model on one type of 
choice material, the other model on a second 
type of choice material, and both models 
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TABLE 6 
NUMBER OF PERSONS CONSCIOUSLY AWARE OF SIMILARITY WITH MODELS 


Subjects Who Rate Subjects Who Rate 
Selves as More Simi- Selves as More Simi- 


lar to P Who Has lar to P Who Has y 7 P 
the More Matches the Fewer Matches x if bd 
with Subject’s with Subject’s. 
Musical Preferences. Musical Preferences. 
All subjects 48 18 66* 13.70 1 001 
Subjects having strong musical 37 11 48 14.08 1 .001 
Die 
Subjects having weak musical 11 7 18 49 1 -50 
preferences 


® Omits four subjects; one subject had zero music similarity with models, and three subjects gave no judgments on triads. 


equally on the third type. These subjects did 
not tend to see themselves as more similar to 
one model than to the other. It is clear that 
subjects who were similar to a model to a 
consistent or moderately consistent degree 
were likely to be aware of this similarity. 

Finally, we ask if a subject who was aware 
of her similarity to the model on the choice 
materials generalizes the similarity to derived 
attributes. Subjects were asked to write a 
brief paragraph, on the basis of what they had 
learned during the experiment, describing 
each of the two models, and to write an addi- 
tional one about themselves. From each para- 
graph those phrases were abstracted which 
contained traits, abilities, or characteristics 
not directly associated with music, sounds, 
or names. These were considered to be de- 
rived attributes that developed in the sub- 
jects’ concepts of the models as the result of 
experience with them. 

A panel of three judges determined whether 
the content of the paragraphs describing 
Model A or Model B was closer to the descrip- 
tion each subject made of herself. Only those 
subjects who wrote three paragraphs and for 
which there was unanimous agreement among 
the three raters were used in this analysis. Of 
the 37 remaining, 26 rated themselves as more 
similar on the triad to that model whose de- 
scription had been judged by the panel as 
most like their own. The other 11 persons saw 
themselves as most similar to the other model. 
These frequencies when compared to a chance 
expectation of equal distribution provide a 
x? of 6.08 (p < .02). We conclude that persons 
who were aware of their similarity to others 
on the experimental materials most often 


attribute further similarity to others on sepa- 
rate matters. 


Discussion 


We have learned from this experiment that 
the processes of introjection and projection 
may develop where there are FSAs between 
one’s concept of himself and his concept of 
another person. Furthermore, there is some 
evidence to suggest that the greater the simi- 
larity, the more is introjection or projection 
likely to occur. The degree of social support 
for the FSAs and the strength of the prefer- 
ences for those attributes increased the degree 
of introjection and projection of DSAs. 

The results from the use of nonsense sylla- 
bles with low associative value as attributes to 
be introjected were not always strongly signifi- 
cant. The failure of the variation in degree of 
similarity on nonsense syllables (DSAs) to 
contribute to the projection of names may 
have also been due to the low association value 
of the syllables. Because the syllables had 
little meaning for the subjects it may have 
been difficult for them to associate the sylla- 
bles with other parts of their cognitive struc- 
ture, and therefore not possible for them to 
associate preferences on nonsense syllables 
with other attributes of models. To be in- 
volved in the processes of introjection or pro- 
jection as FSAs or DSAs attributes may need 
to be relevant or meaningful to the perceivers. 


SUMMARY 


Subjects working in isolation were led to 
believe that they were the third member of a 
three-person group and were asked privately 
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to write their preferences among eight pairs 
of brief tunes. After they had made each 
choice within each pair, they heard over ear- 
phones the two other choosers state their 
preferences. The subjects then stated their 
choices into a (dead) microphone. The two 
other choosers were in fact paid participants 
whose voices were transcribed. By appropriate 
means each subject came to perceive that her 
preferences were more similar to one of the 
two paid participants than the other. The 
subjects then made preferential choices from 
a series of pairs of nonsense syllables. In this 
instance, however, the subjects heard the 
selections of the other two persons before 
making each decision, but she was led to be- 
lieve that her own selection could not be 
heard by the other subjects. It was found, as 
hypothesized, that the subjects tended to 
prefer the nonsense syllables chosen by the 
paid participant with whom they agreed more 
often on musical preferences, provided that 
the musical preferences were strong, or pro- 
vided that the subjects received social support 
for their musical preferences. 
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Subjects were also asked to make choices 
from a series of pairs of girls’ names and were 
asked to guess the preferences of the other 
two members. It was found, as hypothesized, 
that subjects assigned their own preferences 
on girls’ names to the paid participant with 
whom they more often agreed on musical 
preferences. 

The results were interpreted as giving sup- 
port to a cognitive theory of identification. 
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REvious research (Stotland, Zander, & 

Natsoulas, 1961) has shown that inter- 

personal identification can usefully be 
conceived in cognitive terms as a generaliza- 
tion of interpersonal similarities. The earlier 
study demonstrated that when a person finds 
that he has some characteristics, called first 
similar attributes (FSA), in common with 
another person or model, he tends to perceive 
himself as having other characteristics, called 
derived similar attributes (DSA), in common 
with the model. In the previous study, the de- 
rived similar attributes were trivial ones such 
as preferences for nonsense syllables and girls’ 
names. One of the purposes of the present 
research is to determine whether generaliza- 
tion of similarity can occur where the derived 
attributes have more inherent importance to 
a person than nonsense syllables and girls’ 
names. Primarily, it asks whether identifica- 
tion can influence a person’s evaluation of 
himself. It also complements the exclusively 
cognitive emphasis of the previous study by 
inquiring whether an individual’s desire to 
perceive himself favorably affects his identify- 
ing with a model. 

The theory employed in the previous study 
divided the process of identification into two 
subprocesses, introjection and projection. In- 
trojection was defined as the process by which 
the person changes his conception of himself 
to be more like his concept of the model, i.e., 
the DSA is located in his self-concept. On the 
other hand, projection is defined as the process 
by which he changes his conception of the 
model to be more like his concept of himself, 
i.e., the location of the DSA is in the person’s 
conception of the model. In the present study, 


1This research was financed by a Grant-In-Aid 
M-701 (C) from the National Institute of Mental 
Health, United States Public Health Service to the 
Research Center for Group Dynamics, University of 
Michigan, Alvin Zander, principal investigator. 


the main focus is on introjection. The first 
hypothesis replicates one in the previous 
study. 

Hypothesis 1: The more similarities (FSAs) 
a person perceives between his conception of 
himself and his concept of another person or 
model, the greater is his tendency to introject 
DSAs into his concept of himself. This hy- 
pothesis refers to DSAs that are neutral with 
regard to the person’s self-evaluation. 

Given this view of introjection, it is plausi- 
ble to assume that a person may employ the 
process instrumentally to acquire desirable 
DSAs. Such an assumption implies that a 
person who has few FSAs with a positive 
model (i.e., a model who has desirable attri- 
butes) is more likely to attempt to seek further 
evidence of similarities than one who has a 
similar number of FSAs with a negative model. 
Take a person who has only a few neutral 
FSAs between his self-concept and his con- 
cept of a model and who then discovers that 
the model has other desirable or undesirable 
qualities. The person’s motivation to think 
well of himself, we assume, has a strong effect 
upon his cognition. If the attributes of the 
model are negative, the person tends to de- 
velop few, if any, additional neutral FSAs and 
may attempt to weaken those that already 
exist by changing attributes of himself, by 
distorting perceptions, or by attempting to 
deny that similarities exist. By doing so, he 
minimizes the probability of introjecting the 
negative qualities. On the other hand, if the 
model’s attributes are positive, the individual 
should attempt to increase the number of 
FSAs by appropriately changing attributes of 
himself, distorting or denying, etc. It is, of 
course, assumed that persons have learned 
through past identification how introjection 
can be employed to acquire positive qualities 
in the self-concept. 

Tf the person has a high number of neutral 
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FSAs with a negative model, he should be 
less likely to be able to reduce his similarity 
than one with few FSAs, since, we assume, 
it is more difficult to change many established 
FSAs. Likewise, if he has a high number of 
FSAs with a positive model, he may already 
have sufficient basis for introjecting desirable 
DSAs and therefore less need to increase his 
FSAs. 

Hypothesis 2 then is: Given situations in 
which persons and models have few FSAs, a 
person who perceives a positive model should 
attempt to develop more FSAs with the 
model than a person who perceives a negative 
model. 

We have, thus far, assumed that identifica- 
tion based on similarity can be used instru- 
mentally to change self-evaluation. But the 
self-evaluation of a person may also be affected 
by the presence of FSAs that he has not him- 
self established but occur because of factors 
beyond his control. The critical question here 
is whether a person’s cognitive processes can 
lead him to acquire even undesirable attri- 
butes in his self-concept. 

Hypothesis 3: The greater the number of 
FSAs between a person and a model, the 
more the person should raise his estimate of 
the degree he possesses a socially desirable 
attribute if the model is high in this attribute, 
and lower his estimate if the model is low in it. 


METHOD 


Young boys were addressed by a person in an in- 
teresting role—a deep-sea diver—taken by one of the 
authors, He told them of the abilities one must have in 
order to be a diver and of his own abilities, preferences, 
and experiences in his career, Two weeks prior to this 
talk the boys had completed a questionnaire in which 
they had been asked to rate their ability in a number of 
skills. The diver described his own ability in many of 
these same activities, For half of the boys he depicted 
himself as highly competent in these areas and for the 
other half he revealed that he was grossly incompetent. 
Within each of these two degrees of competence, the 
diver was presented as either highly similar or dis- 
similar to the boys. In a third treatment nothing was 
said concerning degree of similarity. After the diver’s 
speech was finished, measurements were obtained from 
the boys concerning evaluation of their abilities in the 
same skills they had previously rated as well as other 
reactions necessary for testing the hypotheses. The ex- 
periment design is shown in Table 1. 


Subjects 


The subjects were 118 sixth, seventh, and eighth 
grade boys in a small school located in a thinly popu- 


TABLE 1 
DESIGN or EXPERIMENT 
(Ns in parentheses) 


Similarity Conditions 


High Low No Induction 
Similarit Similarity Concerning 
Induces Induced Similarit 
(High Sim) (Low Sim) (No Ind! 
Ability level | High (21) | High (19) | High (18) 
of diver Low (19) | Low (20) | Low (20) 
(model) 


lated suburb of a large industrial city.* Persons of these 
ages were chosen because they would probably have 
little information, contradictory or otherwise, about 
the abilities and preferences of deep-sea divers and be- 
cause it was unlikely that their perceptions of them- 
selves were so strongly fixed that they would be un- 
changeable, 


Pre-experimental Measures 


Baseline measurements were obtained 2 weeks before 
the experiment. All subjects chosen for the study were 
assembled in the school gymnasium. The administrators 
of the pretest (who did not take part in the subsequent 
experiment) were presented to the boys as representa- 
tives of the “district board for school planning.” 
Because the school was in the process of building a new 
gymnasium, the boys were told that information was 
needed for planning purposes about what physical 
activities the boys could do well and what ones they did 
poorly; also, the board wanted some idea of what the 
boys liked to do and what kind of persons they were, 
To provide this information they were asked to com- 
plete a “Describe Yourself Questionnaire.” 

This questionnaire consisted of two sections, In one 
they rated themselves on eight-point scales in respect 
to nine skills: excellence in swimming, ability to hold 
breath under water, ability to judge distance, capacity 
for remaining calm in an exciting situation, capacity for 
doing complex tasks with hands and fingers, ability in 
throwing a baseball, ability in throwing a football, 
excellence in shooting baskets, and ability to high- 
jump. The first five abilities listed were those which 
would play a part in the experimental induction; the 
last four were inserted as “cover” questions. 

The second section contained eight pairs of abstract 
sketches and six pairs of names of colors, The subjects 
were asked to state which member of each pair they 
preferred. Other simple and concrete pieces of informa- 
tion were requested in order that the subjects would 
believe that they were describing the kinds of persons 
they were. 


Experimental Conditions 


The diver met with small groups containing 10 to 15 
boys. Among the 12 available groups, 4 were given in- 


? The authors wish to express their deepest apprecia- 
tion to Raymond Cypher, and Frank Rehamek, 
Atherton, Michigan, for their kind assistance in 
providing subjects for this experiment. 
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formation intended to generate an awareness that the 
subjects and the model were highly similar, 4 were given 
to understand that the boys and the model were not at 
all similar, and 4 were told nothing about the degree 
of similarity between the model and boys so as to allow 
spontaneous perception of different degrees of similar- 
ity. 

In all groups the model was introduced to the boys 
as a member of the “University of Michigan Institute 
of Marine Salvage Engineering.” The meaning of these 
terms was explained and they were told that the diver 
was going to speak about the abilities that are necessary 
in deep-sea diving. Before getting to the topic of his 
lecture, in the high and low similarity conditions, he 
made certain introductory remarks concerning either 
the degree of his similarity or lack of similarity to the 
listeners, 

Induction of high and low similarity, In the high 
similarity (High Sim) condition the diver stated that he 
was interested in meeting with the group because at 
their age he was very much like them. He was born and 
raised in the neighborhood. He went to a school like 
this one. His father worked in the factory that employs 
most of the subjects’ fathers. He participated in 
activities peculiar to boys in that area: swimming, 
boating, and fishing in a nearby lake. In fact, he went 
on, the principal was impressed at the similarity be- 
tween the model’s life and the lives of the subjects. 
This had led him to ask the diver to take a short test 
the boys had taken several weeks ago (the abstract 
drawings and colors). The principal was said to have 
been amazed at the similarity between the diver’s 
choices and those of many, but not all, of the boys. He 
thought the boys would be interested in seeing this 
similarity. He then handed back to each boy the part of 
the premeasurement questionnaire that contained their 
choices among the drawings and colors and referred to 
it as a test of personality which accurately describes 
“the kind of person you are.” On the same sheet, sub- 
jects were told, the principal had seen to it that the 
diver’s answers were copied next the choices made by 
each subject. The alleged choices by the diver were 
inserted on each answer-sheet so that the diver ap- 

ed to have made the same choices as the subject in 
12 out of the 14 pairs. After a brief period to allow study 
of this information, and for each subject to discover that 
he among all the boys was certainly very similar to the 
model, the papers were collected. 

In the low similarity (Low Sim) condition the experi- 
mental sequence was exactly the same as in the High 
Sim except that an attempt was made to induce a 
perception of the diver as a person who was not similar 
to the boys. Thus, the diver stated that this group of 
boys was of interest to him because he at their age was 
very unlike them. He was born and raised in a densely 
populated, apartment-house neighborhood in San 
Francisco. He went to a very large junior high school in 
a very large building. His father was employed as a 
fisherman. The activities in which he engaged as a boy 
were peculiar to a large city in a congested area and un- 
like those of the boys in the audience. He then went on 
to explain how surprised the principal had been about 
his low degree of similarity with these boys and pro- 
duced the previous “personality test” which, in this 
case, demonstrated that the model made choices 
different from the boys in 12 out of the 14 pairs. 


No induction of high or low similarity. In this condi- 
tion no attempt was made to induct a perception of 
similarity or low similarity between the boys and the 
model. The diver was introduced and he immediately 
began his talk without any introductory remarks con- 
cerning common or different upbringing, play activity, 
or personality. 

After he had completed the part of his talk concerned 
with his abilities and before he went on to a discussion 
of the preferences he had developed as a diver, he paused 
to obtain information that was to measure the degree to 
which the subjects projected their own preferences into 
the model. The “personality test” was returned to each 
boy along with another unused copy of the question- 
naire. The boys were asked to guess what choices the 
diver would make if he were taking the test. The diver 
then continued with the final portion of his speech. 

Model’s positive and negative attributes. At the outset 
of his lecture the diver produced a large bar-graph chart 
with the name of the “Maritime Institute” at the top 
and labeled “ability levels for divers of 1st, 2nd, and 
3rd class.” The speaker explained that divers fall into 
three classes and that their classification is determined 
by their ability in various skills. A first-class diver must 
be excellent in these activities, while a third-class diver 
need be only poor in them. The graph consisted of three 
large bars of grossly different heights that made the 
wide disparity between the ability of a top-level and 
low-level diver clearly visible. Eight skills were listed 
on the graph: dark vision, calmness, manual dexterity, 
plan of search, swimming, endurance of loss of air 
supply, endurance of small closed spaces, and distance 
judgment. Of these, it may be noted, five are the same 
as the skills on which the boys had previously rated 
themselves. T 

In the conditions in which the diver was to be a 
positive model he indicated that he was a member of the 
first class, In those cases where he was to be a negative 
model he stated that the third-class bar summarized his 
ability level. The body of his talk was a discussion of 
each of these skills. He pointed out why each of them 
was important in deep-sea diving, what the nature of an 
excellent and poor performance was, and by example 
and direct self-evaluative statements how well he was 
able to perform or not perform each of them. As a posi- 
tive model he was consistently excellent in each skill and 
as a negative model he was consistently inept. 


Statement of Personal Preferences 


In order to test the degree to which neutral attributes 
of the model were introjected (for Hypothesis 1), the 
subjects had to be made aware of attributes of the diver 
that had no clear value for them. This was done by 
having the diver conclude his talk with a brief discussion 
about the unusual matters that different divers prefer: 
some divers prefer to descend from a ship, others from a 
dock; some prefer to dive in the morning, others in the 
afternoon; some would rather work in the Pacific 
Ocean, others in the Atlantic; and some would rather 
dive on a sandy floor, while others like a stony bottom. 
The diver stated his preference on each of these matters 
and admitted that he could not explain his preferences, 
that every diver simply has them regardless of his 
ability. 
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Post-experimental Measures 


At the conclusion of the talk an experimenter who 
served as chairman told the boys that the Department 
of Maritime Engineering and Salvage was interested 
in how persons of their age would respond to questions 
similar to those asked of trainees in diving. He then 
gave them a questionnaire concerning three general 
areas: they estimated the degree to which they were 
similar to the model at present and the degree they 
would be like him in the future, they chose from among 
specified alternatives the working conditions they would 
prefer if they were divers, they rated their own pro- 
ficiency in particular skills (the same ones they had 
tated several weeks earlier) as compared to other boys 
of their own age. Ratings were also made regarding the 
ability level of the experimental group. The boys were 
told that the diver would not see their responses. 

The six experimental conditions were created, two 
groups in each, during a single school day, excluding 
communication between the groups during the period of 
data collection. All boys were assembled several days 
later and the experiment was explained and discussed. 


RESULTS AND DISCUSSION 
Check on Induction of Similarity 


Rehearsals of the experiment showed that 
the degree of similarity the boys perceived 
between themselves and the model could not 
be measured meaningfully immediately after 
the inductions were made (prior to the main 
part of the diver’s speech), since the boys 
simply reported back what they had been 
told. The alternative was to determine the 
degree to which subjects rated themselves as 
similar to the model after the diver’s talk had 
been finished. By this time, however, they 
had also heard him describe his positive or 
negative qualities and his preferences, al- 
though these descriptions did not further em- 
phasize high or low similarity. Thus, despite 
the events beyond the first inductions of simi- 
larity, subjects in the High Sim condition 
were expected to rate themselves as “like the 
diver” (now and in the future) more than those 
in Low Sim. Table 2 shows that subjects rated 
themselves as being more similar to the diver 
in the High Sim condition than in the Low 
Sim condition, whether the model was positive 
or negative in quality. 


Introjection of Neutral DSAs 


The first hypothesis is that persons who 
have a high number of FSAs with a model 
should introject more neutral DSAs than 
those who have a small number of FSAs with 


TABLE 2 


Mean® Decree SUBJECTS RATE THEMSELVES AS 
SIMILAR TO MODEL IN Hicn Sm anp Low 
Sm CONDITIONS 


(Ns in parentheses) 


Attributes of Model 


Condition of Similarity Both Types 
Positive | Negative Kaen 
Combined 
High Similarity 3.38 3.37 3.38 
ae (21) | d9) | 0) 
Low Similarity 2.03 1.50 1.76 
(19) (20) (39) 
Significance of differ- |p < .01|p < .01) p < .001 
ence between rows by 
one-tailed ¢ test 


® Means are based on a combination of two ratings of similarity 
to the model, one concerning “‘now” and the other concerning the 
future, 


TABLE 3 


MEAN NUMBER OF MopEL’s PREFERENCES CHOSEN BY 
Susyects In Hicu Sm AnD Low Smt CONDITIONS 


(Ns in parentheses) 


Attributes of Model 
Condition of Similarity Both 
Positive | Negative je 
Combined 
High Similarity 4.95 4.89 4,93 
eee (21) | (19) (40) 
Low Similarity 4.11 4.15 4.13 
(19) (20) (39) 
Significance of difference | p < .01| p < .02| p < .01 
between rows by one- 
tailed ¢ test 


him. That is, persons in the High Sim condi- 
tion should accept more of the diver’s stated 
preferences for diving conditions as their own 
than do those in the Low Sim condition. Ta- 
ble 3 shows that the hypothesis was confirmed. 

The second hypothesis was based on the 
assumption that persons want to think well of 
themselves. The prediction was made that 
subjects would foster the development of 
FSAs with a positive model where none or 
few were already present, and would hinder 
or weaken their development with a negative 
model in the same circumstances. In order to 
test this prediction, the experimental treat- 
ment was created in which there was no in- 
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duction of high or low similarity (No Ind). 
Recall that the model within this treatment 
was negative for half of the subjects and posi- 
tive for the other half. In this treatment, the 
preferences with respect to diving conditions 
are considered as potential FSAs. In the No 
Ind condition nothing was said concerning 
similarity of attributes between the subjects 
and the model, and subjects were free to treat 
the statements made by the diver concerning 
his preferences as attributes of their own so 
that similarity could be established. 

The prediction then becomes: subjects with 
a positive model should develop many prefer- 
ences similar to those of the diver while those 
with a negative model should develop few 
similar preferences. In addition, the subjects 
made choices on the abstract sketches and 
pairs of colors. They were asked to guess, in 
this treatment, what the diver’s preferences 
might be. It then also follows from the second 
hypothesis that subjects with a positive model 
assign to him more of their own preferences 
than those with a negative model. 

The data confirm both predictions. When 
the model was positive, the subjects accepted 
an average of 4.61 of his preferences as their 
own choices; when he was negative they 
accepted an average of 3.61 (p < .02 by # 
test). In assigning to the model their own pref- 
erences on the materials of the “personality 
_ test,” 5.44 were granted to the positive model 

and 4.10 to the negative one (p < .02 by ¢ 
test). 

We conclude that where subjects have es- 
tablished few FSAs with a model and there- 
fore are free to perceive or not to perceive 
FSAs as they wish, the motivation to think 
well of themselves determines the extent to 
which they develop FSAs. They establish 
more FSAs with models who appear to promise 
the likelihood of providing positive rather 
than negative attributes. 


Self-Evaluation 


The third hypothesis proposed that the 
positive or negative quality a person assigns 
to derived similar attributes in his concept of 
himself is a function of the positive or nega- 
tive quality of the model. That is, subjects 
within the High Sim condition should raise 
their evaluations of personal skills (from the 
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TABLE 4 


MEAN SELF-EVALUATIONS OF ABILITIES BEFORE AND 
AFTER PERCEPTION OF MODEL IN HicH SM AND 
Low Sm CONDITIONS 


(Ws in parentheses) 


Mean Self-Evaluation 
Condition | Attributes i Sieti 
Paie of Before | After | Mean and 
Similarity | Model® | Percep-| Percep-| Direction Change 
tion of | tion of of Oo 
Model | Model* | Change* | qaiteg t 
Test) 
High Simi-| Positive | 3.71 | 3.90 -19 (a) ns 
larity (21) 
Negative | 3.77 | 3.49 | —.28 (b) 205 
(19) 
Low Simi- | Positive | 3.46 | 3.44 | —.02 (c) ns 
larity (19) 
Negative | 4.02 | 3.41 | —.61 (d) „01 
(20) 


A There are no significant differences between means within 
these columns. 

b The following cells in this column were compared by a one- 
tailed f test: a vs. b, p < .02; d vs. c, $ < .02. 


level stated 2 weeks earlier) where the diver 
has positive qualities, and lower them where 
the diver has negative qualities. 

The data for the High Sim condition are in 
the top half of Table 4, and give some support 
to the hypothesis. Where the model was posi- 
tive subjects increased their self-evaluations; 
where the model was negative, subjects low- 
ered them. The difference between the changes 
in these two conditions is significant at the 
.02 level, and the change in evaluation in the 
negative condition is significant at the .05 
level. 

A corollary to the hypothesis would assert 
that the increase or decrease in the subjects’ 
ratings of their personal abilities, depending 
upon the attributes of the diver, should be 
greater in High Sim than in Low Sim. A com- 
parison of the results in the top half of Table 4 
with those in the bottom half of that table 
provides a test for this hypothesis. The boys 
who had a positive model raised their evalua- 
tions of themselves to a greater degree in the 
High Sim treatment than in the Low Sim 
(p < .02), as predicted. Subjects in both 
High Sim and Low Sim who had a negative 
model lowered their evaluations of themselves 
significantly, but the drop was not signifi- 
cantly greater in the High Sim treatment than 
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in the Low Sim. This result is not in accord 
with the prediction. Furthermore, the de- 
crease in self-evaluation, after hearing a nega- 
tive model in the Low Sim treatment, was 
significantly great in itself, when it was ex- 
pected to be small. 

Why did the subjects in the Low Sim nega- 
tive condition lower their ratings so substan- 
tially? One explanation is found in the fact 
that those in the Low Sim negative condition 
had unusually high self-ratings at the time of 
the pretest, and had farther to drop than 
those in other conditions when they rated 
these same abilities during the experimental 
session. Their mean amount of downward 
shift could, thus, be considerably larger by 
statistical artifact. 

Subjects were therefore divided into three 
groups within each of the four experimental 
conditions according to their evaluations of 
themselves on the pretest (high, middle, and 
low). The mean amount of change in these 
groups for each condition may be seen in 
Table 5 which shows, first, that the subjects 
who rated themselves low or medium on the 
pretests did not manifest any significant 
changes in self-evaluation nor significant differ- 
ences among changes. On the other hand, 
among subjects who rated themselves high on 
the pretest, there were differences in the 
amount of change. In the Low Sim conditions, 
there was significantly (p < .05) more drop of 
evaluation when the model was negative than 
when he was positive. In the High Sim condi- 
tion, the difference in changes in self-evalua- 
tion under a positive and a negative model is 
of the same great order of magnitude as in 
the Low Sim condition, although not signifi- 
cant. In fact, when the High and Low Sim 
conditions were combined, the difference be- 
tween the positive and negative model condi- 
tions is significant at the .001 level. 

These significant differences indicate that 
in the negative model conditions, subjects who 
were high in pretest self-evaluation dropped 
markedly during the experimental session, 
either as a result of normal statistical regres- 
sion or of the introjection of the negative 
traits of the model. Since the subjects who 
perceived a positive model appeared to have 
had a tendency to introject his attributes 
(showing little if any regression from their 


TABLE 5 


MEAN CHANGES IN SELF-EVALUATIONS OF ABILITIES 
WITH DIFFERENT LEVELS OF SELF-EVALUATION ON 
PRETEST, IN EACH EXPERIMENTAL CONDITION 


(Ns in parentheses) 


Level of Self-Evaluation on Pretest 


Condition P 
of Simi- Attributes of 


larity lodel High Medium Low 
High Positive —.10 .03 .39 
(6) (4) (11) 
Negative —.62 05 —.31 
(6) (7) (6) 
Low Positive .018 -10 —.16 
(7) (6) (5) 
Negative —.809 —12 —.21 


(9) (6) (5) 
® Cells significantly different by a ¢ test. 


original higher self-evaluation) those perceiv- 
ing a negative model may also have intro- 
jected the model’s attributes. On the basis of 
the present data, it is not possible to choose 
between regression and introjection as ex- 
planations, 

If the interpretation in terms of introjection 
is accepted, why did the subjects with a posi- 
tive model in the Low Sim condition introject 
the positive traits of the model, when the 
theory postulates that such introjection can 
occur only on the basis of high similarity in 
attributes (FSAs)? A reasonable explanation 
is that the attempt to create low similarity for 
this group was not effective; they saw them- 
selves as similar to the model, despite instruc- 
tions which emphasized the lack of similarity. 
In fact, they rated themselves as significantly 
more similar to the model than did subjects 
who had a negative model, as appears in the 
bottom half of Table 6. Furthermore, 5 of 
these 7 subjects were above the median of the 
20 subjects in the Low Sim positive in the de- 
gree to which they rated themselves as similar 
to the model. Finally, all 7 were above the 
median in the number of the model’s prefer- 
ences they accepted as their own. 

Thus, these subjects may not have regressed 
to the mean because they established enough 
similarity with the positive model, and were 
stimulated by him to think at least well 
enough of themselves to overcome a tendency 
toward statistical regression in their self- 
ratings. Because they had rated themselves 
very high on the pretest, there was further 
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not much room for them to make an upward 
shift on the rating scale when evaluating them- 
selves after perceiving themselves as similar 
to the positive model. The fact that they main- 
tained their originally high self-ratings, while 
other groups with a negative model and ini- 
tially high self-evaluations regressed, is con- 
sistent with Hypothesis 3, since their introjec- 
tion of positive attributes was enough to hold 
their ground against tendencies to regress in 
self-ratings. 

Why did the persons who had high evalua- 
tions of themselves on the pretest make them- 
selves similar to a positive model in the Low 
Sim condition despite elaborate instructions 
concerning their lack of similarity? Let us 
assume that the high evaluations they origi- 
nally ascribed to themselves are an indication 
of their high self-esteem. Prior research has 
shown that persons with high self-esteem have 
a strong tendency to protect their high opinion 
of themselves against any influences that 
might lower it, and that they have the confi- 
dence in themselves to do so (Stotland, Thor- 
ley, Thomas, Cohen, & Zander, 1957). We 
assume that this is also true here. Where a 
positive model indicated that he had little 
similarity with the subjects, only those with 
high self-evaluations on the pretest had the 
confidence to contradict this information from 
him and to see themselves as similar to him. 
This contradiction was provoked, we believe, 
by the efforts of the subjects with high self- 
esteem to maintain it through perceiving them- 
selves as similar to the positive model. Sub- 
jects who were medium or low in self-esteem, 
on the other hand, did not appear to be able 
to contradict the experimental instructions. 


CONCLUSIONS 


The results of this study provide additional 
support for the basic hypothesis of the theory 
of identification used here and already sup- 
ported by the results of the study by Stotland, 
Zander, and Natsoulas (1961): that the pres- 
ence of FSAs between a person and a model 
leads the person to introject into his self- 
concept some of the other characteristics of 
the model (DSAs), when these additional char- 
acteristics are neutral with regard to his self- 
evaluation. The results also suggest that, when 
the attributes of a model are desirable or un- 


TABLE 6 
MEANS OF SUBJECTS’ SELF-RATINGS OF SIMILARITY TO 
MODEL FOR SUBJECTS OF DIFFERENT LEVELS OF 
SELF-EVALUATION ON THE PRETEST, IN EACH 
EXPERIMENTAL CONDITION 
(Ns in parentheses) 


Level of Self-Evaluation on Pretest 


Conaition Attributes of 

of 5 

Sala High | Medium | Low 
High | Positive 3.67 3.40 3.20 


(6) (5) (10) 
Negative | 3.50 | 2.00 3.33 
(6) (7) (6) 


Low | Positive | 3.07 1.50 1.77 
(6) (b)* | (6) ()* 
1.33 1.77 

©) (d| (6) (5) 


^ Cells significantly different by two-tailed ¢ test; a vs. b (p < 
.01); a vs. c (p <.01); a vs. d ($ < .01). 


Negative 


desirable, the person’s level of self-esteem in- 
fluences his tendency to introject these at- 
tributes. Persons with low self-esteem seem 
not to react to any marked degree, by not 
introjecting any attribute. Persons with high 
self-esteem, on the other hand, appear to be 
able to take advantage of available possibili- 
ties of establishing FSAs with models with 
desirable attributes, even when the establish- 
ment of such FSAs contradicts the implica- 
tions of the information given by the experi- 
menter. 

The results also show that, when there is no 
obstacle such as contrary information from 
the experimenter, subjects establish FSAs with 
models who have desirable attributes more 
than with those who have undesirable attri- 
butes. 


SUMMARY 


An adult model was presented to groups of 
grade school children as being either highly 
similar or of little similarity to the children in 
background and other attributes. To half of 
the groups he described himself as being high 
in abilities necessary for deep-sea diving. To 
the other half he described himself as being 
deficient in these abilities. He also described 
a number of his preferences relevant to deep- 
sea diving. The subjects evaluated themselves 
on these same abilities both before and after 
the experiment, as well as indicating their 
own preferences relevant to deep-sea diving. 


264 E. Burnstztn, E. STOTLAND, AND A. ZANDER 


It was found that the subjects who were 
told that the diver was highly similar to them 
in background accepted his preferences rele- 
vant to deep-sea diving, more so than subjects 
who were told that he was not similar to them. 
It was also found that subjects who appeared 
to be high in self-esteem made themselves 
more similar in preferences to a positive model 
when he had told them that he was not similar 
to them in background attributes. Apparently 
as a result, these subjects were able to main- 
tain their evaluation of their own abilities 
which they had made in a pre-experimental 
test. Subjects with high pretest self-evalua- 
tions, when presented with a negative model, 
regressed toward the group’s mean self-evalua- 
tion from their high pre-experimental evalua- 
tion. 


In another condition of the study, no at- 
tempt was made to induce a perception of 
high or low similarity of the subjects to the 
model. The subjects accepted as their own the 
preferences of the able model more than the 
preferences of the less able one, and projected 
more of their own preferences onto the model 
when the latter was able than when he was 
less able. 
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IDENTIFICATION AND CHANGES IN PREJUDICE 
AND IN AUTHORITARIANISM! 


EZRA STOTLAND 
University of Washington 


REVIOUS studies have shown that it is 

possible to reduce a person’s degree of 

prejudice by communicating to him— 
in part via case histories—about some of the 
emotional, irrational bases of prejudice (Katz, 
Sarnoff, & McClintock, 1956; Stotland, Katz, 
& Patchen, 1959). Since the acquisition of in- 
sight involves a change in the person’s con- 
ception of himself, any process that leads to 
a change in self-conception might be em- 
ployed to give insight into the emotional 
bases of prejudice and, thus, reduce it. One 
process of changing self-conception is identifi- 
cation with another person or model. The re- 
sults obtained in these studies may, thus, have 
followed from the fact that the person in the 
case history served as a model with whom the 
subjects identified. The study reported here 
attempts to establish that such identification 
can facilitate the acquisition of insight and 
the consequent reduction in prejudice. 

The proposition that identification leads to 
changes in self-conception finds support in 
the work of Stotland, Zander, and Natsoulas 
(1961). When a person has an attribute in his 
concept of himself that is similar to one in 
his concept of a model, these authors term 
this attribute a first similar attribute (FSA). 
It is assumed that persons tend to conceive of 
themselves as possessing the same (or a simi- 
lar) attributes as those that they perceive to 
be associated with the FSA in their concept of 
the model except when veridical self-percep- 
tion would be contradicted. Such induced at- 
tributes are termed derived similar attributes 
(DSAs). The type of identification that leads 


1 This research report draws its data from the pro- 
gram of research on attitude change under the direction 
of Daniel Katz. This research was supported in part by 
the United States Air Force under Contract No. AF 41 
(657)-82, monitored by Crew Research Laboratory, Air 
Force Personnel and Training Research Center (deac- 
tivated in March 1958). 

2 The authors wish to express their indebtedness to 
Daniel Katz for his help, but the responsibility for the 
analysis and the interpretations reported in this paper 
are theirs. We are also indebted to John J. Jochem for 
his help in the administration and analysis of the data. 


AND MARTIN PATCHEN? 


University of Michigan 


to the acquisition of these traits is termed in- 
trojection. The strength of a person’s tendency 
to introject is assumed to be partly a function 
of the number of FSAs in his concepts of him- 
self and of the model. 

This process may be one of those by which 
a person accepts insights about his ego de- 
fenses and about how they determine his 
prejudicial attitudes. Suppose that a person 
finds that he has certain common qualities 
(FSAs) with another person. Suppose also 
that the person conceives that this model is 
prejudiced against certain minority groups 
because of the functioning of such ego de- 
fenses as projection, compensation, displace- 
ment of hostility, etc. He would be expected 
then to develop in his self-concept an aware- 
ness that he himself also has the attributes of 
engaging in these ego defenses. That is, he 
gains insight. 

Self-insight about the defensive component 
of prejudice, once acquired, may be expected 
to lead one to reduce his defensiveness and 
his prejudice, on the assumption that people 
in our culture are trained to think of irration- 
ality and devices like scapegoating as “bad.” 
Furthermore, the materials that our subjects 
read concerning the relationship between 
emotions and attitudes clearly implied that 
defensive reactions and hostility to other 
groups do not reflect a person’s best qualities. 

Thus, the first hypothesis is: If a person 
conceives of another person as being preju- 
diced toward minority groups for ego defen- 
sive reasons, and if he also conceives of him- 
self as having several objective characteristics 
(FSAs) that are similar to those of his con- 
cept of the other person (the model), he should 
tend to reduce his prejudice toward minority 
groups more than if he conceives of himself as 
having fewer objective characteristics similar 
to those of the other person. 

The general hypothesis was not, however, 
expected to hold equally for all persons. In- 
dividuals who do not actually engage in the 
same defenses as the model would be expected 
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to reject the potential DSA of using these de- 
fenses, as inconsistent with their own actual 
behavior. In the light of the work of Adorno, 
Frenkel-Brunswik, Levinson, and Sanford 
(1950) persons low in authoritarianism should 
react in this fashion. 

Since the DSA of engaging in defensive 
behavior would apply to the high authori- 
tarians, such persons might be expected to 
conceive of themselves as also having this 
attribute. However, the additional factor of 
strong motivation to accept or reject the DSA 
must also be considered. The very strength 
of high authoritarians’ defenses bespeaks a 
very great need for protection against anxiety. 
They would probably tend, then, to react 
defensively to reject any information that 
might lead to an insight. Both those who are 
very high or very low in authoritarianism 
would therefore be expected to resist the con- 
ception of themselves as behaving defensively. 
However, persons who are moderate in au- 
thoritarianism should be less subject to the 
forces on persons at either extreme. 

Accordingly, the second hypothesis is that 
when persons are given information about a 
model whose prejudice is based on ego de- 
fensive reactions, those who are moderate in 
authoritarianism should tend more than those 
low or high in authoritarianism to reduce 
their prejudices toward minority groups. 

As a (not necessarily conscious) cognitive 
process, the process of gaining insight into 
defense mechanisms and their relationship to 
prejudice resembles problem solving, since 
both entail a reorganization or a change of 
meaning of given attributes of entities. In 
problem solving, a typical finding is that an 
extended period of time, called incubation, 
must elapse between the presentation of the 
problem and the attainment of the solution 
(Woodworth & Schlosberg, 1954, pp. 838- 
891). During incubation, the person is not 
usually conscious of any cognitive work he is 
doing on the problem. If the process of gain- 
ing insight into oneself and problem solving 
are in fact similar, a period of time should be 
required before the significance of the insights 
is fully brought to bear. Thus, the effects of 
reading the case history should become evi- 
dent only some time after the case history has 
been read. 


Thus, the third hypothesis predicts a 
“sleeper effect”; i.e., that the losses in preju- 
dice hypothesized in the first two hypotheses 
should not occur until after the Passage of a 
considerable period of time. 


MetHop 
Subjects and Setting 


The subjects for this study were 85 female college 
students at a midwestern university. The experiment 
was performed as part of regular class work in three 
meetings of undergraduate courses in introductory 
psychology and introductory social science. The experi- 
mental sessions were administered by the staff of this 
study rather than by the classroom instructors. 


Procedure 


The over-all design of the experiments was as fol- 
lows. First session: Measure of personality and attitudes 
toward minority groups obtained from all subjects. 
Second session: Three or 4 weeks later, an attempt was 
made to give the subjects insight into the emotional 
basis of their attitudes. This was done under two condi- 
tions: Where the person in a case history illustrating 
the relation of emotions and attitudes is someone very 
much like the subject in objective characteristics (FSAs 
with model very high), and where the person in the case 
history is not very similar to the subject in objective 
characteristics (FSAs with model very low). Third 
session: Immediately following the experimental induc- 
tion and again 4 weeks later, attitudes toward minority 
groups were remeasured. In addition, the personality 
measures were repeated at the third session. No formal 
control group was included in the design since previous 
experiments in this program (Katz, Sarnoff, & McClin- 
tock, 1956) have demonstrated that attitudes can be 
changed through the self-insight method. The present 
study is concerned mainly with the problem of why 
such changes take place. These three sessions will now 
be described in more detail, 

First session: Each subject was given a booklet that 
contained scales of personality and of stereotypes about 
Negroes. The Negro stereotype measure was the 21- 
item Likert scale used by Katz et al. (1956). Both 
favorable and unfavorable items were included. A 
person’s score consists of the sum of the ratings of agree- 
ment with the favorable items, Thus, the lower the 
score, the less prejudice. The personality measures used 
were the 30-item F Scale used by Adorno et al. (1950); 
and a Rationality scale (to be described), which was 
intended to measure the tendency to view oneself and 
the world in a rational and objective way. In the first 
session, all subjects received identical question booklets 
and were given all the time they needed. They were 
informed that their answers would be completely anon- 
ymous. 


* The authors are indebted to the following instruc- 
tors for their generous cooperation in this study: 
Thomas Cohen, Robert Daugherty, Leonard Moss, 
Robert Fulton, Evangeline Norton, and Robert Rhodes. 
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Second session: The second session took place ap- 
proximately 4 weeks after the first. At this session, each 
subject received one of two different types of booklets, 
realizing the two experimental conditions. The booklets 
were passed out in the classrooms from a pile in which 
both types were alternated. Thus, both experimental 
conditions were administered simultaneously in the 
same classroom. 

General interpretive section. The two types of booklets 
had identical first sections. In all cases, subjects read 
first a five-page, double-spaced selection entitled, 
“Emotions and Attitudes.” This section described in 
simple, layman’s terms the ways in which emotional 
difficulties may lead to antiminority attitudes through 
such mechanisms as scapegoating, projection, and com- 
pensation. This selection was the same as the first part 
of the “interpretation” materials used by Katz et al. 
(1956). 

Case history. The general statement about how atti- 
tudes are related to emotions was followed by a “case 
history” of a girl named “Carol.” Subjects in the high 
similarity condition (High Sim) read about a person 
similar to themselves in objective characteristics, while 
subjects in the low similarity condition (Low Sim) read 
about a dissimilar person. Since we wish to see the 
effects on attitude change of different degrees of iden- 
tification, separate from the effects of other variables 
like mobility aspiration, the two fictional “Carols” were 
described in terms which made them about equal in 
social status. 

Background data on all subjects were obtained at the 
first administration so that facts about “Carol” could 
be tailored to be either like or unlike the subjects. 
Since subjects in some classes had slightly different 
backgrounds from those in other classes, it was neces- 
sary to have different descriptions of a similar Carol 
and different descriptions of a dissimilar Carol to fit 
the modal background of persons in each class. 

Similarity condition. In the similarity condition 
(Sim), Carol was described as an American girl attend- 
ing the University of Toledo. Her major subject in 
college, her vocational goal, her part-time job, her 
father’s occupation, her recreations (e.g., going to the 
movies), etc. were all described in each class as closely 
similar to those of the subjects in that class. Subjects 
in each class fortunately tended to be homogenous on 
these characteristics. 

Low Similarity condition. In the other condition, 
“Carol” was described as a student at “Canterbury 
University, located in the agricultural area of New 
Zealand.” Her father’s occupation, her part-time job, 
her major subject, her vocational goal, and her recrea- 
tions (e.g., watching cricket matches) were all made 
alien to the average student at this university. 

Following the description of “Carol” the rest of the 
case history was almost identical for both experimental 
conditions, For both “American Carol” and “New 
Zealand Carol,” the relation of personal frustrations to 
antiminority attitudes was traced. Examples given of 
the prejudice of “American Carol” were those against 
Negroes, Mexicans, and Turks, while “New Zealand 
Carol” was described as disliking Negroes, Indonesians, 
and Turks. 


Negro Attitude scale. Next all subjects filled out a 
scale measuring negative stereotypes of Negroes and 
feelings about behavior toward Negroes. The Stereo- 
type scale consisted of 17 items matched to items of the 
Stereotype scale used in the first session, and was found 
to correlate .84 with the original scale when tested on 
another sample of college students. The purpose of 
changing the scale was to meet the problem of monot- 
ony in answering the same scale at all three sessions 
and the danger that subjects might develop stable 
responses to the items after two administrations. In 
addition to the matched items, 4 of the original 21 
items were repeated to provide a constant attitude 
measure. 

Third session: In the third session, approximately 4 
weeks after the second, the same personality and atti- 
tude scales that had been administered in the first ses- 
sion were repeated, 


Analysis of F Scale 


In the analysis of the data, only a subscale of the 
F Scale, termed the Defensiveness subscale (McClin- 
tock, 1958), was used, since it purported to measure 
the tendency to employ the defenses with which this 
study is concerned. McClintock presents evidence sug- 
gesting that the personality syndrome of ego defensive- 
ness operates in a different fashion in attitude change 
from the syndrome defined by the rest of the items in 
the F Scale. 


Rationality Scale 


The Rationality scale administered during the first 
session consisted of the following items answered by the 
Likert method of indicating degrees of agreement or 
disagreement. 

1. What happens to us seems to be determined a 
lot by a fate over which we have little control. 

2. There have been many times when I’ve felt 
jealous of individuals who have personal qualities 
which are better than my own, 

3. If a person is in a position to help me in some 
way, I often act more friendly toward him than I 
really feel. 

4, When a new idea seems to be logical but most 
people feel that it somehow doesn’t make sense, you 
can be pretty sure there’s some hidden flaw in it. 

5. If I have a heated argument with someone, 
once it’s over I forget all about it and don’t hold any 
grudge against the other person. 

6. When a custom isn’t as efficient as possible, it 
should be changed—even though it is traditional and 
people are used to doing things that way, 

7. I usually dislike someone if he disputes my 
opinions. 

8. It is sometimes possible for a person to know 
what some close relation is thinking or that some- 
thing awful is happening to him, even though he is 
far away. 

This scale was designed to measure a person’s 
tendency to judge the world and himself in a hard- 
headed, factual manner. Four of the items (1, 4, 6, and 
8) concern the extent to which he judges objective 
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phenomena in a logical, nonemotional way. The other 
four items (2, 3, 5, and 7) concern the extent to which 
he views himself in an objective rational way. These 
latter items are statements true of almost all people; 
disagreement with 2, 3, and 7 and agreement with 5 
entail the denial of fact and a tendency toward self- 
deception. 


RESULTS 
Differences between Conditions 


The first hypothesis asserts that among 
persons who perceive another individual 
(“model”) to be prejudiced toward minority 
groups for ego defensive reasons, those who 
have several objective characteristics that 
are similar to the model’s should become less 
prejudiced toward minority groups. Before 
this hypothesis can be tested, High and Low 
Sim subjects must be shown not to differ 
significantly in their mean levels of prejudice 
at the first, pre-experimental administration. 
This requirement was met, the mean Negro 
stereotype score being 7.42 for High Sim and 
8.41 for Low Sim. 

The hypothesis leads to the prediction of 
greater reduction in prejudice in the High 
Sim condition than in the Low Sim condition. 
The prediction is not supported, since the 
mean changes of stereotype from the first to 
the third sessions for the Sim and Low Sim 
conditions, —1.70 and —2.30, respectively, 
do not differ significantly. (Negative numbers 
mean a reduction in prejudice.) In addition 
neither of the groups showed a significant 
mean change in prejudice from the first to 
the third sessions. 

A more interesting picture emerges, how- 
ever, when the mean changes in negative 
stereotype are considered in relation to the 
original degree of prejudice that the subjects 
manifested at the first session. Table 1 shows 
for each condition the mean changes in total 
Stereotype scores separately for subjects high 
and low in prejudice at the first administra- 
tion. Subjects low in original degree of preju- 
dice show some tendency to become more 
prejudiced in the High Sim condition. These 
subjects show a mean change in a direction 
opposite to that for subjects in the Low Sim 
condition who started at the same level of 
prejudice. (This difference is significant at the 
-10 level.) Moreover, subjects in the High Sim 
condition are the only group who have a 
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TABLE 1 


MEAN CHANGE® IN TOTAL NEGRO STEREOTYPE SCORE 
FROM First TO Turrp SESSION IN RELATION TO 
ORIGINAL DEGREE OF PREJUDICE 


(Number of Cases in Parentheses) 


Experimental Condition 


Original Decree of 
Prejudice 
Low Sim High Sim 
Low Prejudice —1.55 3.65** 
(21) (30) 
High Prejudice —3.70 —1.25 
(14) (20) 


Note.—Significance of differences between cell means by two- 
tailed ¢ tests: A vs. B, $ < .10. 
* Positive score means change to more negative stereotype. 
** Indicates change from position at first session significant at 
.07 level, two-tailed £ test, 


mean change in stereotype from their initial 
Position that approaches significance (p< 
07). In short, the subjects who were low in 
prejudice show a tendency to become more 
prejudiced when there is similarity between 
themselves and the prejudiced person in the 
case history. In contrast, no differences be- 
tween conditions or significant changes in 
either condition were found among subjects 
who were high in prejudice at the first session. 
The results of Table 1 are contradictory to 
Hypothesis 1. This shift among the relatively 
unprejudiced subjects cannot be explained 
solely as a regression effect since it occurred 
only in the High Sim condition.4 


Authoritarianism and Change in Prejudice 


Confirming the original studies of authori- 
tarianism by Adorno et al. (1950) for the F 
Scale although only the Defensiveness sub- 
scale of the F Scale was employed, the correla- 
tion with anti-Negro prejudice is positive 
(r = .27, p < 02). 

We expected that persons who were medium 
in defensiveness would change further away 


*Tt is also possible to interpret these differences as 
showing that the low prejudiced in the Low Sim con- 
dition were somehow prevented from showing high 
prejudice scores as a result of normal regression. The 
failure of this group to become more prejudiced may 
have resulted from pressure from the experimenters and 
the regular course work toward less prejudice, On the 
other hand, the Sim condition appears to have contrib- 
uted to these pressures, 
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from prejudice than would those who are 
neither high nor low in defensiveness. An 
analysis of the relation between level of au- 
thoritarianism and change in Negro stereo- 
type, separately for those who started with 
little prejudice and for those who started with 
strong prejudice, show that this hypothesis 
is not supported. None of the changes was 
significant for any of the three groups on the 
defensiveness dimension, regardless of their 
original degree of prejudice. Nor was there 
any significant difference among the groups 
in the amount of change. 

The obvious next question is whether the 
subjects’ position on the Defensiveness scale 
had any influence on their reactions to the 
differences in the experimental conditions. 
Table 2 shows that people who are low in de- 
fensiveness and who experience the High Sim 
condition become significantly more preju- 
diced by the third session than they were at 
the start of the study. This group changes 
significantly more than those from the Low 
Sim condition who are also low in defensive- 
ness, and more than subjects with medium or 
low defensiveness in the Sim condition. It is 
those who score low in defensiveness and who 
are in the Sim condition who become more 
prejudiced. 

Since the low-defensive persons become 
more prejudiced in the High Sim condition, 
it might be expected that they would also 
acquire other authoritarian characteristics 
possessed by the model. This expectation is 
confirmed by the data of Table 3, which show 
that low-defensives in the High Sim condition 
scored significantly more defensive at the 
third session than they had at the first ses- 
sion, This change toward greater defensive- 
ness differed significantly from changes among 
those who were also low defensives but were 
in the Low Sim condition. 


Rationality Scale and Attitude Changes 


Since the Rationality scale was used in the 
study primarily for exploratory purposes, the 
first step is to discover the relationship be- 
tween original degree of prejudice and degree 
of rationality. The mean score on the Negro 
Stereotype scale at the first administration 
was 4.28 for those high in rationality and 4.51 
for those low, when subjects are divided at 


TABLE 2 


MEAN CHANGE* IN TOTAL STEREOTYPE SCORE FROM 
First To THD SESSIONS IN RELATION TO 
DEFENSIVENESS SCORE 


(Number of Cases in Parentheses) 


Position on Defensive- Dia ae 
ness Subscale 
Low Sim High Sim 
Low —1,35 7.70*** 
(18) (14) 
A B 
Medium —4.45 —.80 
(7) (25) 
D E 
High = 2.50 —.25 
(10) (11) 
H 


Note.—Significance of differences between cell means by a two- 
tailed ¢ test: A vs. B, p < .05; B vs. E, p < .05; B vs. H, p < .01. 
2 More positive score means change to more negative stereo- 
type. 
*** Indicates change significant at .05 level in two-tailed test. 


TABLE 3 


MEAN CHANGE" IN DEFENSIVENESS SUBSCALE OF F 
Scare* FROM First TO THIRD IN RELATION TO 
INITIAL DEFENSIVENESS SCORE 


(Number of Cases in Parentheses) 


Experimental Condition 
Original Position Defen- 
siveness Subscale 
Low Sim High Sim 
Low 1.95 8.90"*** 
(18) (14) 
B 
Medium 7.15 —1.80 
(7) (25) 
D 
High —3.50 —4,55 
(10) (11) 


Note.—Significance of differences between cell means by a 
two-tailed # test: A vs. B, p < .10; C vs. D, p < .10. 
® Positive score means change to greater defensiveness, 
**** Indicates change significant at .001 level in two-tailed test. 


approximately the median (difference not 
significant). Level of rationality can there- 
fore be regarded as independent of the initial 
degree of prejudice. 

In Table 4 are presented the mean changes 
in Negro stereotype scores from the first to 
the third conditions for subjects high and 
low on the Rationality scale. Subjects who are 
low in rationality tended to shift (p < .10) 
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TABLE 4 


MEAN CHANGE® IN TOTAL STEREOTYPE SCORE FROM 
First TO THIRD SESSIONS IN RELATION 
TO RATIONALITY SCORE 


(Number of Cases in Parentheses) 


Experimental Condition 


Position en Rationality 


cale 
Low Sim High Sim 
Low Rationality —3.50* 2.95 
(20) (31) 
High Rationality —0.65 —0.35 
<i (15) (19) 


Note.—Significance of difference between cell means by two 
tailed ¢ test: A vs. B, p < .05. 
* More positive score means change to more negative stereo- 


* Indicates change significant at .10 level in two-tailed test, 


TABLE 5 


MEAN CHANGE IN TOTAL STEREOTYPE ScoRE" FROM 
First TO THIRD SESSIONS IN RELATION TO 
RATIONALITY AND PREJUDICE 


(Number of Cases in Parentheses) 


Initial Low Prejudice |Initial High Prejudice 


ELE SEAT Sera 
me Dissimilar- | Similari issimi- | Similari 
ity Condi | Gonaition | lanty | Simileity 
Low Ration- |—5.10***| 5,70*** | —1,10 | —1.35 
ality (12) (19) (8) (12) 
A B C D 
High Ra- 3.65 0.20 —7.15 | —1.10 
tionality (9) (11) (6) (8) 
F G H pi 


Note.—Significance of difference between cell means by two- 
tailed ¢ test: A vs. B, p < .05; F vs. H, » < .10; B vs. D, $ < .10; 
Avs. F, p < .05. 

* Positive score means more negative steotype. 
*** Indicates change in cell significant at .05 level, Two-tailed 
test used where change is positive, 


toward less prejudice in the Low Sim condi- 
tion, while those in the High Sim condition 
showed a slight tendency to shift in the other 
direction, although neither shift is statistically 
significant. The difference between these 
groups is significant at the .05 level. On the 
other hand, subjects who are high in rational- 
ity do not appear to have been affected by the 
differences between the High Sim and Low 
Sim conditions. These results might suggest 
that it was the Low Sim condition that led 
to the greatest change in Negro stereotypes 
and not the High Sim condition, as in the 
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previously reported results. However, the 
data in Table 5 help to clarify this issue. 

Table 5 shows the relationship between 
attitude change and rationality separately 
for subjects high and low in original degree of 
prejudice. We see that, for subjects who are 
high in initial prejudice, neither their degree 
of rationality nor the experimental condition 
they experienced makes a significant differ- 
ence in how much they changed in Negro 
stereotype. On the other hand, the subjects 
who are low in original degree of prejudice 
and who are also irrational are the ones most 
affected by the difference between the condi- 
tions. In the High Sim condition, they change 
significantly toward more prejudice, while in 
the Low Sim condition they show a significant 
change toward less prejudice. Thus, for those 
low in rationality and low in original prejudice, 
both conditions were leading to significant 
changes—in opposite directions. The differ- 
ence between the conditions in the mean de- 
gree of change is significant at the .05 level. 

An additional finding shown by Table 5 is 
that in the Low Sim condition, among sub- 
jects low in original prejudice, irrational per- 
sons changed significantly more in the direc- 
tion of lower prejudice than did rational ones. 
Although this finding is not inconsistent with 
the previous finding, it is difficult to under- 
stand why these irrational subjects should 
tend to be influenced toward less prejudice 
in the Low Sim condition. 

The correlation between the Rationality 
scale and the Defensiveness subscale of the F 


was found to be —.26 ($ < .02), showing ¥ 


some tendency for the low rationals to be high 
in defensiveness. Since it was found that low 
defensives and low rationals were more sus- 
ceptible to identification, these dimensions of 
personality appear to have different signifi- 
cance, although they are related. 


Sleeper Effect 


In order to test the hypothesis predicting a 
sleeper effect four items concerning Negro 
stereotypes were repeated at all three sessions. 
A sleeper effect would be indicated by the ab- 
sence at the second session of changes that 
appeared at the third. All of the significant 
changes appearing at the third session were 
related to initial position on prejudice or on a 
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personality dimension. Thus, the sleeper effect 
can be checked by examining changes from 
the first to the second sessions for persons at 
different positions on these variables. 

We have seen that in the High Sim condi- 
tion, subjects who were low in initial prejudice, 
low in defensiveness, or low in rationality all 
changed significantly (toward greater preju- 
dice) from the first to the third sessions. None 
of these groups shows any significant change 
from the first to the second sessions, in keep- 
ing with a sleeper effect; nor did the changes 
they showed from the first to second sessions 
differ from those of any other group. These 
findings hold for both the total stereotype 
scales and the items duplicated at all three 
administrations. 

However, subjects high in defensiveness in 
the Low Sim condition showed a significant 
drop in prejudice at the end of the second 
session when prejudice was measured by 
either the total scale or the duplicated items. 
This drop was evidently dissipated by the 
third session, since none of these subjects 
changed significantly from the first to third 
sessions. Although this finding is contrary to 
the sleeper effect hypothesis, it is not incon- 
sistent with the changes that were effected by 
the experimental manipulations over the en- 
tire time period of the study—i.e., a reduction 
or no change of prejudice in the Low Sim con- 
dition as compared to an increase of prejudice 
in the High Sim condition. 


DISCUSSION 


The first hypothesis of the study was that 
similarity between a subject and a prejudiced 
model, coupled with an explanation of the 
defensive basis of the models prejudice, 
would lead the subject to gain insights into 
the defensive basis of her own prejudice, and, 
thus, to reduction of prejudice. The results, 
however, did not support this hypothesis. 
They indicated instead that persons of little 
prejudice tend to become more prejudiced 
when they find they have objective charac- 
teristics in common with a bigoted model. On 
the other hand, subjects who were already 
prejudiced did not change. While these results 
are unexpected, they can still be described in 
terms of our theory of identification. 

We assumed that when a person conceives 
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of himself and a model as having objectively 
similar characteristics, the person should tend 
to conceive that he is like the model in other 
respects called derived similar attributes 
(DSAs). In the present experiment, we at- 
tempted to induce the subject to see herself 
as similar to the model in the DSA of preju- 
dice resulting from the use of defense mecha- 
nisms. Having perceived herself as like the 
model in this “undesirable” DSA, the subject 
was expected to want to change her own be- 
havior to eliminate the undesirable DSA. The 
unexpected finding that certain subjects be- 
came more like the model in prejudice and in 
other related characteristics indicates that 
these subjects perceived themselves as like 
the model in the DSA of prejudice but did 
not take the next step of trying to change 
this trait. The question arises as to why these 
subjects did not try to eliminate the attributes 
that were clearly implied by the entire experi- 
mental situation to be “bad.” 

There are at least two ways of explaining 
this finding, one more directly motivational 
in orientation, the other more cognitive. From 
the standpoint of social supports for prejudice, 
in the first place, the High Sim condition may 
have inadvertently acted to make prejudiced 
attitudes legitimate. At least some of those 
subjects who initially scored low on prejudice 
may have had a good deal of latent anti-Negro 
feeling—held in check, perhaps, partially by 
the notion that other college level people in 
an intellectual climate of “tolerance” did not 
share such prejudiced views. The presentation 
of a vivid story about a person like themselves 
who had such anti-Negro prejudices may have 
served to give these subjects reassurance that 
they were not, after all, alone in these “wrong” 
feelings. Labeling these feelings as unrealistic 
may have been less important than the idea 
that they were shared by persons like them- 
selves. The experimenters’ fairly clear, though 
implied, opposition to prejudice may have 
been rendered relatively ineffective by the 
facts that they were persons of a very different 
status and strangers (from another school) to 
boot. 

In this light, it becomes easier to under- 
stand the rather surprising finding that the 
subjects low in defensiveness were most sus- 
ceptible to becoming more prejudiced, since 
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Berkowitz and Lundy (1957) have shown that 
persons low on the F Scale tend to be suscepti- 
ble to conformity pressures from peer groups, 
i.e., groups with characteristics similar to the 
subject’s. Furthermore, as indicated in the 
original report by Adorno et al. (1950, p. 
771ff.), some persons are low on the F Scale 
only for reasons of conformity to certain liberal 
group norms and not for personality reasons. 
Thus, although some low defensives may not 
be disposed to prejudice by personality, their 
susceptibility to peer group influence makes 
them more susceptible to accepting the preju- 
diced norms of some peer groups. The par- 
ticular susceptibility of the irrationals is 
consistent with this general explanation. To 
accept the prejudiced idea of peers even though 
their unrealistic basis is clearly explained is, 
after all, quite irrational behavior. 

The cognitive explanation of the findings 
retains some of the original thinking about 
identification. Since subjects were made to 
perceive first similar attributes with the model, 
they have strong tendencies to perceive them- 
selves in a manner consistent with the FSAs. 
In the present case, consistency is achievable 
mainly by perceiving themselves to possess 
the derived similar attributes (DSAs) that 
were associated with the FSAs. The subjects 
probably could not easily employ any other 
way of achieving consistency within the con- 
text of the present study. Achieving consist- 
ency by adopting greater prejudice was un- 
opposed in the present study by any strongly 
aroused motivation to behave rationally and 
reasonably. In a previous study (Stotland, 
Katz, & Patchen, 1959), the experimental 
arousal of such motivation to be rational and 
reasonable was found to be essential to the 
lowering of defensively based prejudice by 
means of communicating information that 
could provide the basis for insight into the 
defenses. Since no similar effort to arouse the 
subject’s motivation to be rational was em- 
ployed in the present study, the subjects 
could become like the model even in ways that 
were negatively evaluated by the experi- 
menters as irrational and unreasonable. The 
fact that some of these changes were observa- 
ble only after a period of time argues for this 
cognitive explanation, since cognitive changes 
probably take time to occur. 
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This cognitive explanation is also consistent 
with the finding that the subjects low in 
rationality were most prone to identify. Since 
such subjects were assumed to tend not to 
perceive themselves objectively, they should 
therefore be more likely than others to shift 
their self-perceptions to be like those of a 
model. To accept the prejudice of the model, 
they had to disregard some of their own previ- 
ous feelings and to magnify the prejudiced 
feelings shared with the model. It may be 
noted that the original, unsupported predic- 
tion of a shift toward less prejudice in the 
Sim condition was based on the assumption 
that the identifiers would have some tendency 
to reject irrational forms of behavior. The 
hypothesis may have failed to be confirmed 
in part because it was the unrealistic, irra- 
tional persons who tended to change their 
self-perceptions readily to be more similar to 
a model. Therefore, the identifiers were per- 
sons who would be expected to place minimal 
pressures on themselves to become more rea- 
sonable. 

The cognitive explanation can also be recon- 
ciled with the finding that the low defensives 
became more defensive and more prejudiced 
in the High Sim condition. Adorno et al. 
(1950) suggest that persons low in F relate 
easily, warmly, and tolerantly with others. 
The low defenders may accordingly have a 
strong tendency to respond to, and identify 
with, other persons and to be more susceptible 
to influence by them. Such a tendency toward 
identification may extend even to those models 
who have harsh, aggressive attitudes—espe- 
cially if the low defenders’ tolerance is super- 
ficial. Highly defensive persons, on the other 
hand, may tend to project their faults onto 
others and then reject them. It is as if the 
high defenders relate to others in terms of 
their own needs, and, thus, are less likely to 
change through identification with others. 

Regardless of which explanation, cognitive 
or motivational, should ultimately prove more 
valid, a question remains as to the exact 
nature of the changes on the part of subjects 
who identified with the model. In view of the 
relatively superficial experimental inductions, 
the shift in scores on the F Scale or the preju- 
dice scale cannot reasonably be thought to 
represent a profound change in personality 
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functioning. On the other hand, the fact that 
changes in F Scale scores occurred when only 
antiminority attitudes were the direct target 
of change indicates that some generalized 
effect took place. Most likely, a perception of 
oneself as being tough or severe in outlook, 
like the model, was acquired. Such a charac- 
teristic could be reflected by the subjects not 
only in their expressed degree of prejudice, 
but also in their acceptance of the F Scale 
items, a number of which involve harsh tough- 
ness in evaluation of people. Thus, the ac- 
quired characteristic can be reasonably under- 
stood as a change in self-perception, not 
necessarily a change in overt behavior. If it was 
toughness that was accepted, the attempt to 
develop genuine insight among the subjects 
into the irrational basis of prejudice clearly 
failed. Since an entirely different process oc- 
curred the study does not provide any data 
on the question of whether insight actually 
does lead to a reduction of prejudice. Perhaps 
if the subjects had been induced to identify 
with the model at the same time that their 
motivation to be rational had been aroused, 
the study would have had different results. 

The second hypothesis of the study was that 
persons medium in authoritarianism would be 
most susceptible to being changed by experi- 
mental inductions. This hypothesis was not 
supported but the conditions necessary for 
its confirmation were not achieved, since the 
experimental inductions did not result in the 
acquisition of insight. The findings concerning 
defensiveness were that, in the High Sim con- 
dition, the low defensives changed their ex- 
pressions of general attitudes in the way that 
would make them more like the model, i.e., 
more authoritarian and more prejudiced. 

The third hypothesis of a sleeper effect was 
in general confirmed. None of the long-term 
changes observed were manifest at the end 
of the second session. Since cognitive reorgani- 
zation presumably takes time, these results 
are in keeping with the interpretation that 
the acquisition of more prejudice and defen- 
siveness was due to a change in the self-con- 
cept. 

The results of this study have some im- 
portant social implications. Many communica- 
tions—e.g., in movies and television—vividly 
portray the activities and sentiments of im- 
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moral, antisocial persons, such as gangsters 
and juvenile delinquents. Defenders of such 
communications point out that the behavior 
of such persons is labeled as bad. The present 
results indicate, however, that people may be 
influenced to become more like fictional per- 
sons who have undesirable characteristics, 
even when the irrationality of such qualities 
is clearly explained. 

The present results also bespeak the power 
of all types of communication to effect at least 
superficial changes in the personalities of the 
recipients. The relative ease with which the 
reading of a single case history affected re- 
sponses to F Scale items several weeks later 
suggests that stronger and more repeated 
communications can have more basic effects 
than the relatively superficial ones of the pres- 
ent study. In the present context, this result 
is disconcerting, since the characteristics the 
subjects acquired are thought by most social 
scientists to be undesirable. Of course, if the 
models presented have desirable qualities, 
e.g., are less authoritarian, then the potency 
of these types of communication may be 
socially valuable. 


SUMMARY 


Eighty-five subjects were first given the 
F Scale and a measure of anti-Negro prejudice 
in regular college classes. Four weeks later 
they read a case history of a person who was 
prejudiced and authoritarian. Half the sub- 
jects were told that the person in the case 
history was similar to themselves in back- 
ground and objective personal characteristics. 
The other half were told that the person was 
dissimilar to them, Three weeks later the F 
Scale and the anti-Negro prejudice scale were 
readministered. It was found that those low 
in prejudice at the first administration be- 
came more prejudiced but only if they were 
told the case history was about someone like 
themselves. Persons low in the Defensiveness 
subscale of the F Scale also became more preju- 
diced by the third administration, if the 
person in the case history was similar to them 
in background. The latter group also became 
significantly more authoritarian, as measured 
by the Defensiveness subscale at the third 
administration. These results were interpreted 
in terms of the social support for suppressed 
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“wrong” feelings and in terms of the pressures 
to achieve cognitive consistency in one’s self- 
concept. 
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ON GROUP COMPOSITION 


WILLIAM C. SCHUTZ 
University of California, Berkeley 


HE problem of composing groups to at- 
tain certain group outcomes has been of 
interest to social scientists especially 
in the last few years. Interest arises in part 
because prediction of group outcomes from a 
knowledge of the characteristics of the in- 
dividuals and of the laws of interaction con- 
stitute an empirical test of the ancient prob- 
lem of “emergents’—whether or not a whole 
can be predicted from a knowledge of its parts. 
Also, group composition presents an ultimate 
challenge to any theory of human interaction. 
If the theory is valid, it must predict certain 
outcomes when particular people interact. 
Finally, the problem of group composition has 
important practical implications for compos- 
ing groups such as therapy groups, work 
teams, committees, research teams, hospital 
wards, army squads, and any other situation 
in which particular qualities are desired in 
particular groups. 
After an extensive review of the literature 
on group composition, Gross (1959) concludes: 


In this investigator’s opinion, there have been three 
major systematic attacks upon the group composition 
question: (a) Powdermaker and Frank’s study on 
Veteran’s Administration patients undergoing group 
therapy (1953); (b) the National Training Laboratory 
in Group Development studies on small group function- 
ing (Stock and Thelen, 1958); and (c) Schutz’s work on 
his FIRO: A Three-Dimensional Theory of Interpersonal 
Behavior (1958) (p. 7). 


Powdermaker and Frank evolved three 
major group-relevant composition areas: the 
extent to which the patient is openly aggres- 
sive or competitive, the ability of the patient 
to tolerate aggression toward himself, and the 
patient’s ability to expose his own weakness. 
The studies by Stock and Thelen (1958) con- 
clude that important group composition varia- 
bles are Bion’s three modalities, pairing, de- 
pendence, and fight-flight. 

The approach taken by the present in- 
vestigator in past studies is based on the con- 
cept of compatibility. This approach assumes 
that wholes are predictable from a knowledge 
of the parts and the laws governing the inter- 


action of the parts. In terms of three basic in- 
terpersonal needs—inclusion, control, and 
affection—various indices of compatibility are 
computed for pairs of people, based on the 
characteristic ways in which they express be- 
havior in these areas and how much of each 
kind they want from others. From these con- 
siderations two kinds of compatibility are 
computed, one based on a notion of need com- 
plementarity called reciprocal compatibility, 
and one based on a notion of need similarity 
called interchange compatibility. 

These two types of compatibility can be 
computed for each of the three need areas. 
Several experiments (Schutz, 1958) have been 
performed that show the technique to have a 
certain degree of validity for composing pro- 
ductive, cohesive groups desirous of further 
mutual interaction (e.g., roommates, traveling 
companions). One study used a derived tech- 
nique to compose therapy goups (Gross, 
1959). 

The present study continues this interest 
in group composition, but differs in that (a) it 
deals primarily with variations of interchange 
compatibility, (b) it uses a different criterion 
from productivity, namely, awareness by the 
group members of the interactional characteris- 
tics of the group. 

If groups are composed to be homogeneous 
with respect to the group members’ expressed 
behavior and/or the behavior they want from 
others in the interpersonal areas of inclusion, 
control, and affection (as measured by 
FIRO-B), it is then hypothesized that after 
enough time has elapsed for the group mem- 
bers to become familiar with the group (six 
meetings in this case), the members should be 
able to identify correctly the interpersonal 
characteristics of their own group. 


METHOD 


Subjects and Setting 


This study was performed on 70 subjects divided 
into five groups of 14 each, at the Western Training 
Laboratory in Human Relations in July 1959, The 
groups were heterogeneous with respect to sex, age, 
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education, occupation, and income. The laboratory is 
designed for: “...persons involved in educational, 
consultant, leadership or administrative capacities. 
School administrators, business executives, training 
directors, religious education workers, teachers, com- 
munity organizations and civic leaders—in fact all 
those with major responsibilities for working with or 
through other people can profit from this laboratory.” 
This statement (WTL brochure, 1959) describes well 
the actual distribution of participants. 

The groups met once a day for 10 meetings of 1 hour 
and 45 minutes each, and the members had close con- 
tact with each other throughout the rest of the day, 
since the laboratory was located in an isolated residen- 
tial setting. The meetings were run as unstructured 
groups with no agenda and with the aim of studying 
group process and interpersonal behavior through ob- 
servation of the interaction among the group members. 
This type of group has been given various names: T 
group, training group, sensitivity training group, de- 
velopment group, basic group. 


Procedure 


Before any subjects met in groups, they were ad- 
ministered the FIRO-B (Fundamental Interpersonal 
Relations Orientation-Behavior) questionnaire. This 
instrument is composed of six Guttman scales of 10 
items each, measuring an individual’s expressed and 
wanted behavior in the interpersonal areas of inclusion, 
control, and affection (Schutz, 1958). On the basis of 
individual scores, the 70 participants were divided 
among five groups, each of which had members who 
were similar on certain of these characteristics. Follow- 
ing are the descriptions of the five groups based on 
measures derived from the FIRO-B. (For purposes of 
exposition the groups are designated by the letter of 
their FIRO-B descriptions, In the actual situation, of 
course, symbols unrelated to the descriptions were used 
to label the groups.) 

Group A is characterized by the fact that the 
majority of group members prefer to initiate behavior 
of any kind more than they like to receive it. This 
includes initiating interaction rather than waiting to 
be invited, giving orders and directions rather than 
receiving them, and to a lesser extent giving affection 
rather than receiving it, In general the desire for 
interaction with people is telatively low in this group, 
we the concern with authority and contro] is very 

ite 

Group B is characterized primarily by a concern. 
for the area of control and authority. Like Group A, 
the members prefer to exert control rather than sub- 
mit to it. They differ from A in that they prefer an 
average amount of interaction and have a consider- 
ably higher preference for receiving affection than 
initiating it. 

Group C is characterized mainly by its strong 
preference for receiving behavior of all types rather 
than initiating it. This includes taking orders and 
directions rather than receiving them, receiving af- 
fection rather than giving it, and to a somewhat 
lesser extent being invited to interact rather than 
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originating contact. The group members very much 
prefer to be interacting with people than not. Their 
desire to control other people is very low. 

Group D is strongly and primarily concerned with 
affection, both giving and receiving, although their 
preference is to receive. In the area of control they 
prefer to receive direction rather than to give it. 
They are quite high in interacting and in their desire 
for interaction. 

Group E is marked by a very low desire for close 
personal relations. The members also exhibit a 
strong preference for initiating interaction and con- 
tact with people rather than having others do it. 
They have a preference for receiving direction rather 
than giving it, though not as markedly as in Groups 
C and D. They are also moderately low on general 
interaction both initiating and wanting. 

Prior to examining the data the experimenter told 
the subjects that Groups A and B were similar in 
terms of FIRO-B group description, and also that 
Group C was similar to Group D. Group E was not 
especially similar to any others, This classification 
was done to allow for later analysis of the groups in 
terms of three types (AB, CD, E) as well as five. 

After six meetings all members of each group were 
given a dittoed sheet on which the group descriptions 
were written as above. Instructions were: 

Below are descriptions of the five afternoon groups 
as predicted from FIRO-B scores, In order to test 
the validity of these composition patterns, we would 
like you to evaluate your own group. Read over all 
five descriptions. Then select the one that seems best 
to describe your afternoon group. Place a 1 next to 
that description. Then select the one next most like 
your group and place a 2 next to that one, and. so 
forth, finally placing a 5 next to the description that 
sounds least like your group. 


REsULTS 


The original raw data! consist of rankings, 
from one to five, of the five groups in terms of 
respondents’ perception of similarity of each 
description to his own group. These rankings 
were analyzed in two ways: by weighting all 
ranks and summing them, and by first choice 
rank only. The weighted ranking data will be 
considered first. 


Percentage of Weighted Ranks 


The ranks were weighted in inverse order; 
i.e., a ranking of 1 was given a weighting of 5, 
rank 2 a weight of 4, etc. For a given group, 
the sum of weighted ranks given to each of the 
five group descriptions by all 14 members of 
that group was computed. By calculating the 
percentage of the total score given to each of 


* The author is indebted to Vernon Allen for data 
analysis and computation. 
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the five descriptions, the “percentage of 
weighted ranks” was obtained for each group. 
The hypothesis predicts that scores for Group 
A, for example, will reveal the highest per- 
centage of weighted ranks for the Group A 
description, and so on for the other groups. 

In order that the data be presented more 
meaningfully, these scores were transformed 
to take account of the limited range imposed 
by the ranking procedure. For example, when 
five categories are used, a description that 
happened to receive first place votes from all 
group members would have a weighted rank 
percentage of 33, while if the description re- 
ceived all last place votes it would have a 
weighted rank percentage of 7. The trans- 
formation was accomplished by setting 7% 
equal to 0 and 33% equal to 1. Then each 
obtained score is expressed as a proportion of 
the possible range of weighted rank percent- 
ages, a number between 0 and 1, with chance 
equal to .50. 

In order to determine the error in prediction 
resulting from the high similarity of group 
descriptions among the possible choices, the 
proportions were computed for two situations: 
when the five descriptions were considered 
separately; and when the five descriptions 
were reduced to three by combining A with 
B, and C and D, as announced prior to the 
data analysis. The results of the transformed 
percentage of weighted ranks for both the five 
and three description situations are given in 
Table 1. 

Using the ¢ test, three groups, A, B, and D, 
were predicted significantly greater than 
chance, while the other two groups fell at 
about the chance level. Overall for the 70 in- 


_ dividual predictions from all five groups the 


results were significant at the .01 level. Such 
an overall rate of predictive success (.70 to 
.75 correct when chance is .50) is typical of 
several studies using the FIRO composition 
technique (Schutz, 1958; Schutz & Gross, 


1961). 


First Choices 


The second method of handling the results 
was to treat only the first choice given by 
the group members. The number of first 


choices given by all group members of a par- 
ticular group was converted to a proportion 


TABLE 1 


For WEIGHTED RANKS, PROPORTION OF PERFECT 
Prepiction (1.00) ACHIEVED BY EACH GROUP 


(chance prediction .50) 


Using Using 


1 three 
R rodons ti Cry ieam (uA 
A 88 <.01 97 <.05 
B 86 05 88 05 
C 54 — 44 — 
D 17 .05 .91 05 
E 54 — -50 — 
All groups com- 72 .01 74 01 
bined 
TABLE 2 


PROPORTION OF GROUP MEMBERS CHOOSING THE 
CORRECT DESCRIPTION OF His GROUP AS 


First CHOICE 
Using Using 
Group five de- p three de- ’ 
scriptions scriptions 
A 57 <.05 1.00 <.01 
B 66 -05 .86 +05 
c -33 = 53 — 
D .50 = +92 05 
E 06 = 12 - 
Chance +20 33 
All groups 42 01 69 001 
caine 


of the total choices (14). Proportion of first 
choices as predicted is shown in Table 2, again 
with both five and three descriptive categories 
used for analysis. 

For five descriptive categories, Groups A 
and B are again well predicted and Group D 
just misses significance (using the ¢ test). When 
similar descriptions are combined a sharp im- 
provement occurs, making D once again sig- 
nificant. Although the .05 level of significance 
is not attained, even Group C is predicted 
about 1.5 times chance for first choices. Every 
member of Group A predicted either Group 
description A or its similar description, B. 
As might be expected, the analysis by weighted 
ranks did not differ significantly when de- 
scriptive categories were reduced from five to 
three, but the analysis of first choices was 
significantly affected. This finding shows that 
there was a close association between the 
choice of adjacent, similar descriptions. 
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Summary of Results 


Groups A, B, and D were predicted signifi- 
cantly better than chance by almost any 
measure, Group C showed a tendency to be 
well predicted as far as first choices were con- 
cerned, and Group E was poorly predicted by 
any measure. With all groups combined, the 
predictions were significantly higher than 
chance and lend considerable weight to the 
hypothesis and added confidence in the tech- 
nique of group composition, 


Discussion 


Why do Groups A, B, and D demonstrate 
such a high degree of predictive accuracy com- 
pared to Groups C and E? An answer seems 
to lie in the nature of the common charac- 
teristic by which each group was originally 
composed. Members of Group E had in com- 
mon a desire not to do something, namely, 
not to have close personal relations. There 
was a great variety of preference for the be- 
haviors they characteristically do express. 
Hence the behavior directly observable to 
group members was of a highly variable 
nature, a point reflected in their even distribu- 
tion of choices on all five group descriptions. 
Within each of the well predicted groups, A, 
B, and D, there was a commonality of ex- 
pressed behavior, either inclusion, control, or 
affection. The members of Group C were 
similar in that they all wanted someone else 
to initiate behavior. The commonality lay in 
the realm of want rather than of expression, 
Group C differs from E in that the members of 
E had a common wish to avoid something. 
The intermediate position of C with respect 
to predictability—not so good as A, B, and 
D, but not quite so bad as E—may be under- 
standable in these terms. 

The results suggest that groups composed 
around a common interpersonal characteristic 
are more identifiable by the group members 
after several meetings to the degree that the 
common trait is overtly expressed. Tf it is only 
a common desire rather than an overt expres- 
sion on the part of the members, the common 
group characteristic is harder to detect, and 
if this characteristic is a behavior or feeling 
that everyone wishes to avoid, it is hardest of 
all to identify. 
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The relation between this finding and the 
Postulate of Group Development (Schutz, 
1958) requires exploration. In part, the postu- 
late states that every group goes through the 
same developmental stages in the same order: 
namely, concern first with problems of inclu- 
sion (in or out), then with problems in the 
control area (top or bottom), finally reaching 
the area of affection (near or far). In the pres- 
ent study there was no direct test of the postu- 
late; it was simply noted that one area was 
stressed in each group. The other two areas 
may have been dealt with briefly before each 
group focused on its salient area, but there are 
no relevant data on this point. 


Behavioral Observations 


Although these groups were composed on 
the basis of homogeneity along the inclusion, 
control, and affection scales of FIRO-B, no 
specific predictions were made about the be- 
havior of the groups. This was because the 
interchange compatibility (Schutz, 1958) that 
was built into every group did not lead to any 
clear behavioral prediction about the content 
of the topics and the central concerns of an 
unstructured group. However, informal im- 
pressions of group behavior revealed striking 
differences among the groups, differences that 
seemed generally consistent with the compo- 
sition patterns. 

The group trainers? were asked, about 4 
months after the workshop, to describe the 
behavior of their groups. Since they were at 
that time familiar with the experiment they 
cannot be considered independent observers. 
However, they attempted to be as accurate 
and as objective as possible, and their reports 
are helpful for illuminating the behavioral 
phenomena corresponding to the objective 
FIRO-B composition patterns and are sugges- 
tive for understanding, future hypotheses, and 
applications. It is in this sense that they are 
presented, rather than as experimental evi- 
dence. 

The general characteristic of the best pre- 
dicted four of the five composed groups was 


? The trainers for Groups A, B, C, D, and E were, 

respectively, Schutz, John Weir, Robert Tannenbaum, 

ington Ingham, and Joseph Luft. The valuable 
cooperation of the last four is greatly appreciated. 
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>Æ a single focus of discussion, especially at the 


beginning meetings. In heterogeneous groups 
a wide variety of topics is typically discussed 
throughout the life of the group. In contrast, 
in each of these four composed groups there 
was a narrowing of discussion topics to the 
point where there was predominantly one area 
at issue. 

In the two initiating, controlling groups 
(A and B) the discussions centered around 
authority, power, and competition: 


In Group A the atmosphere for most of the first 
several meetings was one of tense caution and covert 
struggle. Each member seemed afraid to attempt any 
leadership act because of the perceived strength and 
power in many other members, which might be turned 
against the initiator. At one point a group member 
described the atmosphere thus: “I feel like there’s an 
electric charge circling through the group and each 
time it comes to me the hair on the back of my neck 
stands up.” 

There was a great deal of talk about the members’ 
relation to their boss and their relations with the trainer 
of the group. During the hours between meetings, for 
example, various members of Group A challenged the 
trainer to a ping-pong match, “jokingly” punched him 
on the arm, and tried almost to the point of personal 
injury to hit a softball farther than he did. In addition, 
one group meeting was almost disbanded so that the 
group could go out on the grass to see whether a partic- 
ular group member could throw a football farther than 
the trainer. There were many additional manifestations 
of concern with masculinity. The group members soon 
named their group the “Tigers,” later changed to the 
“Gorillas,” In the morning meetings there was a great 
deal of talk about hangovers and the previous night’s 
drinking, and the group members were highly overrep- 
resented each night among the people who, after the 
center had closed, stayed up to continue drinking and 
socializing. 

After several meetings this common group interest 
became clearly established. Attempts to understand 
the dynamics of competition and authority problems 
in group behavior were initiated when one group mem- 
ber said he felt “inadequate” in his job, and the people 
who promoted him did not realize his inadequacy, From 
this point on the group worked through from job in- 
adequacy to social inadequacy to a felt inadequacy with 
women (the group had 12 men and 3 women). Then 
ensued discussion about mothers and fathers who never 
gave reassurance, or permitted achievement and the 
attaining of competence, as possible explanations of 
their inadequacy feelings and hence their attitudes 
toward competition, authority, power, and masculinity. 


Hence in the short space of 10 meetings in 
which usual groups of this kind cover more 
topics to less depth, this group concentrated 
on one problem area—authority and compe- 


tence—and carried it through to a rather re- 
markable depth of insight. 

Group B, made up of people with a strong 
preference for controlling others, according 
to FIRO-B, is described as follows by the 
group trainer: 


The composition of the group was obvious to trainer 
and members almost from the beginning. The initial 
meeting was devoted to a heated denunciation of the 
behavior of a lab delegate who had appointed himself 
leader in a somewhat highhanded manner during a 
preceding general session. The group was very hostile 
toward him, resenting his leadership as presumptuous 
and censuring him as an authoritarian. 

The first four or five group meetings were devoted 
to competitive leadership strivings. The group com- 
pletely rejected any attempt by the trainer either to 
impose or insinuate training procedures or structure. 
There was much hostility directed toward him and at 
one time a majority decision to eject him from the 
group. This leadership attempt met with strong resist- 
ance from a vocal minority, resulting in the decision’s 
being changed to one of letting the trainer sit with the 
group, but ignoring him. 

At the same time the group members were struggling 
to establish warm affectionate relationships with each 
other. This need was finally brought into the open 
during the last five meetings and, when mutually shared 
and experienced, brought a diminution in the com- 
petition for leadership and a marked group integration. 
However, when the lab design later required each group 
to observe and be observed by another group the re- 
bellion toward authority flared up again almost to the 
point of an outright refusal to cooperate with the plan. 
Ultimately the group acquiesced to the observation 
procedure. During the observation of another group, 
the need to control became so intense one member said, 
“Aren’t you (the other group) going to ask us what we 
think of your meeting?” Finally the drive to exercise 
control could no longer be contained. The members 
burst out with observations, evaluations, and reactions 
to what they had observed, alternately praising their 
own group behavior and criticizing the behavior of the 
other group, Analysis and discussion of this even de- 
veloped much insight into the dynamics of the earlier 
group behavior regarding authority and control. 

The final meetings were characterized by the ex- 
change of affection and mutual acceptance. Conflict 
over authority, control, and structure disappeared and 
it became possible to explore the dynamics of authority 
and affection in a manner that was very deep and 
satisfying to the members. Several testified to the 
gaining of important new insights into their relation- 
ship with authority figures. 


Thus in this group, too, the predominant 
theme was control and authority, with affec- 
tion relations beginning to be evidenced 
toward the end. It is interesting to note that 
one member of this group was at variance with 
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the others from the outset. When they were 
so preoccupied with control she wanted to 
talk about expressing feelings and warmth. 
She could not believe’ she was like the other 
members (according to the composition pat- 
tern). Often she wrote of peace and concilia- 
tion on postmeeting reaction sheets distributed 
after each meeting, when virtually all the 
others would comment on strife and conflict. 
It turned out later that she was placed in the 
group only because of the mechanical limita- 
tions to making every group homogeneous. 
Actually, according to her FIRO-B score, she 
was very low on expressing control (2 on a 
scale of 0-9) compared to the others, and con- 
siderably higher on affection. She was quite 
relieved to discover this. 

Group D, composed of people high on the 
affection scales of FIRO-B, was described by 
its trainer in the following way: 


There was exceedlingly little discussion of authority 
or power, but a great deal about expressing and accept- 
ing feelings. In the beginning there was much talk of 
getting along fine in the group because everyone was 
considerate and understanding. Then there was an un- 
comfortable time when hostility was condemned for a 
surprising number of reasons, followed by a painful 
realization that the people could not really communi- 
cate without including the hostile things they felt, 
Even after this conclusion was accepted it remained 
unwelcome. 

Throughout the life of the group people could speak 
of their feelings of inadequacy. It was no problem for 
the group to accept and encourage this. Several mem- 
bers considered increasing prominence as a questionable 
thing. They recognized their desire for it, but also con- 
sidered that it detracts from family life, and may dis- 
tort a person’s values about the work itself, 


Here the emphasis was almost entirely on 
affection and expressions of emotion, with 
almost no attention to the problems that 
preoccupied Groups A and B. It is interesting 
to note that discussion of inadequacy, so diffi- 
cult to face in high-initiating Group A, was 
easily done by Group D. 

Group C, made up of participants with a 
preference for receiving behavior in all areas, 
especially control and affection, rather than 
initiating it, is described this way by its group 
trainer: 

I certainly had no difficulty in getting the group to 
follow most of my suggestions (there was really only 
one man who would resist and fight back). Actually the 
group gave very little overt attention to the control 
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area. The chief characteristic of the group was its very 
heavy occupation during most of its meetings with 
feelings. I believe the group members were aware of 
many more feelings than they were able to express, 
Anxiety or other concerns seemed to be associated with 
the expression of strong positive or negative feelings, 
Most members were unable to use themselves in order 
to facilitate the expression of feelings by others. Many 
had difficulty in expressing great discomfort in facing 
these strong feelings of others. The group had one of its 
most intensive experiences on 2 days in which all mem- 
bers of the group functioned with eyes closed. This 
seemed to make it easier to express, facilitate in others 
the expression of, and to receive the expression of strong 
feelings. Certainly there was great need to receive af- 
fection but a considerable inability of group members 
to express affection. To a lesser extent, this would be 
true with inclusion. No really expressed strong 
control and, as previously stated, the members were 
quite receptive to control, 


Here the chief characteristic of the group 
is its difficulty in expressing feelings. The 
anxiety they feel in initiating behavior and 
their need to 228 affection and inclusion, 
and great willingnéss to accept control, would 
be consistent with the composition pattern, 
Also consistent with the prediction is one con- 
trast between Group B—the high controllers 
—and this group. In Group C the trainer had 
no difficulty in getting the group to follow 
his suggestions, while in Group B the mem- 
bers completely rejected any such attempts 
by the trainer and actually once voted to eject 
him from the group. 

Group E, composed of persons described on 
FIRO-B as low on affection, was the poorly 
predicted group. Perhaps a description of the 
behavior of this group by the trainer will shed 
light on the predictive failure: 


There were two major factions, roughly authori- 
tarian and equalitarian (or high controllers and low 
controllers). Although the low controllers did not 
initiate actions they were highly effective in stopping 
the more agressive controllers. The low control people 
actually turned out to be stronger and eventually set 
the tone and norms of the group. The opposing factions 
locked horns early and for a day and a half the group 
was stalemated. This was symbolized by the copper 
pot which was used to collect anonymous statements 
regarding some significant events as seen by the group 
members. For nearly 2 days the group could not re- 
solve the simple matter of extracting the slips of paper 
from the pot. Every time the controllers initiated ac- 
tion, the others systematically exposed their motives 
and refused to agree to any action which forced even 
one person to do anything against his wishes. They 
finally found a way out of their bind, with the result 
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that everyone had lived through an experience of 
clarifying and resolving strong differences. This experi- 
ence, followed by others like it but of lessened duration, 
was heuristic, to say the least. Even the controllers 
were surprised at the affection they felt and could give 
as the crises were overcome. Having tamed tigers in 
their midst, they felt, when they visited the “Tiger” 
group, that the latter had a long way to go. 


The problem of affection did not arise until 
late in the group in the form of surprise at 
tħ affection they felt. Perhaps the struggles 
of the group at the beginning were actually 
methods for maintaining emotional distance 
among the group members. 

Thus if the FIRO-B composition patterns 
are accepted as accurate, the interesting result 
of the composition patterns was the alteration 
of the topics discussed to bring them into line 
with the “common group problem.” The possi- 
bility thus emerges of composing groups with 
certain characteristics for a given purpose. 
For example, a training group to be intro- 
duced to a variety of group phenomena might 
better be made heterogeneous on the FIRO-B 
scales, while a therapy or other type of group 
in which it is desirable to explore a single area 
in some detail could be composed homogene- 
ously. 


SUMMARY 


As part of the Western Training Laboratory 
in Human Relations, five 14-person groups 
were composed of persons with certain similar 
individual interpersonal needs, as determined 
by scores on the FIRO-B questionnaire. After 
meeting six times the group members were 
given behavioral descriptions of the five groups 
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with instructions to rate the five descriptions 
in terms of similarity to their own group. 

Three of the groups were accurate, identify- 
ing their own group descriptions significantly 
better than chance. One group showed a tend- 
ency to select the “correct” description for 
first choice, and one group was poorly pre- 
dicted. The total prediction was highly signifi- 
cant and lends confidence to the FIRO tech- 
nique for group composition. 

An explanation offered for the results was 
that homogeneous groups are predictable to 
the degree that the common characteristic is 
expressed behavior, rather than a common 
desire or a common wish to avoid a particular 
interpersonal area. 

Behavioral differences in the composed 
groups were noted. Each group seemed to 
settle on a particular topic, and went into this 
topic to a greater depth than is usual in 
groups of this type. Practical implications 
were noted. 
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AN INVESTIGATION OF UNILATERAL CEREBRAL PATHOLOGY 
WITH EQUATED VERBAL AND VISUAL-SPATIAL TASKS? 


ROSEMARY STARK? 
Wayne State University 


HE relationship of unilateral cerebral 
pathology to selective psychological 
deficit has been the subject of increasing 
experimental investigation over the past 
decade. Several studies (Anderson, 1951; 
Heilbrun, 1956; Reitan, 1955) have lent con- 
firmation to the well known clinical observa- 
tion that verbal deficit is more strongly asso- 
ciated with left hemisphere pathology and 
have further suggested that performance 
deficit may be more strongly associated with 
right hemisphere pathology. The verbal per- 
formance dichotomy was based primarily on 
the sums of the respective Wechsler-Bellevue 
Intelligence Scale scores. Examination of the 
consistent findings in these studies suggested 
that the performance deficit in subjects with 
right hemisphere pathology may be more 
specifically described as a visual-spatial deficit. 
Further, it was felt that tasks developed to 
maximize cerebral pathology would serve 
better than tests of general intelligence to 
measure such deficits. The present study was 
designed to investigate these relationships 
with paired-associate learning tasks in the 
verbal- and visual-spatial modalities. 
Paired-associate learning tasks were se- 
lected because they are well suited to maxi- 
mize the tendency toward perseveration, a 
characteristic often observed in the perform- 
ance of brain injured subjects. By persevera- 
tion is generally meant the persistent, non- 
adaptive repetition of an initial response 
through changing stimulus conditions. As 
such, it was observed in brain injured subjects 
by both Benton (1955) and Bender (1938) in 
their respective visual-spatial tests. It may be 
thought of as resembling the tendency to 


1 Condensed from a doctoral dissertation submitted 
to Wayne State University, completed under the direc- 
tion of Bertram D. Cohen with the assistance of 
Michael M. Reece. 

The author wishes to express appreciation to 
John E. Webster, MD and Stephen E. Gurdjian, MD 
for their cooperation in the referral of patients with 
thorough diagnostic evaluations. 

2 Now at the University of Detroit. 
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“repetition” noted by Piotrowski (1937) on 
Rorschach test protocols or as stemming from 
the “stimulus bondage” of brain injured sub- 
jects described so well by Goldstein and 
Scheerer (1941). The latter authors stressed 
particularly the tendency of brain injured 
subjects to fixate one aspect of the stimulus 
situation and their inability to shift to others. 

For the purposes of this study, perseveration 
is defined as the persistent repetition of re- 
sponses of high associative strength where they 
are no longer appropriate. The paired-associate 
tasks were structured so that they would 
first elicit and reinforce a strong association 
to a given stimulus and then require the learn- 
ing of a new, unrelated response to the same 
stimulus. Such a procedure should maximize 
the tendency to perseverate the strong associa- 
tion to the stimulus throughout the learning 
of the new response to the same stimulus. 

It was hypothesized that: 

1. On the verbal task, subjects with left 
hemisphere lesions show poorer verbal scores 
and more perseverative errors than either the 
right hemisphere lesion group or a control 
group with no cerebral pathology. 

2. On the visual-spatial task, subjects with 
right hemisphere lesions show poorer visual- 
spatial scores and more perseverative errors 
than either the left hemisphere group or the 
control group. 


METHOD 


Tasks. The stimuli and responses for the verbal task 
are shown in Table 1. The seven stimulus words were 
chosen at random from Shellenberg’s frequency table 
(Tinker, 1947). Since frequency of response has been 
interpreted as indicating strength of association, those 
associative responses most frequently given were 


- chosen as the strong or “easy” response. Seven other 


words, chosen from the same list because of their ap- 
parent unrelatedness to the stimulus words, served as 
the “difficult” competing response. 

The visual-spatial task was constructed in the 
same way. Figure 1 gives a sample of the items. Seven 
such simple, incomplete stimulus figures were selected 
for their apparent property of eliciting simple “closure” 
responses. 

Both tasks were tried out on a normal population 
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TABLE 1 
STIMULI AND RESPONSES FOR THE VERBAL TASK 
Easy Difficult 
Stimulus response ie response 
DARK light salt 
SLOW fast needle 
CARPET rug whistle 
HIGH low child 
LONG short health 
BED sleep street 
BLOSSOM flower hammer 


with the general aim of selecting responses that would 
give equivalent error scores. It was found that normal 
subjects did respond to most of the verbal stimulus 
words with the selected easy response and to the visual- 
spatial stimulus figures with the simple closure re- 
sponse, Those difficult visual-spatial responses were 
selected which yielded a mean error score equivalent 
to that on the verbal task. 

Subjects. Three groups of 20 subjects? each composed 
the sample in the final study. All were private patients 
of Detroit neurosurgeons, hospitalized for recent onset 
or recurrence of illness as follows: a control group of 
back injury patients (C group) in whom cerebral path- 
ology was ruled out, a right hemisphere lesion group 
(R group), and a left hemisphere lesion group (L 
group). All groups were screened for background fac- 
tors of serious mental or emotional disturbance. All 
had been functioning effectively in their normal occu- 
pations prior to the onset of illness. None had been 
institutionalized other than for brief periods of private 
medical hospitalization. Only right-handed subjects 
were included. Current functioning level was deter- 
mined on the basis of verbal IQ estimated from the 
Vocabulary subtest of the Wechsler Adult Intelligence 
Scale. A minimum verbal IQ of 80 was accepted; all 
but eight of the subjects scored 90 or above. All groups 
were equated for age and vocabulary scaled scores as 
shown in Table 2, and the two experimental groups 
were well matched in educational background. 

The brain injured subjects were diagnosed by a 
neurosurgeon following a complete neurological study 
including a neurological examination, an angiogram, 
an electroencephalogram, and, in some cases, a ven- 
triculogram, pneumoencephalogram, or skull X ray. 
‘These subjects were of two kinds, recent cerebrovascu- 
lar accidents and brain tumors. 

Since these were all private patients, only those 
who consented to take part were tested. Each subject 
was tested individually during his hospitalization by 
the same examiner. 

Procedure. Since the aim of the easy task was to 
reinforce the strong association and promote persevera- 
tion of this response in the learning of the difficult 
list, the easy list was presented to the criterion of two 


3 Complete information relevant to subject popula- 
tion, tasks, and statistical treatment of the data can 
be obtained from University Microfilms, Ann Arbor, 
Michigan, Doctoral Dissertation Series. The disserta- 
tion title is the same as that used for this article. 
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L 


Fic. 1. Sample stimulus figures and responses for 
the visual-spatial task. 


perfect trials or a maximum of three trials, this number 
being found sufficient for perfect learning in the normal 
population. On each trial of the tasks, each stimulus 
was presented in succession with a timed interval for 
the subject to respond before presentation of the cor- 
rect response. Five seconds were allowed for each ver- 
bal item and 10 seconds for each visual-spatial item, 
the longer time for the latter necessitated by the in- 
struction to draw the figure completion. The items were 
randomized for each trial. The first trial of the difficult 
list on each task was made a guessing trial to maxi- 
mize perseveration tendencies. On this trial the sub- 
ject was instructed to give any new response that came 
to mind as long as it was different from that on the 
easy list. The difficult list was then presented to the 
criterion of two perfect trials or a maximum of six 
more trials. Possible fatigue or practice effects were 
controlled by random assignment of all subjects to 
two order conditions. Half the subjects were given the 
verbal task first and the other half the visual-spatial 
task first. 

Scores. Only the responses from the difficult list 
were scored. The error score consisted of all the in- 
correct or missing responses from the last six trials of 
the difficult list. The first trial could not be counted 
in the error score since the subject had no way of an- 
ticipating the correct response. The perseveration 
score included all seven trials of the difficult list and 
consisted of all the responses which were repetitions 
of responses from the easy list. 


RESULTS 


Error scores. An analysis of variance tech- 
nique using a mixed design considering groups, 
tasks, and order on the error scores for the 
three groups, divided according to the order 
conditions, was performed first. Since no sig- 
nificant effect of order was indicated, the order 
conditions were combined. The means and 
standard deviations on the error scores for the 
three groups are seen in Table 3. Inspection 
of these means shows that the tasks were of 
approximately equal difficulty for the C group, 
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TABLE 2 
DESCRIPTION OF SUBJECTS 
Lesion 
Group Mean piesa yom Kind Duration 
cva Neo- 3 3 
plasm months months* 
C 49.7 9.8 
R 51.5 11.8 18 2 11 9 
L 50.6 11.2 17 3 11 9 
TABLE 3 
MEAN Error SCORE 
Groups 
Task c R L 
“M SD M SD M SD 
Verbal 19.80 | 7.14 | 19.05 | 9.06 | 30.95 | 8.07 
Visual-spatial 21.35 | 7.16 | 32.55 | 8.43 | 23.30 | 8,43 


but selectively difficult in the predicted direc- 
tion for the two brain injured groups. While 
the R group scored the same as the C group 
on the verbal task, they scored on the average 
approximately 11 more errors on the visual- 
spatial task. The L group, on the other hand, 
did almost as well as the C group on the 
visual-spatial task but scored on the average 
approximately 11 more errors on the verbal 
task. These mean differences between the 
experimental groups were found to be signifi- 
cant by ¢ test (p < .01), whereas none of the 
mean differences between experimental and 
control groups reached significance. 

These results are illustrated most clearly in 
Figure 2 where the means of the C group on 
the two tasks are used as the base line against 
which to compare the task means of the brain 
injured groups. The hypotheses concerning the 
error scores for all groups were confirmed. 

Perseveration scores. The results for the 
perseveration scores were not as clear-cut as 
for the error scores. Since a preliminary analy- 
sis showed no significant effect of order of 
administration, the means for both order con- 
ditions were combined. These means are shown 
in Table 4. Although an analysis of variance 

yielded a significant F (p < .01), an inspection 
of the means suggested that this was due to 


VERBAL VISUAL-SPATIAL 
TASK TASK am 


Fic. 2. Comparison of mean error score of 
groups, the control group (C group) mean used as the 
base line. 


the extreme perseveration of the R group 
the visual-spatial scores. Here it is seen 
this group scored an average of 14.70 perse 
ative errors in comparison with the C group 


Differences between the means for the thr 
groups on the verbal perseveration scores di | 
not reach significance. It was felt, however, 
that the trend of these means from 1.70 errors 
in the C group to 3.60 errors in the L group 
did indicate the presence of the verbal persev- 
erative tendency in the latter. 3 

Diagnostic value of the tasks. An analysis of 
the distribution of individual scores was made ~ 
to assess the degree to which the learning task 
Scores could identify an individual diagnosti- 
cally. Cutting scores were determined by 
graphing the distributions of the three groups 
separately for each task and selecting the 
highest crossover point which identified the 
brain injured group in question. On the verbal 
task, 85% of the C and the R groups scored 
below the cutting score, while 70% of the L 
group scored above it. On the visual-spatial 
task, 90% of the C group and 85% of the L 
group scored below the cutting score, while 
75% of the R group scored above it. 

The percentages of correct identification for 
each learning task considered separately, how- 
ever, cannot be considered evidence of diagnos- 
tic accuracy. A subject must not only be identi- 
fied as showing evidence of deficit on the 
appropriate task; this deficit must also clearly 
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TABLE 4 
MEAN PERSEVERATION SCORE 


Task c R 3 


Verbal 
Visual-spatial 


exceed the scores on the opposite task. When 
the evidence from both tasks was thus com- 
bined for each group, 75% of the C group and 
80% of the R group were correctly identified. 
The accuracy in the L group, however, dropped 
to 55%. This drop was due to the relative task 
scores of three subjects. Although these sub- 
jects exceeded the cutting score on the verbal 
task, their visual-spatial error scores were also 
so elevated that they would be misidentified 
as right lesion subjects. 

An examination of subjects misidentified in 
the two brain injured groups suggested that 
the factor of age may have a selective effect on 
performance on the two tasks. As a group, the 
mean age of these misidentified subjects was 
57.5 as compared with a mean age of 51.3 for 
the entire brain injured sample. This discrep- 
ancy led to an investigation of the relationship 
between age and learning task scores in all 
groups. Pearson product-moment correlations 
were performed between age and each task 
error score. A correlation of .47 (p < .05) was 
found between visual-spatial error score and 
age; a correlation of .23 (p < .10) was found 
between verbal error score and age. 


DISCUSSION 


At an empirical level, the results of the pres- 
ent study suggest some clear relationships re- 
garding localization of cerebral functioning. 
The differences on the error scores for both 
tasks make it possible to conclude that: 

1. Subjects with left hemisphere pathology, 
although able to learn and retain visual-spatial 
designs almost as well as the controls, are 
significantly inferior to either subjects with 
tight hemisphere pathology or controls in ver- 
bal learning and retention. 

2. Subjects with right hemisphere pathol- 
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ogy, although able to learn and retain verbally 
presented materials as well as controls, are 
significantly inferior to either subjects with left 
hemisphere pathology or controls in visual- 
spatial learning and retention. It should be 
noted that all subjects tested were right- 
handed; hence, the visual-spatial deficit in the 
subjects with right hemisphere pathology does 
not represent a motor deficit. 

Although the results on the perseveration 
scores were not as clear-cut, they were in the 
predicted direction. They clearly demonstrate 
the prevalence of the perseverative tendency 
in both brain injured groups as compared with 
the controls. The fact that this tendency is 
better demonstrated on the visual-spatial task 
may be due to a difference in the degree of 
conscious verbal control exercised in the re- 
spective tasks. The more frequent response of 
the left hemisphere lesion subjects on the 
verbal task was blocking, i.e., no response, 
Subjects frequently commented that the re- 
sponse they “had in mind” was the old one 
from the easy list but they knew it was wrong. 
It may be that verbal responses are more sub- 
ject to verbal control than are visual-spatial 
responses. Hence, the overt perseveration 
would be less likely to be demonstrated on 
verbal tasks than on visual-spatial tasks. 

While the results obtained with these learn- 
ing tasks show clear group differences, much 
individual variability remains. Although the 
cutting scores showed an average of about 
85% accuracy for each task considered sepa- 
rately, this percentage dropped considerably 
when overlap on the task scores was considered. 
It was further evident that this percentage of 
success was due primarily to the right hemi- 
sphere lesion group where 80% of the sub- 
jects were correctly identified. Considering the 
left hemisphere group alone, it is clear that 
55% accuracy is hardly sufficient to place 
much faith in the tasks for individual diagnos- 
tic purposes. 

“However, examination of the distributions 
of the two task scores revealed something that 
may be relevant in other psychological diag- 
nostic tests in the area of cerebral pathology. 
This concerns the overlap of the visual-spatial 
scores. It was noted that the reason for the 
poor diagnostic accuracy in the left hemisphere 
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lesion group was not the failure to demonstrate 
deficit on the verbal task but, rather, the fact 
that older subjects with cerebral lesions tended 
to show elevated visual-spatial scores regard- 
less of area of lesion. The significant positive 
correlation of age with visual-spatial scores in 
all groups further suggested that an increase of 
visual-spatial errors with age may also be ex- 
pected in a normal population. These findings 
suggest that visual-spatial functioning is more 
easily disrupted by both age and cerebral pa- 
thology than verbal functioning. Hence, visual- 
spatial tests of cerebral pathology should be 
carefully equated for age if they are to have 
any validity. 

An examination of individual test scores 
raised another point that may be important 
for both localization studies and the use of tests 
for diagnosis. Much of the variance on the 
respective tasks was found to be due to the 
extreme scores of several highly deviant indi- 
viduals in both brain injured groups, e.g., a 
radiologist, a minister, an engineer, and a psy- 
chiatrist. Such subjects—individuals with good 
educational background and a general intact- 
ness in most areas of intellectual functioning— 
were found to be the best subjects for demon- 
strating clear localization relationships. 

Tn summary, the tasks here developed, while 
showing very clear results on a group basis, at 
the same time show limitations for individual 
diagnostic use. The validity of such tasks could 
be greatly improved with norms for age and 
general level of functioning. 

The implications of the present study for 
localization of cerebral functioning could be 
clarified in two needed follow-up studies. It is 
possible that the present lateral hemispheric 
division of brain injured subjects obscured im- 
portant anterior-posterior relationships. It may 
be that the verbal-symbolic functions are more 
closely related to anterior portions of the brain 
and visual-spatial to posterior, as indicated, for 
example, by Benton (1955, p. 36). A definitive 
study would include at least four groups of ex- 
perimental subjects: right and left hemisphere 
anterior, and right and left hemisphere pos- 
terior. Further, it is not known what possible 
effects cerebral dominance may have on this 
apparent functional asymmetry of the brain. 
The present study should be repeated on a 
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left-handed population to clarify the impor 
tance of dominance in relation to laterality of 
lesion. 


SUMMARY 


This study was designed to investigate the 
relationships between left hemisphere pathol- 
ogy and verbal deficit and right hemisphere 
pathology and visual-spatial deficit. Equated 
paired-associate learning tasks were adminis- 
tered to three groups of 20 subjects each: a 
control group of back injured subjects in whom 
cerebral pathology was ruled out and two brain 
injured groups with demonstrable right and 
left hemisphere lesions. 

The results confirmed the hypotheses that: 
subjects with left hemisphere pathology are 
significantly inferior to both subjects wi 
right hemisphere pathology and controls in 
verbal learning and retention, and subjects 
with right hemisphere pathology are signifi- 
cantly inferior to both subjects with left hemi- 
sphere pathology and controls in visual-spatial 
learning and retention. Both brain injuri 
groups performed as well as the controls o1 
the tasks apparently unrelated to the area of 
their lesions. 

A significant relationship between visual- 
spatial performance and age was demonstrated 
in all groups. The results suggested that visual- 
spatial functioning is more easily disrupted by 
both age and cerebral pathology than verbal 
functioning. f 

These findings suggest an apparent func- 
tional asymmetry of the brain in respect to ver- 
bal and visual-spatial tasks. Two follow-up 
studies are indicated: one in which the groups _ 
would be further subdivided within each hemi- 
sphere as anterior and posterior; the other, a 
repetition of the present study on a left-handed 
population to determine the possible effects of ` 
cerebral dominance on laterality of lesion. 


REFERENCES 


ANDERSON, A. L. The effect of laterality of localiza- 
“tion of brain damage on Wechsler-Bellevue in- 

dices of deterioration. J. clin. Psychol., 1951, T, 
149-153. 

BENDER, L. A visual motor gestalt test and its clinical 
use. Res. Monogr. Amer. Orthopsychiat. ASS- 
1938. No. 3. 

Benton, A. L. The revised visual retention test: Clinical 


UNILATERAL CEREBRAL PATHOLOGY 287 


and experimental applications. Iowa City: Iowa organic disturbance of the central nervous sys- 

State Univer. Press, 1955. tem. J. nerv. ment. Dis., 1937, 86, 525-537. 
GOLDSTEIN, K., & SCHEERER, M. Abstract and con- REITAN, R. M. Certain differential effects of left and 

crete behavior, an experimental study with special right cerebral lesions in human adults. J. comp, 


tests. Psychol. Monogr., 1941, 53(2, Whole No. physiol. Psychol., 1955, 48, 474-477. 
TINKER, M. A. Introduction to methods in experimental 


239). 
HEILBRON, A. F., Jr. Psychological test performance as psychology. (2nd ed.) New York. Appleton-Cen- 
a function of lateral localization of cerebral lesion. tury, 1947. 


J. comp. physiol. Psychol., 1956, 49, 10-14. 
Prorrowsx1, Z. The Rorschach inkblot method in (Received February 4, 1960) 


Journal of Abnermal and Social Psychology 
1961, Vol. 62, No. 2, 288-293 


SOME DETERMINANTS OF SELF-EVALUATION 


HAROLD B. GERARD? 
Bell Telephone Laboratories, Incorporated, Murray Hill, New Jersey 


HERE appear to be two types of social 

comparison that affect self-evaluation. 

In one process a person directly com- 
pares his standing on a given attribute with 
that of others; in the other, his self-evaluation 
is influenced by his conception of how others 
regard him, in a way reminiscent of Cooley’s 
(1909) “looking glass” self. The distinction 
parallels one drawn by Deutsch and Gerard 
(1955) between informational and normative 
influence upon an individual’s judgment re- 
garding a nonpersonal object. Informational 
influence involves reliance upon another as an 
impersonal source of information concerning 
some object that is being judged, just as one 
would refer to an arbitrary measuring instru- 
ment. Normative influence can occur only 
when members of the comparison group are 
able to reward or punish the person for his 
acts and beliefs. When sanctions can be ap- 
plied the individual tends to conform to the 
others’ expectations of him. Direct and re- 
flected comparison correspond to informational 
and normative influence, with the self as ob- 
ject. 

Previous research (Festinger & Thibaut, 
1951; Gerard, 1953) supports the expectation, 
for either type of social comparison, that the 
greater the discrepancy between the individ- 
ual’s current estimate and the estimate de- 
rived from comparison, the greater his tend- 
ency to change his current estimate. 

The relevance of the distinction between the 
processes of direct and reflected comparison 
should be established if factors can be found 
that produce differential effects on self-evalua- 
tion in one type of comparison but not in the 
other, effects that can therefore be attributed 
to mediation by one rather than the other 
process. Any factor that affects the degree to 
which members of the comparison group have 
means-control over the person’s satisfactions 
should tend to produce reflected comparison. 


1 The author wishes to thank G. S. Rotter for help 
with the analysis of the data and M. Deutsch and 
W. M. Evan for critical suggestions regarding the 
manuscript. 


Naturally-given individual differences in de- 
pendency upon others for self-evaluation 
should presumably also affect the degree to 
which the person adjusts to a discrepancy in a 
situation engendering reflected comparison, 
Logically, either experimentally induced de- 
pendency or a naturally given variable may 
serve the role of detecting reflected comparison. 
From the foregoing discussion these predic- 
tions follow: 

1. Given a discrepancy between anticipated 
and actual standing in a group, change in self- 
appraisal should result; the greater the dis- 
crepancy, the greater the tendency to change. 

2. If the skill involved is relevant to the 
individual’s standing in the group, anticipated 
confrontation by others in the group should, 
in addition to direct comparison, produce re- 
flected comparison. Reflected comparison in 
turn should result in an increased tendency for 
the person to change his self-evaluation in the 
face of a discrepancy between anticipated and 
actual standing. 

3. The greater the person’s dependence upon 
others for self-evaluation, the greater should 
be the tendency for reflected comparison. 


METHOD 
Subjects 


A total of 159 undergraduates at New York Univer- 
sity served as subjects. Fifty-one potential subjects 
had to be eliminated since they did not participate in 
all three phases of the experiment. 


General Setting 


The experiment was conducted in the classroom 
and used time during each of three class sessions. 
During the first session a specially devised question- 
naire was administered that provided an a priori meas- 
ure of dependency upon another for self-evaluation. 
Each item could be answered “true” or “false.” Some 
items referred to personal characteristics whereas 
others confronted the subject with a hypothetical 
social situation and asked him how he would react to 
it. A large pool of likely items were collected from a 
number of sources and each item was thoroughly dis- 
cussed with the author’s colleagues in terms of its rela- 
tionship to dependency upon others for self-evalua- 
tion. A tentative list of 71 items was administered to 
a comparable sample of subjects at another university. 
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Items that failed to discriminate, i.e., were answered 
by nearly everyone as either true or false, were elimi- 
nated, as were items that correlated poorly with the 
subject’s total score. The final list upon which the 
present analysis is based consists of 27 items. Examples 
of items that refer to a personal characteristic are: 

One of my secret dreads is to stand out like a sore 

thumb. 

I don’t mind being the butt of a joke. 

Examples of items that confront the subject with a 
hypothetical situation are: 

When someone asks me to lend him something, I 

usually can’t refuse even when my better judgment 

tells me to refuse. 

I feel uncomfortable when I enter a room where 

everyone has already assembled and is in a discus- 

sion. 

To have induced differential dependency experi- 
mentally would have required closer control than can 
usually be achieved in a classroom situation; we there- 
fore settled for natural variation in a tendency that 
could serve alternatively as a detector of reflected com- 
parison, 

During the second session, the Psychological Corpo- 
ration’s Space Relations test was administered. In 
order to heighten concern with performance, subjects 
were told that the ability involved is rather critical 
in most walks of life. The test items are quite difficult 
and it was presumed that the subject would be unable 
to predict with much certainty what score he had 
made. In other words, his expected score and self- 
evaluation should tend to be somewhat labile. Immedi- 
ately after the test, a questionnaire was given in which 
the subject was asked to indicate the score he expected 
to make (the scoring system was carefully explained) 
and he was asked to guess the score he thought a typi- 
cal member of a list of groups would have made. 
Among others, this list included a “group of your 
friends,” “honor students at New York University,” 
and “the lower 10% of students at New York Univer- 
sity.” The subject was asked for these estimates to 
provide a basis for determining his own relative level 
of aspiration. Since the test was introduced as a fairly 
good predictor of academic success, the subject’s 
guesses as to how the best and poorest student would 
have fared were thought to establish a range within 
which his own score expectation could be evaluated. 
The questionnaire contained a six-point a priori scale 
on which the subject was asked to indicate his ability. 
He was also asked to indicate the score he would have 
made under ideal conditions. A number of other periph- 
eral questions were included, to which reference will 
be made later. 

The third session followed the second by 2 days. 
During that class period each subject received, on a 
slip of paper, the score he had presumably made. 
Another questionnaire was then administered which 
was substantially the same as the one administered 
immediately after the Space Relations test. Some of 
the wording had to be changed to accord with the fact 
that the subject now had his score. ; 

The dependency questionnaire administration pre- 
ceded the second session by 2 weeks and was so pre- 
sented that the subjects could not connect it with what 


289 


followed. Its eventual use, any other deception in- 
volved, and the general purpose of the experiment were 
revealed at the end of the third session. 


Experimental Manipulations 

In order to induce differential reflected comparison, 
approximately half of the subjects were led to believe 
that everyone’s performance would shortly be public 
knowledge. They were told that a roster of scores 
would be passed around the class and then posted so 
that each person could see just where he stood in rela- 
tion to particular people in the class. He would thus 
be confronted by the evaluations of others by virtue 
of their knowledge of his relative standing, with at- 
tendant, anticipated social consequences. The other 
half of the subjects were told that each individual’s 
score was his own business and would under no cir- 
cumstances be made public. 

To vary discrepancy, the score given each subject 
was falsified such that within the public and private 
treatments a variation in the discrepancy between ex- 
pectation and performance was produced. The subject 
received either a false low score or a high score. The 
false low scores were either 10, 11, or 12 whereas the 
false high scores were 67, 68, or 69. A presumably 
accurate but actually false distribution of scores was 
written on the blackboard, and it was made clear that 
the high false scores placed the individual in the upper 
5% of the class, whereas the scores selected as the low 
false scores placed the individual in the lower 5% of 


the class. 
RESULTS 


Changes in Self-Evaluation and Level of Aspira- 
tion 

An index of expected performance was de- 
rived from the questionnaire administered im- 
mediately after the test. The index, I., was 
computed as follows: 


Es — En 


T Ex — Ew’ 


where Es = subject’s expected score, Ew = 

subject’s guess as to the score a mem- 
ber of the lower 10% of the class would 
make, and Eg = subject’s guess as to the 
score an honor student would make. The in- 
dex represents the subject’s estimate of the 
quality of his own performance relative to his 
estimate of the general range of scores for the 
student body at large. Subjects with an index 
at or above the median were classified as ex- 
pecting a high score, whereas subjects with an 
index below the median were classified as ex- 
pecting a low score. The two degrees of score 
expectation were cross-classified with the high 
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and low false score feedback to yield four 
treatments, two in which the discrepancy be- 
tween expectation and presumed performance 
was relatively small and two in which the dis- 
crepancy was relatively large. 

The data presented in Table 1 indicate the 
mean shift in self-evaluation by subjects in all 
16 experimental conditions. If we examine the 
amount of change produced by the three ex- 
perimental variables—public vs. private (Cells 
1-8 vs. Cells 9-16), high discrepancy (where 
presumed and expected performance were quite 
different, i.e., Cells 3-6 and Cells 11-14) vs. 
low discrepancy (where presumed and expected 
performance tended to be somewhat alike, i.e., 
Cells 1, 2, 7, 8, 9, 10, 15, and 16), and high vs. 
low dependent (odd vs. even numbered cells)— 
we find that, overall, each of the variables 
produces significant effects in the expected 
direction. However, it is apparent that the in- 
teractions between the experimental variables 
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are such that it is not meaningful to consider 
their effects independently of one another. _ 

A change in self-evaluation should be re- 
flected in a change in level of aspiration. On — 
the questionnaire administered immediately — 
after the test subjects were asked to indicate 
what score they would have received under 
ideal conditions (with no distractions, etc.), 
The same question was asked after they re- 
ceived their scores. Table 2 presents the mean 
change in the score that would have been 
made under ideal conditions, from before to 
after the presumed score was received. These 
results also reflect the trends observed in 
Table 1. 

The interaction effects are as follows: 

1. Dependency has consistent effects upon 4 
self-evaluation only when the subject antici- 
pates that his score will be made public. If he 
has this anticipation, he changes his self-eval- 
uation and ideal score to a greater extent if he 


TABLE 1 
CHANGE IN SELF-EVALUATION® 
Received High Score Received Low Score 
Expected High Score Expected Low Score Expected High Score Expected Low Score 

Public 

High Dependent +0.88 (1)> +2.00 (3) —2.00 Q —0.38 g 
Ane Dependent +0.70 (2) +1.29 (4) —0.33 (6 0.00 (8) 

rivate 

High Dependent +0.67 th +0.33 W —0.20 (13 —0.57 i 

Low Dependent +0.40 (10) +0.75 (12) -1.50 KEU —0.75 18 


^ The questionnaire item upon which these data are based is: 

I think my ability to visualize things in space is 
——excellent 
——very good 


——-somewhat poor 
very poor 


: ——xzood ——extremely poor 
‘The figures indicate the average shift and its direction which occurred as a result of the knowledge of relative standing in the class. 
b The figures in parentheses are cell numbers to facilitate reference in the text. 


TABLE 2 
CHANGE IN Score THAT Woutp Have BEEN OBTAINED UNDER IDEAL CIRCUMSTANCES® 


Received High Score Received Low Score 
Expected High Score Expected Low Score Expected High Score Expected Low Score 
Public T 
High Dependent +8.25 (1) +25.13 (3) ~28.33 (5 21:88 (7 
x Low Dependent +6.40 (2) +21.14 (4) 18.44 G —9.25 B 
High Dependent +0.50 (9) +8.33 (11 —24.60 (13 —31.14 (15 
Low Dependent +0.70 3) +6.00 {3} 33.75 R —30.00 3 


^ The sign of the number indicates the direction of the change. 
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is high than if he is low in his dependence 
upon others (p < .01 by £ for both compari- 
sons). There is some indication in Table 1, 
which is not supported in Table 2, that under 
the private treatment the reverse effect oc- 
curred. 

2. Discrepancy produces consistent effects 
only in the public treatment (p < 01 by ¢ 
for both comparisons). It is only when the 
subject anticipates confrontation that the 
larger the discrepancy between expectation 
and standing the greater the change in both 
self-evaluation and ideal score. The relation- 
ship holds despite an obvious artifact involved 
in the measure used. The opportunity for 
change in scale position is in general greater 
the more the person’s performance deviates 
from his initial expectation. If the subject ex- 
pects a low score, his self-evaluation should 
tend to be on the low side. Receiving a very 
low score further tends to lower his self-evalua- 
tion. The opportunity for lowering, however, 
is generally only one or two scale points at 
most. On the other hand, if the same subject 
receives a very high score, the opportunity for 
increasing his self-evaluation may be as much 
as five scale points. As a result of the differen- 
tial ceiling placed upon the possibility for 
change, therefore, we should expect greater 
change where expectation and presumed per- 
formance are discrepant. 

3. When the subject’s attained score is high, 
he is likely to change his self-evaluation (in a 
positive direction) to a greater extent in the 
public than in the private treatment. On the 
other hand, when the subject’s attained score 
is low, he is likely to change his self-evaluation 
(in the negative direction) to a greater extent 
in the private treatment (p < .001 by # for the 
interaction). It will be noted that the com- 
parison of Cells 5 with 13 in both tables is an 
exception to the general trend. 

In terms of the distinction between direct 
and reflected comparison, changes in self-eval- 


2 The standard analysis of variance format was not 
used since many of the comparisons involved collapsing 
the basic 16-cell design in a number of ways that were 
not orthogonal. Also, ¢ tests were easier to compute 
since because of variations in class size (five college 
classes in all were used) and of attrition, n’s were un- 
equal. Comparisons were based upon the amount of 
change irrespective of direction. 
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uation that occurred in the private treatment 
may be attributed primarily to direct com- 
parison? An understanding of the effects of 
anticipated appraisal by others involves in- 
terpreting the foregoing interactions. 

One of the interactions indicates that de- 
pendency showed consistent effects only in the 
public treatment. Apparently, the degree of 
dependence upon others for self-evaluation is 
relevant only where appraisal from others is 
anticipated. This is the evidence we were look- 
ing for in order that we infer the occurrence of 
reflected comparison in the confrontation treat- 
ment. 

The interaction between discrepancy and 
anticipated confrontation suggests that re- 
flected comparison was more important in 
producing change than was direct comparison. 
That direct comparison was effective is indi- 
cated by the trend, in Table 2, for greater 
change in the high as compared to the low 
discrepancy treatments, especially in the high 
received score treatment. With the addition of 
anticipated confrontation, however, strong and 
clear effects emerge. 

The interaction between confrontation and 
received score can best be interpreted by con- 
sidering the satisfactions mediated by the in- 
dividual’s status in the comparison group. 
Since the Space Relations test was introduced 
as a good predictor of academic success, we can 
assumethat thesubject considered the skill ger- 
mane to the activities of the comparison group. 
The subject’s status in the group is presum- 
ably mediated by the attribution of skills to 
him by others in the class. If we assume that 

sIt is ible that in the private treatment some 
subjects anticipated having to reveal their score at 
the end of the class session. When a person 1S asked 
his score, it is not considered good form for him to 
refuse to divulge it. To the extent that this occurred, 
the results for the private treatment are attributable 
to reflected comparison. Tt would not be too difficult 


k out a procedure in which the operation of re- 
a gloat a would be minimal, A further prob- 
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high status is desired, and that during the 
course of the anticipated confrontation high 
status will be attributed to the subject, he 
should show little or no resistance to raising 
his self-evaluation. If, on the other hand, his 
standing is lower than he anticipated, he 
should resist lowering his self-evaluation since 
this would be de facto acceptance of lowered 
status in the group. Since status considerations 
are not involved in the private treatment, 
there should be less resistance to lowering self- 
evaluation. The positive bias observed in the 
public treatment seems understandable in 
these terms. Since this experiment was con- 
ducted Mischel (1958) has reported a similar 
finding—that the individual tends to resist 
lowering his self-evaluation after publicly ex- 
posed failure. 


Emotionality and Change in Self-Evaluation 


A discrepancy between two sources of in- 
formation about the self (or an aspect thereof) 
should, by our present assumptions, produce 
tension which would subsequently tend to be 
reduced. We have thus far discussed one ave- 
nue of tension reduction, namely, a change in 
self-evaluation that leads to equilibration of 
the original self-estimate with the estimate de- 
rived from social comparison. Presumably, a 
negative relationship should appear between 
the amount of change and tension reduction. 
An indirect test of this consequence was avail- 
able. The questionnaires administered before 
and after the Space Relations test contained a 
question that asked the subject to indicate to 
what extent emotional feelings tend to inter- 
fere on a test like the one he had taken. The 
amount of change in emotional feelings was 
found to be negatively correlated with both 
change in ability evaluation (p < .05 by chi 
square) and change in ideal score (p <` .001 
by chi square). 


Other Avenues of Tension Reduction 


Various modes of reducing discrepancy are 
open to the person in the situation studied 
other than changing his self-evaluation: at- 
tributing little validity to the test, denying the 
importance of the skill in question, attempting 
subsequently to influence the other to change 
his appraisal, and shifting the frame of refer- 
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ence for self-evaluation. We shall examine each 
of these in turn. 

Since it is via the test score that the other 
will come to appraise the individual, devaluing 
the test as a valid source of information regard- 
ing one’s ability should also serve to reduce | 
discrepancy tension. Both questionnaires asked 
the subject to rate the validity of the testasa 
measure of spatial visualization. Uniformly — 
high validity was attributed to the test on the 
first questionnaire and very little change in 
attributed validity occurred, probably be- 
cause of the way in which the test was intro- 
duced and the very elegant and impressive 
format of the test booklets. There was a slight 
trend, however, for a subject who expected a 
high and received a low score to lower his 
validity estimate more than a subject who 
both expected and received a low score (p < 
-15 by chi square). If the test were introduced 
as being of somewhat questionable validity as 
a measure of the ability, we would expect dif- 
ferences in attributed test validity rather than 
changes in self-evaluation. To the extent that 
attributed validity is more labile than self- 
evaluation we would expect discrepancy ten- 
sion to produce changes in validity, the results 
paralleling those obtained in the present ex- 
periment. 

On both questionnaires the subject was 
asked to indicate how important the test re- 
sults were to him. Since the test was described 
as a good predictor of academic success we can 
assume that the subject’s response reflected 
the degree to which his score was relevant to 
skill attribution in his present comparison 
group. Subjects attributed uniformly high im- 
portance to the test. There was some change 
after the subject discovered his relative stand- 
ing. When the subject received a low score he 
tended to lower the importance of the test, 
whereas when the subject received a high score 
he tended to increase its importance (p < .001 
by chi square). Here is evidence for another 
mode of reducing discrepancy tension. 

In discussing change in self-evaluation, the 
conjecture was advanced that the low scoring 
subject who anticipated confrontation saw his 
performance as a threat to his status in the 
group. His resistance to lowering self-evalua- 
tion may reflect the anticipation that he would 
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be able to influence the other’s appraisal of 
him in subsequent discussion and thus disa- 
buse him of the appraisal reflected by his low 
score. The prediction should follow of an in- 
verse relation between amount of change and 
tendency to communicate, an hypothesis in- 
viting further research. 

In addition to changes in self-evaluation, 
changes in scores attributed to members of 
various groups in the general population were 
also available for study as an indication of 
changes in the subject’s frame of reference. 
The subject was asked to guess what score a 
typical member of the following groups would 
make: your friends, your class, undergraduates 
at this school, “the man in the street,” and 
“a first-rate mind.” There were no clear effects 
of dependency and discrepancy. However, the 
public-private variable produced striking ef- 
fects that parallel those for self-evaluation and 
level of aspiration. The attributed scores were 
consistently lower in the private treatment for 
all of these groups (in all instances the p values 
were well beyond the .001 level of confidence). 
It would appear then that another adjustment 
to the discrepancy tension induced by antici- 
pated confrontation is a shift in the frame of 
reference for evaluating one’s ability. 


SUMMARY 


Processes of direct and reflected comparison 
were distinguished. In the former the person 
uses the skill evaluation of others as a “yard- 
stick” on which to place himself, whereas in 
reflected comparison he refers his self-evalua- 
tion to the appraisal he assumes others have 
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of him. The study manipulated: the degree of 
discrepancy between an individual’s appraisal 
of himself and his relative standing in a com- 
parison group, and the anticipation of ap- 
praisals from others in the comparison group. 
The degree to which the person was dependent 
upon others for self-evaluation was also meas- 
ured. 

Self-appraisal, according to the findings, is 
influenced by direct comparison of one’s per- 
formance to the performance of others. The 
addition of anticipated confrontation (which 
presumably engenders reflected comparison), 
produces a positive bias in self-evaluation, 
produces an increase in the effects of discrep- 
ancy, and provides the medium in which dif- 
ferential dependency may operate. Interpre- 
tations of these results were offered. Additional 
avenues of discrepancy tension reduction were 
also examined. 


REFERENCES 


Cootey, C. H. Social organization. New York: Scrib- 
ner, 1909. 

Devtscu, M., & Gerarp, H. B. A study of normative 
and informational social influences upon individual 
judgment. J. abnorm. soc. Psychol., 1955, 51, 
629-636. 


Fesrincer, L., & THIBAUT, J. Interpersonal com- 
munication in small groups. J. abnorm. soc. Psy- 
chol., 1951, 46, 92-99. 

Gerar, H. B. The effect of different dimensions of 

i t on the communication process in 
small groups. Hum. Relat., 1953, 6, 249-271. 

MıscHeL, W. The effect of the commitment situation 
on the generalization of expectancies, J. Pers.. 
1958, 26, 508-516. 


(Received February 5, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol. 62, No. 2, 294-302 


THE SOCIAL PSYCHOLOGICAL NATURE OF STRESS: 
A FRAMEWORK FOR CAUSAL INQUIRY? 


BRUCE P. DOHRENWEND?* 
Columbia University 


number of recent studies have begun 
A to lay a groundwork of theory and 
fact about relations between social 
environmental factors and mental illness. 
Notable among these researches are the New 
Haven study (cf. Hollingshead & Redlich, 
1958), the Midtown study (cf. Rennie, Srole, 
Opler, & Langner, 1957), and the Stirling 
County study (cf. Leighton, 1959). These 
studies have differed among themselves in 
several ways: in the definitions of mental dis- 
orders employed, in whether they have at- 
tempted to assess untreated as well as treated 
prevalence, in the particular social and cul- 
tural factors emphasized, and in the kinds of 
population studied. The studies have, how- 
ever, four things in common. First, their 
emphasis tends on the one hand to be on gross 
environmental features described, for example, 
as social disintegration or social class, and on 
the other hand, on mental disorder. The in- 
tervening experiences of the individual receive 
less theoretical and empirical attention in these 
searches for the ecological outlines of the prob- 
lem (cf. Dohrenwend, 1957; Hollingshead & 
Redlich, 1958, p. 360; Leighton, 1959, pp. 3-5). 
Second, all set forth as their central assump- 
tion, in one form or another, the theoretical 
position that stressful social and cultural fac- 
tors are causally related to mental disorder. 
Third, all report strong relations between 
various social and cultural factors and mental 
disorder. And fourth, all are cross-sectional 
studies of communities or urban neighbor- 
hoods, conducted at one time and in one place, 
without experimental controls. 
These last circumstances set severe methodo- 


1 This work has been supported in part by Mental 
Health Project Grant OM-82 from the National Insti- 
tute of Mental Health, United States Public Health 
Service. It has been conducted under the joint auspices 
of the Department of Psychiatry and the School of 
Public Health and Administrative Medicine. 

2I am grateful to Barbara S. Dohrenwend, Jack 
Elinson, and Lawrence C. Kolb for their helpful criti- 
cism of this paper. 
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logical limitations on the power of these 
studies to test the general theoretical position. 
The chief limitation is this: correlations found 
between sociocultural factors and mental 
illness in these cross-sectional studies do not 
demonstrate causal relationships; the well 
known “drift hypothesis,” for example, would 
suggest an opposite interpretation, counting 
mental disorder the cause, and disorganized 
social conditions the effect produced by down- 
ward drift of mentally ill persons. 

In contrast to these community studies are 
laboratory investigations of stress. A con- 
vergence may be observed between work such 
as Hans Selye’s on physical and chemical 
changes associated with stress, and such 
notable research in social psychology as that on 
frustration and aggression (Dollard, Doob, 
Miller, Mowrer, & Sears, 1939), Mostintriguing 
in this respect are the recent studies by Funk- 
enstein, King, and Drollette (1957) which 
attempt to relate psychological stress to phys- 
ical and chemical changes. Since these studies 
are controlled experiments, the direction of 
relationships between the stress situations and 
their consequences is clear. The difficulty is 
that the relationships are not between enduring 
social situations, stress, and mental disorder. 
The laboratory situations for human subjects 
must, of necessity, be relatively benign and 
short-lived. They cannot have, as their aim, 
the creation of mental disorder. These con- 
trived situations add to a certain artificiality a 
large degree of impotence compared to stress 
situations manufactured in social and cultural, 
environments outside the laboratory. Further 
the laboratory stress conditions tend to lack 
even theoretical reference to the very social 
conditions designated as the major sources of 
stress in the community studies noted above. 

What is needed on an empirical level, then, 
is sensitivity to stress situations which, though 
created by social and cultural factors outside 
the researcher’s control, nevertheless provide 
close approximations to experimental control 


—— ow 
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of meaningful conditions. Such sensitivity, in 
turn, depends in large part on the existence of 
theoretical perspective in terms of which these 
strong and enduring stress situations can be 
assessed. This paper is an attempt to contribute 
to such perspective. 


THEORETICAL FRAMEWORK 


On the basis of their studies of anxiety and 
stress in paratroop training, Basowitz, Kor- 
chin, Persky, and Grinker (1955, pp. 287-289) 
observe that there are “... difficulties in- 
herent in the prediction of behavior from 
information about the person and his past 
experience alone without knowledge of the 
new setting, its social characteristics, and its 
meaning to the individual.” They conclude 
that, “In future research. ..we should not 
consider stress as imposed upon the organism, 
but as its response to internal or external proc- 
esses which reach those threshold levels that 
strain its physiological and psychological 
integrative capacities close to or beyond their 
limits.” Thus viewed, what kind of theoretical 
formulation can be made of the individual’s 
stress response in social and psychological 
terms? 

Clues are available in the work of Hans 
Selye (cf. 1955, 1956). Selye’s investigations 
have been conducted largely with animals in 
laboratory settings. His focus has been on 
physical and chemical changes as manifesta- 
tions of stress observable in these settings. 
Nevertheless, elements of his general position 
provide a useful point of departure for the 
present endeavor. 

Selye’s position is that stress is basic to all 
adaptive reactions. It is thus the common 
denominator of diverse reactions to a wide 
variety of agents or “stressors” (the term 
given by Selye to anything that produces 
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stress). Moreover, the same stressor that has 
adaptive consequences in one case may have 
maladaptive results in another. It is this 
general view of stress that underlies the precise 
and elegant formulations based in his re- 
searches. 

It is possible to abstract from Selye’s for- 
mulations four main elements involved in 
stress situations: an antecedent stressor (often 
a toxic agent or electric shock in Selye’s 
work); conditioning or mediating factors such 
as climate or diet which increase or decrease 
the impact of the stressor; the General Adapta- 
tion Syndrome of nonspecific physical and 
chemical changes, indicating the intervening 
state of stress in an organism over time; and 
consequent adaptive or, in cases where there is 
a “derailment” of the mechanism of the 
General Adaptation Syndrome, maladaptive 
responses, usually in the form of what Selye 
calls “diseases of adaptation.” It is possible to 
strip these four kinds of elements of particular 
examples and arrive at the general paradigm 
of the individual organism’s stress response 
shown in Figure 1 below. This general para- 


digm, in turn, can be developed in social 
psychological terms. Consider each of its 
elements. 
Stressor 


Since a stressor is any agent that produces 
stress, the number of stressors is virtually 
limitless. The concern here is with social 
stressors—an economic disaster or boom, a 
clash with the enemy, forced relocation, dis- 
ruptive contact with representatives of a 
different culture—these are but a few of the 
more dramatic and far-reaching social stressors. 
Reaching fewer people at any given time, but 
no less important to the individual person, 
would be the impact of a death, divorce, or 


Stressor = 
8 Adaptive 
57 ee Response 

t 

t 

o 

STRESS <f-------7 3 

n 

3 

a 


Mediating Factors 


er Maladaptive 


Response 


Fic. 1. General paradigm of the stress response. 
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illness on the members of a family—again, only 
a few of the possible examples. 


Mediating Factors in Relation to a Concept of 
Constraint 


To avoid confusion with the conditioning 
concepts of learning theory, the term “medi- 
ating factors” has been substituted for Selye’s 
“conditioning factors.” Selye discusses two 
different types of such factors: “internal” 
(i.e., those which have become part of the body 
through heredity, past experience, etc.) and 
“external” (i.e., those factors acting from 
without, such as diet and climate), which 
influence the response to a physical stressor 
that is simultaneously acting on the organism. 
The social and psychological counterparts to be 
advanced here will be considered under two 
general headings: first, those mediating factors 
which determine the amount of external con- 
straint associated with the stressor; second, 
those that determine the amount of inner 
constraint associated with the stressor for the 
individual in the stress situation. 

Constraint is here considered to be the 
psychological force exerted consciously or un- 
consciously by the individual to inhibit a 
course of action called for by events. These 
events originate in the environment external 
to the individual or in his own inner drives, 
desires, and internalized rules or prescriptions, 
Inner and external constraint are defined in 
terms of relations between inner and outer 
events. When force is exerted by the individual 
to suppress action demanded by inner events 
in favor of action demanded by outer events, 
the individual may be said to experience 
external constraint. When, in contrast, he at- 
tempts to inhibit action demanded by outer 
events in favor of action demanded by inner 
events, the individual may be said to experi- 
ence inner constraint. In general, the stronger 
the external events, the stronger the force to 
suppress action demanded by internal events 
in opposition to them, and hence the stronger 
the external constraint experienced by the 
individual. Conversely, the stronger the inner 
events, the stronger the force to suppress ac- 
tion demanded by external events in opposition 
to them, and hence the stronger the inner con- 
straint experienced by the individual, What, 
then, are the mediating factors which, in inter- 
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action with the stressor, determine the amount | 
and nature of constraint experienced by the 
individual in the stress situation? i 

Sources of external constraint. The phenom- 
ena of suicide andhomicidehavebeenstudiedas 
reactions to a most important social stressor, 
the business cycle. Henry and Short (1954) 
report some interesting findings on relations. 
between the business cycle and these two 
extreme actions, These authors show that 
suicide rates vary inversely and homicide 
rates vary directly with the degree of what they 
term “external restraint” associated with this 
stressor. 

Henry and Short describe two major sources 
of external restraint. One is the status of t 
individual as based primarily on his ethnic 
background and occupation; the other is the 
strength of his “relational system,” defined 


This is a most interesting formulation. It 
appears, however, to mix two quite intricate. 
things, external constraint produced by events” 
related to low status on the one hand, and 
social support associated mainly with a strong 
relational system (e.g., family ties) on the 
other. Both are grouped by Henry and Short 
under the heading “external restraint.” 

That low status is probably a source of 
strong external constraint seems apparent. It 
is, for example, the person who is low in 
Status who is first to be fired in a time of eco- 
nomic decline, thus pressing the previously low 
ceiling on his aspirations still lower. Whether 
the individual experiences external constraint 
or social support for his inner goals from his 
relational system, however, depends upon a 
number of factors not considered by Henry 
and Short. 

It seems likely that external constraint in- 
creases with the “relative deprivation” ex- 
perienced by the person as he compares him- 
self with others in the situation (Merton & 
Kitt, 1950; Stouffer, Suchman, DeVinney, 
Star, & Williams, 1949); some of these others 
whose circumstances the individual compares 
as favorable to his own may be important 
members of his relational system—his friends, 
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for example. In addition, to the extent that the 
person’s salient inner goals are dissimilar to 
those of others in his relational system who are 
involved in the situation, he is also likely to 
experience external constraint. It is only when 
the inner aims of the others are similar to his 
own that the individual is likely to experience 
social support as opposed to external constraint 
from his relational system. 

The position adopted for the present for- 
mulation, then, is that external constraint 
varies inversely with status as defined by the 
wider society, and directly with the relative 
deprivation experienced by the individual as 
he compares his position to that of others in 
the situation. In addition, external constraint 
varies directly with the dissimilarity between 
the drives, desires, and internalized prescrip- 
tions of the individual and those of the other 
individuals in the stress situation. 

Not only the similarity or dissimilarity of 
the inner goals of others in such activities are 
important. The aggregate nature of the rules or 
norms governing various activities in his 
relational system should themselves be im- 
portant as they interpret the stressor to him; 
for example, if there were a high degree of 
anomie or normlessness in these activities, one 
would expect that a confused and ambiguous 
interpretation of the stressor would be trans- 
mitted to the individual by others in his rela- 
tional system. Thus the quality of external 
constraint should vary with the state of the 
norms in the different groups and activities 
comprising his relational system (Dohrenwend, 
1959). 

Some examples from a study now in progress 
may illustrate these points. The subjects of the 
research are drawn from approximately 1,800 
New York City families who found themselves 
in the path of improvements being made in the 
approach to a major bridge. These families 
lived in a narrow strip of eight square blocks 
bounded by rivers on the west and east. 
Identical sections of blocks paralleling this 
telocation section to the north and south pro- 
vide controls. The stressors of the study, then, 
consist of various stages of forced relocation of 
the families in the path of the bridge. 

These families come from varying economic 
and ethnic backgrounds. The western portions 
of the relocation and control sections have had 
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a plurality of middle class Jewish residents. 
The central portions, in contrast, tend to be 
Trish descent and lower middle class. On the 
east, the plurality is Puerto Rican and eco- 
nomic deterioration is much in evidence. The 
families to be relocated found themselves 
face-to-face with the need to move out of rent- 
controlled apartments into a scarce housing 
market since no alternative public or private 
housing was to be built for them. Moreover, 
the situation within the urban area is such that 
a move south and east of the relocation section 
means a move to blocks that are rapidly de- 
teriorating economically. These blocks have 
also changed in composition from Jewish and 
Irish descent to Negro and Puerto Rican 
families in recent years. The threat of ethnic 
marginality thus faced a Jewish or Irish family 
considering a move in this direction. One 
respondent put the situation this way: “The 
colored and Puerto Ricans are taking over 
.... My only objection—and I’m not preju- 
diced—is that the Puerto Ricans refuse to 
speak English.” Another less circumspect 
informant overstated the change this way: 
“...once you go down from——— -Street 
and east, your life isn’t worth a nickel at 11 
o’clock at night.’ 

There are a number of factors in this situa- 
tion which, as mediators of forced relocation, 
are probable sources of external constraint for 
the persons involved. Two factors defining the 
social status of the individual, his class situa- 
tion and his ethnic background, are of major 
importance. In a scarce housing market, the 
individual’s opportunities are greatly limited 
by the rent he can afford to pay. The organiza- 
tion charged with improving the bridge ap- 
proach set up an office to help relocate the 
displaced families. By providing bonuses to 
landlords for renting to such families, this 
office secured listings of vacant apartments to 
facilitate relocation. In these circumstances, 
what an individual is offered depends in large 
part on his economic resources. The lower his 
income, the greater the external constraint 


3 These quotations are taken from exploratory 
interviews with members of families in the relocation 
section. About 20 respondents were interviewed. I 
would like to express my appreciation to Claire Jacob- 
son and Warren Swidler who conducted the large 
majority of these interviews, including those quoted 
above. 
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likely to stem from his experiences with the 
personnel of the Relocation Office and the real 
estate managers as gatekeepers of the scarce 
housing market in this situation. 

In addition to his class situation, his ethnic 
background is a key factor. There are dis- 
criminatory practices in the real estate market; 
a favored position is granted to some ethnic 
groups and an unfavorable position imposed on 
others. The Relocation Office personnel, as 
intermediaries between the real estate man- 
agers and the families to be relocated, reflected 
these practices directly by withholding some 
apartments from some categories of tenants at 
the behest of the landlord, or indirectly by 
inability to change the practices of landlords 
who turned down tenants in these categories. 
Thus, for example, the darker the prospective 
tenant’s skin or the more pronounced his 
accent, especially if Spanish, the greater the 
external constraint he is likely to encounter 
during relocation. 

A person participates, in greater or lesser 
degree, with other individuals in a wide variety 
of activities in such spheres as kinship, religion, 
education, economic exchange, politics, and 
recreation. In cases where influential persons 
in one or more of these activities intervened 
for him with the Relocation Office or with real 
estate agents, these interventions may have 
modified—either increased or decreased as the 
case may be—the external constraint. An 
active member of a church, for example, might 
have expected his clergyman to speak on his 
behalf at the Relocation Office; a small busi- 
nessman might have had expectations of the 
local Chamber of Commerce; a tenant might 
have placed his faith in collective action with 
other tenants, sponsored perhaps by local 
political leaders; a wife might have expected 
her husband to produce a solution; and so on. 
In general, to the extent that the inner goals of 
others with whom he stands in social relation- 
ship are dissimilar to his own, such expecta- 
tions tend not to be fulfilled; external con- 
straint increases for the individual. 

At each stage of relocation, it is possible for 
the person to compare his own situation with 
that of others experiencing the same process. 
At the initial stage when relocation is first 
announced, for example, it is possible for him to 
compare what is likely to be available to him 
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with what is likely to be available to 
neighbors; later, he is able to compare what 
the Relocation Office offers his neighbors to 
what it offers him; he is able to compare the 
duration of his stay in the relocation section to 
theirs; and, in some cases at least, he could, 
still later, compare the quality of their new 
homes and neighborhoods with his own new 
location. Further, it is possible for him to 
compare his economic circumstances prior to 
relocation with his circumstances after; his 
old home with his new one; the quality of his 
old neighborhood with the quality of his new 
one, in terms of both economic condition and 
ethnic composition. In short, the person can 
judge whether his status has increased or 
decreased; whether his web of social relations 
has been strengthened or torn asunder. In 
general, the greater the relative deprivation he 
experiences as he makes these comparisons, 
the greater the external constraint. 

Sources of inner constraint. Quite another, 
although highly related, set of mediating fac- 
tors are those that provide the main sources of 
inner constraint. Of central importance are the — 
person’s drives and the internalized rules that 
give these social direction. These situationally 
relevant aspects of superego and ego ideal will 
be referred to hereafter as “self-prescriptions.” 
As they are confronted by a particular stressor, 
they become sources of inner constraint. 

A number of factors should determine the 
strength of the inner constraint. Major ones 
would be the sheer number and importance of 
the individual’s self-prescriptions that are af- 
fected by the stressor; if, for example, the 
stressor threatens the self-prescriptions that 
define the person’s status aspirations as well 
as other aspects of his self-image, he is likely to 
experience greater inner constraint than if 
such aspirations are not in jeopardy. The num- 
ber and importance of others with similar self- 
prescriptions should also play a large part; if 
other such persons are in the same situation, 
interaction with them should strengthen his 
self-prescriptions and hence the inner con- 
straint he experiences. In this view, social 
support functions to increase inner constraint. 
Moreover, to the extent that a person’s self- 
Prescriptions are consistent and support each 
other, the aggregate inner constraint is likely 
to be increased. 


SOCIAL PSYCHOLOGICAL NATURE or STRESS 


p And, of great importance, the stronger per se 


the self-prescriptions governing the expression 
of the person’s drives, the stronger the inner 
constraint he is likely to experience. Thus, in 
general, the strength of self-prescriptions and 
hence inner constraint may be greater in the 
case of the individual with the stronger 
superego stemming from childhood discipline 
by withheld love and approval rather than 
physical punishment (Sears, Maccoby, & 
Levin, 1957). There is evidence that these 
differing types of childhood discipline vary 
with social class (cf. Bronfenbrenner, 1958). 
They may also vary with ethnicity. 

Consider again the person faced with forced 
relocation. Should relocation involve his status 
aspirations, his business interests, his hopes for 
the education of his children, his internalized 
obligations to relatives and friend—a large 
number of his self-prescriptions as opposed to 
very few—the inner constraint is likely to be 
greater. Thus, for example, a person who has 
business interests in the area is likely to 
experience greater inner constraint than one 
who does not, other things constant. 

In addition, the number and importance of 
others with similar self-prescriptions with whom 
the person interacts should affect his experi- 
enced inner constraint. The assumption is that 
the more such persons there are, and the more 
important they are to the individual, the 
more likely interaction with them strengthens 
his internalized aims and obligations. Thus, 
for example, if his spouse had the same neigh- 
borhood groups as points of positive and/or 
negative reference, his self-prescriptions with 
regard to these groups are likely to be strength- 
ened by his interaction with his spouse. 

It is likely that socialisation practices vary 
with both class and ethnic background. Some 
of these variations may affect the strength of 
the rules internalized by individuals—their 
self-prescriptions—and hence the inner con- 
straint experienced. Thus, for example, should 
it be found that physical punishment is re- 
ported as a more frequent socialization practice 
in lower than in middle class families of a given 
ethnic background, inner constraint may also 
be weaker for persons of the former rather than 
the latter background, other things constant. 

A note on other mediating factors. Other 
factors than those just considered undoubtedly 
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also mediate the impact of the stressor on the 
individual, Differences in intellectual abilities 
and physical health, for example, have to be 
considered as factors affecting the constraint 
experienced. So also do differences in age and 
sex. The emphasis above has been on factors 
with which the theory has been able to deal 
somewhat systematically at its present stage of 
development. Their formulation is not meant 
to indicate closure that would be, at this point, 


premature. 


Adaptation Syndrome 

The stressor, in association with mediating 
factors, then, produces inner and external 
constraint. The person’s subsequent attempts 
to adapt to constraint are represented by what 
is termed his adaptation syndrome. A central 
assumption of the theory is that the adaptation 
syndrome represents efforts by the individual 
to reduce constraint. 

A question arises as to the manner and 
degree to which something like the defense 
mechanisms postulated in psychoanalytic 
theory (cf. Freud, 1946) underlie adaptation 
syndromes. For the time being, this relevant 
and important question will be left open. For 
the present, three broad, conceptually distinct 
facets of social psychological adaptation to 
constraint may be considered: the affective, 
the conative, and the cognitive, Each will be 
developed in turn. 

Affective facets. Basic affective aspects of 
adaptation to constraint are fear and anger, 
and that complex combination of the two, 
anxiety. For human organisms in social situa- 
tions, these can become highly elaborated. Of 
special interest in this regard is the “frustra- 
tion-aggression hypothesis” investigatedat Yale 
(Dollard et al., 1939), and more recent studies 


stances—“anger-in,” that is, anger directed by 


research by and his 
associates (1957), who suggest that transient 
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physical and chemical correlates of anger-in for 
healthy subjects resemble conditions found to 
be chronic in depressive psychotics. In con- 
trast, the correlates of anger-out in healthy 
subjects tend to be similar to chronic physical 
and chemical states in paranoid schizophrenics. 

Conative facets. Adaptation to constraint 
involves changes in the person’s social activity. 
His participation in any given social interaction 
in which he has been engaged may increase or 
decrease; some activities may be abandoned; 
new activities may be undertaken. Old or new 
activities may be undertaken with other 
persons the individual has known in the past, or 
with others with whom he has had no prior 
acquaintance. The activity may be governed 
primarily by affect or by social rules; the latter 
may be either his own internalized rules, or 
rules of others that are incongruent with them. 

Cognitive facets. Changes should also occur 
in the orientation of the individual as he adapts 
to constraint. Certain of his self-prescriptions, 
for example, may take on heightened impor- 
tance and strength; in short, they may be 
reaffirmed. In contrast, the self-prescriptions 
may become less important, weaker—in which 
case he may be described as becoming alienated 
from them. Or, there may be a change in these 
self-prescriptions amounting to reorientation 
(Dohrenwend & Smith, 1957). 

The problem of baseline combinations of these 
affective, conative, and cognitive facets of adapta- 
tion. There is no ready answer to the question of 
how these various affective, conative, and 
cognitive facets of adaptation combine to form 
one or more baseline syndromes of social 
psychological adaptation, syndromes that 
serve as relatively neutral standards against 
which adaptive vs. maladaptive departures can 
be assessed. The nature of the problem can be 
illustrated by considering clinical descriptions 
of reactions to combat, perhaps the best 
documented of stress situations in psycho- 
logical terms. 

“Normal battle reaction” is described by 
Noyes and Kolb (1958, pp. 532-534) in the 
following way: There is complete and con- 
tinuous alertness, an alertness associated with 
fear, even terror, and often with anger in the 
form of irritability and resentment; these can 
bring the person to the point of exhaustion; 
commonly accompanying these reactions are a 
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number of psychosomatic disturban 
excessive perspiration, nausea, difficulty sleep- 
ing, muscular weakness, oversensitivity to 
threatening noises are a few examples. 

Noyes and Kolb (1958) suggest that these 
reactions can be considered a baseline against 
which pathological (maladaptive) responses 
should be evaluated. This baseline is, however, 
a relative one. Such baseline reactions, these 
authors state, “... would be considered ab- 
normal in a civilian setting yet are not so in- 
capacitating as to demand removal from com- 
bat. Unless extreme, such reactions must under 
the circumstances be considered within the 
range of normal” (p. 533, italics added), 
Implicit is the assumption that the reactio 
have not been combined with changes 
orientation or social interaction that would se- 
riously undermine “morale,” defined as “.,. 
those attitudes, feelings and beliefs which 
promote participation in a united effort” (p. 
529). 


The relative nature of the “normal battle 
reaction” as a baseline for assessing whether 
departures are maladaptive underlines a more 
general question, the question of whether it 
will be possible to speak of a general adaptation 
syndrome in meaningful social psychological 
terms. In large part, of course, this is a problem 
for research. For the present, however, it seems _ 
reasonable to take the position that adaptation 
may have a number of baseline social psycho- 
logical manifestations which form syndromes 
that are distinct and different from each other 
both for different persons in the same general 
situation, and for the same person in different 
situations, 


Adaptive and Maladaptive Responses 


A number of extreme responses to combat — 
have come to be taken as indications of “‘com- 
bat neurosis” (Noyes & Kolb, 1958, pp. 
533-535). One form of this neurosis is termed 
an anxiety state; in this state, fear becomes 
panic which is manifested in flight, “freezing,” 
or “primitive protective withdrawal, with 
stupor, catatonia, and mutism.” Another form 
of combat neurosis consists in conversion re- 
actions, with, for example, sensory losses, 
aphonia, deafness, partial or total blindness — 
and occasionally, amnesia. Still a third form of 
this neurosis is depression characterized by 
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extreme anger-in, evidenced by ideas of failure, 
guilt and self-condemnation behind a façade of 
“ rigid, mask-like face, staring eyes that 
wink infrequently, . . . paucity of movement 
....” These extremes of fear, anger-in, 
weakening of self-prescriptions, and withdrawal 
from social activities, necessitate removal of 
the individual from combat. They are hence 
maladaptive within a military frame of 
reference, and would be considered maladap- 
tive in most other contexts as well. In short, 
there is strong indication that the mechanisms 
underlying adaptation have become, in Selye’s 
term, “derailed,” leading to maladaptive stress 
responses. 

In contrast are the adaptive responses to 
combat. Within a military frame of reference, 
anger-out, if directed at the enemy, is con- 
sidered an adaptive response; so also is re- 
affirmation of collectivistically oriented self- 
prescriptions which dictate increased loyalty, 
obligation to other soldiers, and greater par- 
ticipation in the group effort. Such responses 
add up to increased military effectiveness and 
heightened morale. Yet such responses would 
by no means be considered adaptive in all 
other circumstances—situations, for example, 
in which killing as many members of the out- 
group as possible is not a social goal of the 
action. And the tremendous range of 
responses between the extremes of incapacita- 
ting combat neurosis as against heightened 
morale further compound the dilemma of 
ascertaining the grounds on which, and the 
degree to which, a stress response is adaptive or 
maladaptive. 

Hollingshead and Redlich (1958, p. 11, 179) 
state of their New Haven research that, in 
dealing only with psychiatrically treated 
mental illness, they are assuming a social 
definition of mental disorder. For the present, it 
may make most sense to broaden and extend 
this idea of a social definition of adaptive and 
maladaptive behavior by widening the range 
of sources of such definitions. Thus, for exam- 
ple, several agencies of law, health, and welfare 
in our society have, as their primary purpose, 
defining whether or not the responses of in- 
dividuals who come to their attention are 
adaptive or maladaptive over a wide range of 
circumstances. Where a person is of school 
age, educational agencies bring their perspec- 
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tive to bear. In addition, a person’s primary 
groups pass judgment on whether his behavior 
is adaptive or maladaptive. Finally, the in- 
dividual himself, implicitly or explicitly, takes 
stock of his own behavior. In each case, the 
frame of reference used for such assessments 
consists of the rules or standards by which 
the agency, group, or individual operates. 

Note that, in this formulation, it is by no 
means necessary for the various definitions of 
adaptive vs. maladaptive responses to agree. 
A person, for example, does not have to con- 
sider himself as suffering in order to be diag- 
nosed as mentally ill. Moreover, different 
responses by the same individual may be de- 
fined as adaptive in one case, and maladaptive 
in another, by the same source of assessment. 
Thus a theft may be thought adaptive by 
the thief’s primary group, who judge as mala- 
daptive only his carelessness at being caught. 
In this approach, the focus is on mental dis- 
order within a wider framework of social de- 
viance. 


SOME PROPOSITIONS 


The theory as thus far developed can be 
summarized by a number of assumptions 
about the general nature of relationships 
among the various elements of the stress re- 
sponse. The assumptions refer, of course, to 
classes of stressors that are primarily social 
in nature and do not involve direct and im- 
mediate physical harm to the individual. 
Stated in propositional form, they are as fol- 
lows: 

Proposition 1. Stress is a state intervening 
between antecedent constraint and consequent 
efforts to reduce constraint. 

Proposition 2. Stress underlies both adaptive 
and maladaptive behavior. 

Proposition 3. The probability of maladap- 
tive responses varies directly with the inten- 
sity and duration of stress. i 

Proposition 4. The intensity of stress varies 
directly with the strength of external con- 
straint associated with the stressor. 

Proposition 5. The intensity of stress varies 
directly with the strength of inner constraint 
associated with the stressor. 

Proposition 6. The adaptation syndrome 
represents efforts to reduce constraint. 

Proposition 7. The nature of the adaptation 
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syndrome varies with: the relative strengths 
of external constraint vs. inner constraint 
associated with the stressor, and the nature 
of the sources of inner and external constraint. 

Proposition 8. The nature of the adaptive 
and/or maladaptive responses varies with 
the nature of the adaptation syndrome. 
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INDIVIDUAL DIFFERENCES IN REACTION TO FAILURE: 
MODE OF COPING WITH ANXIETY AND INTERFERENCE PRONENESS! 


ALFRED 


LOWE? 


Boston University 


Te is an attempt to investigate and 
clarify some of the sources of individual 
differences in reaction to stress as mani- 
fested in ongoing performance on a cognitive 
task. This problem is approached through a 
conceptualization of stress (or frustration) 
suggested by Lazarus, Deese, and Osler 
(1952), Sargent (1948), and others, which 
places the arousal of affect or anxiety at the 
center of the psychological experience resulting 
from interference with the pursuit of a goal. 
The behavioral consequences in terms of on- 
going performance are understood to be deter- 
mined to a large degree by the effects of the 
individual’s efforts to cope with the aroused 
affect, and by his ability to withstand and 
overcome its interference in cognitive func- 
tioning (Lowe, 1959). 

Two contrasting modes of coping with anx- 
iety aroused by frustration were selected for 
study: the “goal oriented” mode, in which 
continued pursuit of the blocked goal is the 
path chosen for the reduction of affect; and 
the “ego oriented” mode, in which withdrawal 
from the stressful situation is instrumental 
towards the reduction of tension. In terms of 
behavior, the former mode may be expected 
to lead to an increase of efforts invested in the 
task and the latter to a relative decline. 

These reactions are based on concepts de- 
scribed by Rosenzweig (1943) and Bloom 
(1952). While it is understood that there are 
other modes of tension reduction and that 
people are able to use more than one mode, 
it is assumed (and there is much support in 
the literature for this assumption) that in- 
dividuals develop preferred modes of coping 

1 This article is based on a dissertation submitted 
to the Graduate School, Boston University, in ji 
fulfillment of the requirements for the degree of Doctor 
of Philosophy. 

2 The author wishes to express his appreciation to 
Austin W. Berkeley, A. W. Hire, and W. Weiss for 
their assistance and advice, and to John Arsenian for 


his aid and encouragement. zi 
At time of writing with the Boston State Hospital. 
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with anxiety and that these become relatively 
stable personality characteristics. 

In addition to this indirect effect on be- 
havior via the mode of coping, anxiety also 
affects ongoing performance directly, as an 
interfering, distracting stimulus. The ability 
to resist and overcome such interference 
varies between individuals, and has been con- 
ceptualized here as interference proneness. 

This study then deals with two independent 
variables: mode of coping with anxiety, and 
interference proneness. Each has its main 
impact on a specific facet of cognitive func- 
tioning: the former operates essentially 
through shifts in motivation, and therefore 
its behavioral manifestation must be sought 
in the consequences of differential energy 
output. The latter, as other kinds of distrac- 
tion, may be expected to affect mainly the 
accuracy of performance. 

The general proposition can be stated as 
follows: The effect of stress on ongoing per- 
formance is to an important degree determined 
by the individual’s mode of coping with the 
anxiety aroused and by his ability to resist 
or overcome its interference in cognitive func- 
tioning. 

The following predictions were derived from 
this statement and the preceding theoretical 
considerations: (a) Subjects using the goal 
oriented mode of coping with anxiety show 
larger output on an output task under 
stress than prior to the induction of stress, 
whereas subjects using the ego oriented mode 
of coping with anxiety show no such increase 
in output. (b) Subjects classified as highly 
interference prone make more errors on a 
cognitive task under stress than prior to its 
induction, whereas subjects classified as low 
in interference proneness do not show this 
tendency. 

METHOD 


The appropriate selection of procedures for the 
induction of stress, of measures for the variables dealt 
with, and of experimental tasks relevant to these 
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variables constitutes a major problem confronting this 
kind of research. In the present study, measures were 
required to assess the subjects’ mode of coping with 
anxiety and their proneness to the interference of 
anxiety in cognitive functioning. The experimental 
task had to be sensitive to the effects of a change in 
motivation as well as of distraction. 


Induction of Stress 


Stress was induced by exposing the subjects to 
failure on a task on which they were motivated to do 
well. This task was represented as an intelligence test, 
which led to considerable involvement by the subjects, 
with many signs of anxiety and concern. In view of this 
and the fact that similar methods have often been 
used in related studies, the assumption that the subjects 
were actually frustrated by failure seems warranted. 


Mode of Coping with Anxiety 


In order to differentiate the subjects into goal 
oriented and ego oriented categories, a measure was 
needed that would reveal differences in motivation 
under frustration, Such an instrument is provided by 
the selective recall phenomenon. Previous studies have 
shown that the direction of differential recall, i.e., the 
relative preponderance of successes or failures in recall, 
are related to motivational characteristics. Bloom 
(1952) confirmed the operation of personality factors 
in differential recall by demonstrating the existence of a 
relationship between differential memory for failures 
and successes on the one hand and changes in motiva- 
tion on the other. Subjects who recalled a preponderance 
of failures tended to increase their motivation on a 
level of aspiration experiment following failure, while 
subjects recalling a preponderance of successes lowered 
their goals following failure. A questionnaire further 
confirmed this relationship. Bloom’s study amounts to a 
validation of differential recall as a measure of per- 
sonality characteristics which are closely related to 
modes of coping with anxiety as conceptualized here. 
Other studies reporting results consistent with the 
above are those of Atkinson (1953) and McClelland 
and Liberman (1949). 

Accordingly, the subjects’ preferred mode of coping 
with anxiety was inferred from their relative emphasis 
in recall on successes and failures. In order to control 
for possible significance of the size of total recall, the 
actual score used was the ratio of the number of failures 
recalled to total recall. This will be referred to as the 
failure recall ratio (FRR). Thus subjects above the 
median, or upper quartile, on this measure were oper- 
ationally defined as goal oriented, and subjects below the 
median, or lower quartile, on this ratio were opera- 
tionally defined as ego oriented. 

The experimental task which was used for the in- 
duction of failure was simultaneously used for the 
measurement of differential recall preferences, It con- 
sisted of 14 scrambled sentences: 7 could be meaning- 
fully rearranged, but the other 7 were insoluble. The 
sentences used in this experiment were taken from two 
lists of 14 sentences each used by Caron (1954) for a 
similar purpose. They consisted of approximately 20 
words each bunched in phrases of 2 or 3 words; these 
phrases were scrambled and had to be rearranged by 
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recall, each sentence was given a name by sel 
the phrase that best represented the gist of the seni 
and placing it, underlined, at its beginning. 4 
This kind of task is well suited for the purpose at 
hand. It has a certain interest for the subject, is credi e 
as a measure of intelligence, and lends itself easily to 
the manipulations it is subjected to. Levels of difficulty 
can be adjusted to the abilities of the subjects and 
call can be easily ascertained. 


Interference Proneness 


A scale composed of 19 items culled from a question- 
naire used by Waterhouse and Child (1953) in a study 
dealing with interference proneness was used for 
assessment of the subjects’ susceptibility to the in 
ference of anxiety in cognitive functioning. In order to 
maximize the concept validity of the scale, the items 
relevant to this notion were selected independently by 
four judges on the basis of the assumption subjec 
with a score above the median would be considered hig 
on interference proneness and subjects with score 
below the median would be considered low. ‘This meas- 
ure is referred to as the interference score (IS), 


Performance Measures 


The task to be set had to be sensitive to shifts ii 
motivation as well as to the effects of interfereno 
Study of the former generally involves operations that 
permit inferences as to the amount of energy € 
pended by the subject, and a common method is 
use of simple output tasks (Atkinson & Raph 
1956; Lowell, 1952). 

With regard to the second requirement, that 
task be sensitive to the effects of interference, the liter 
ture provides less clear guidance. It seems that 
effects of interference from inner sources on ongoil 
performance have never been thoroughly investi 
Rapaport (Rapaport, Gill, & Schafer, 1945) to 
on this problem when he considers that “attention 
is hampered “if the subject’s affect and anxieties are 
not well controlled and get out of balance.” It certainly 
seems likely that there are special aspects of behavior 
which are particularly vulnerable to the operation of 
interfering stimuli. While these aspects undoubtedly 
are dependent on the nature and complexity of the 
task as well as on the intensity of the interference, it 
likely that one important effect of distraction will be a 
decline in the accuracy of execution of the task, i.e., @ 
increase in errors. a 

In view of the above considerations, it was decide 
to employ output or speed of performance as a mea 
of changes in motivation, and errors as a measure 0 
interference. A suitable instrument that combined 
measures was at hand in the Minnesota Clerical 
(MCT) (Andrew & Paterson, 1946). This is a test 
speed and accuracy which involves the scanning 
pairs of numbers and of names and the detection 0 
minor differences between the members of a pair. 
half the pairs the members are identical; in the oi 
half, distributed at random, the members of the 
are different. Two scores were compiled for each subj 
an output score, consisting of the number of i 
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attempted, and an error score, consisting of the items 


on which errors were made.* 


Subjects and Procedure 


Sixty-five student nurses, required to participate, 
served as subjects. In order to rule out intelligence as a 
relevant variable, the relationship between differential 
recall and level of intellectual functioning was explored. 
It was found to be not significant. 
Each subject was seen twice; first in a group setting, 
when the questionnaire was administered, and then 
individually for approximately one hour divided in the 
following way: 
1, Rearranging the seven soluble scrambled sen- 
tences, “A” 

2. The first half of each part of the MCT 

3. Recall of sentences “A” 

4, Rearranging the seven soluble scrambled sentences, 
up” 

5. The second half of each part of the MCT 

6. Recall of sentences “B” 

The introduction to the research as well as the 
instructions to the various tests were phrased so as to 
induce high motivation in the subjects to do well. In 
order to get the subjects ego involved for the subjective 
experience of failure to come about, the tests were 
represented as intelligence tests. 

The following are the null hypotheses tested: 

1. The mean difference in output postfailure 
minus prefailure for the group of subjects above the 
median on the FRR is equal to that of the subjects 
below the median on this measure, i.e., there is no 
interaction between experimental conditions and 
recall. ‘The same relationship holds for the extreme 
groups defined by the upper and lower quartiles. 

2, The mean difference in errors postfailure minus 
prefailure for the group of subjects above the median 
on the IS is equal to that of the subjects below the 
median on this measure, i.e., there is no interaction 
between experimental conditions and interference 
proneness. The same relationship holds for the 
extreme groups defined by the upper and lower 
quartiles. 

The predictions mentioned earlier are the alternatives 
to these null hypotheses. 


RESULTS 


_For the purposes of analysis, subjects were 
divided into high and low groups on the basis 
of their FRR, and these groups were further 
subdivided into high and low groups on the 
basis of their IS. Four experimental groups 
were thus differentiated: 

1. High FRR-High IS 

2. High FRR-Low IS 

3 Correlations among the principal measures were 
not significant. For speed and accuracy in the MCT, 
the r was .11. For the indices of interference proneness 
and mode of coping with stress, the correlation was 5. 
Tn the absence of evidence that these scores entail any 
serious overlap, it seems quite legitimate to assume 
that the processes as defined are relatively independent. 


305 


3, Low FRR-High IS 
4. Low FRR-Low IS 


To test the different hypotheses, the high 
and low groups were alternately defined by 
the median of the respective distributions and 
by the upper and lower quartiles. 

To test the significance of the relationships 
obtained, the output scores and the error 
scores were subjected to a complete analysis 
of variance in a three-way factorial design 
which took into account the correlation aris- 
ing out of repeated measurement of the sub- 
jects before and after failure. 

The results of the experiment generally, 
with one exception, support the predictions 
made and permit rejection of the null hypothe- 
ses, 

Table 1 shows that the subjects high on FRR 
have significantly larger output postfailure, 
while there is no evidence of difference in the 
output of subjects low on FRR. The difference 
between the postminus-prefailure output 
differentials of the two groups is highly signifi- 
cant. That for the high FRR group is more 
than twice that of the low FRR group. These 
relations obtain both when the median and 
when the extreme quartiles are taken to define 
the groups. 

Table 2 summarizes the analysis of variance 
of the mean output scores pre-and postfailure 
of the four experimental groups when defined 
by the medians of the distributions. The fifth 
row of this table shows the significance of 
the variance contributed by the interaction 
of imental conditions with recall charac- 
teristics to the total variance of output scores. 


TABLE 1 


Mean OUTPUT Scores OF SUBJECTS ABOVE AND BELOW 
MepIAN FRR VALUE PRE- AND POSTFAILURE, AND ~ 
MEAN DIFFERENCES BETWEEN EXPERIMENTAL 
CONDITIONS 


DMM 


Failure Recall Prefailure Postfailure Spee 

Above Median 139.44 150.55 11.11" © 

Bı P Mian 128.30 133.05 4.75** 
hove” minus 6.36 
“Below”) 
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TABLE 2 
ANALYSIS OF VARIANCE OF OUTPUT SCORES OBTAINED 
PRE- AND POSTFAILURE BY Four GROUPS oF 
SUBJECTS CLASSIFIED BY THE MEDIAN FRR 
AND IS VALUES 
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TABLE 4 
ANALYSIS OF VARIANCE OF ERROR SCORES OBTAINED 
PRE- AND POSTFAILURE BY THE EXTREME IS GROUPS 
(Upper and Lower Quartiles), SUBDIVDED AT 
THE MEDIAN FRR VALUE 


Source SS df F Source SS af F 
Recall 12.024 1 6.60" Interference 3.06 1 3.558 
Interference 3.429 1 1.88 ecall 5.10 1 5.938 
Condition 3.243 1 58.11 Condition 0.22 1 
RXI 0.948 1 IXR 2.26 1 2.62 
RXC 0.577 1 10.34 Ixc 1.54 1 6.418 
IXC 0.156 1 2.79 RXC 0.46 1 1.91 
RXIXC 0.030 1 IXRXC 0.04 1 
Total Among 20.347 7 Total Among 12.60 7 
Subject 111,062 61 Subject 24.15 28 
SXC 3.404 61 sxc 6.74 28 
Total Within 114.466 122 Total Within 30.89 56 
Total 134.813 129 Total 43.49 63 


Note.—The analysis is based on the square roots of the raw 
scores. 

* F.95 = 4,40. 

b F99 = 7.08, 


The same probability of such interaction oc- 
curring, less than .01, attaches to the extreme 
groups. 

With regard to the interference hypothesis, 
no significant differences were found between 
experimental conditions or between groups 
when they were defined by the median scores 
on the IS. As Table 3 shows, when the analysis 
of the data was confined to the extreme groups 
(those above the upper and below the lower 
quartiles on the IS), it was found that the 
high interference prone subjects significantly 
increased the number of errors after failure 
as compared to their performance before fail- 
ure, whereas the low interference prone sub- 
jects actually reduced the number of errors 
from prefailure to postfailure. 

Table 4 presents the summary of the analy- 
sis of variance of the error scores obtained 
pre- and postfailure by the extreme IS groups 
subdivided by the subjects’ standing with 
regard to the median of the FRR. The fifth 


TABLE 3 
MEAN ERROR Scores or SUBJECTS ABOVE THE UPPER 
AND BELOW THE IS QUARTILES PRE- AND 
POSTFAILURE, AND THE MEAN DIFFERENCES 
BETWEEN EXPERIMENTAL CONDITIONS 


1s iie aiee OEO 

Upper Quartile (n = 4.81 6.12 1.31 

Lower Quartile 4.25 2.81 —1.4 

(““Upper” minus 2.75 
“Lower”) 


Note.—The analysis is based on the Square roots of the raw 
scores plus .5 (4/x + .5). 

a F95 = 4.2, 
row gives the variance due to the interaction 
of experimental conditions with the interfer- 
ence variable. The probability of such an 
interaction occurring is less than .02. 

The null hypotheses stipulating no interac- 
tion between (a) experimental conditions and 
modes of coping with anxiety and (b) experi- 
mental conditions and interference proneness 
could therefore be rejected with confidence. 


DISCUSSION 


The general proposition advanced earlier 
finds much support in the results obtained. 
Subjects classified as goal oriented in coping 
with anxiety increased output significantly 
following failure, whereas the subjects cate- 
gorized as ego oriented showed no significant 
increase. Similarly the group classified as 
highly prone to the interference of anxiety 
in cognitive functioning registered an increase 
in the number of errors after failure, while 
the least interference prone group actually 
showed a decrease. 

The finding that the predicted relationships 
emerged, on the whole—particularly where 
the interference variable was concerned—more 
clearly when the extreme groups alone were 
considered, suggests that other, so far obscure, 
factors enter into the situation, which the 
tendencies postulated here are able to over- 
come only when they are at their strongest. 
Subjects in the middle ranges may be more 
flexible in their reaction to failure. This would 
be consistent with Bloom’s (1952) finding 
that subjects who had a balanced recall pat- 
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REACTION TO FAILURE 


‘tern differed from the extreme groups in 


(among other ways) a more flexible approach 
to the frustrating situations facing them. 
Flexibility implies either the ability to use 
different coping mechanisms without marked 
preference for any one or inconsistency in 
their use. Such a trait could thus account for 
the greater variability of the scores obtained 
from the subjects in the middle ranges on the 
FRR. 

The fact that the intermediate ranges of 
the IS fail to differentiate groups meaningfully 
may be due to the intrusion of other factors 
not considered here, such as the intensity of 
stress. It is probable that more severe anxiety 
would evoke different reactions in the pre- 
dicted directions over the entire range. 

The dual role of anxiety in its effect on per- 
formance under stress emerges clearly in this 
study. In its role as signal, it determines the 
fate of motivation and thus indirectly of per- 
formance. In this capacity it affects primarily 
the quantitative aspects of the task. In its 
other role as the active interfering agent, it 
has a direct effect on performance and im- 
pinges mainly on its qualitative aspects. 

The results emphasize the necessity to dis- 
tinguish between the different effects of anx- 
iety in an evaluation of the consequences of 
frustration on performance. It is apparent 
from the differential vulnerability of certain 
aspects of functioning and from the immunity 
of others that these effects are specific in their 
consequences and relatively independent from 
one another. Many conflicting and inconclu- 
sive findings in this general area may be due 
to a failure to take this distinction between 
direct and indirect effects of anxiety and their 
different relevance to specific areas of func- 
tioning into consideration. 

As in many other studies mainly concerned 
with discovering changes in performance aris- 
ing out of the introduction of stress per se, a 
main effect, ascribable to experimental condi- 
tions, was found in addition to the interaction 
effect discussed earlier. This consists of an 
increase of output following failure, which, as 
Table 2 shows, is statistically highly signifi- 
cant. No such consequences of failure were 
noted on the number of errors. 

While the majority of subjects thus in- 
creased output following failure, the relative 
extent of this increase is, as the results show, 
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firmly related to differences in the postulated 
defensive processes. Also, a substantial pro- 
portion of subjects (over 18%) did not increase 
output at all under failure, and—what is more 
significant—of these, over 83% were classified 
as ego oriented on the basis of their FRR. 
Conversely, only 7% of the goal oriented 
group failed to increase output following fail- 
ure. Although this is in accord with theoreti- 
cal expectations, it still seems profitable to 
speculate as to the probable reasons for the 
actual increase of output after failure of a large 
part of the group resorting to the ego oriented 
mode of coping with anxiety. 

The relative mildness of the stress almost 
certainly is a relevant factor, and it is likely 
that with more severe anxiety, the tendency 
towards withdrawal, thought to be inherent 
in this mode of coping, would be more accen- 
tuated. Other elements in the total situation 
also act against leaving the field on the part 
of the subject and thus obscure the conse- 
quences of the defensive reaction. Prominent 
among these are the effects of the instructions 
and the wish to please the experimenter. This 
extrinsic motivation can be thought of as to 
some extent counteracting the ego oriented 
subject’s inclination towards withdrawal from 
the situation, so that investment in the task 
is maintained on a level high enough to keep 
the subject from reducing his efforts and possi- 
bly lead to some increase in output. Perhaps 
we are confronted here with some of the proc- 
esses subsumed under the concept ‘“‘compe- 
tence” by White (1959). Under more severe 
stress, some of these factors are likely to lose 
their force, and relatively neutral compliance 
can be expected to turn into withdrawal. This 
is a question that should find an answer in 
more experimentation. 

In its conceptualization, operations, and 
results, this study represents an integration 
of a number of phenomena which have been 
the objects of numerous unrelated investiga- 
tions. It forms a bridge between the large 
number of studies of changes in performance 
in response to stress, on the one hand, and of 
selective recall of successes and failures on the 
other. By demonstrating that a relation exists 
between changes in performance following 
failure and differential recall of successes and 
failures, it is possible to impart a new and 
wider significance to this phenomenon. In an 
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indirect way, through Bloom’s (1952) finding 
that selective recall is related to behavior in 
level of aspiration experiments, the present 
study is also linked to that problem area. 
Since performance following failure has been 
shown to be related to selective recall and the 
latter to level of aspiration, changes in per- 
formance under stress should be positively 
correlated with shifts in level of aspiration 
following failure. In other words, common 
personality factors can be assumed to underlie 
these reactions, which were here conceptual- 
ized as modes of coping with anxiety. 

Of theoretical as well as methodological 
significance is the demonstration that frustra- 
tion has not a unitary effect on ongoing per- 
formance which can be assessed by means of 
some composite measure of improvement or 
deterioration. Rather, different facets of behav- 
ior seem to be affected differently according 
to individual variations in the constellation 
of personality characteristics. Improvement 
or deterioration under stress is not necessarily 
uniform, and in assessing the consequences 
of the introduction of stress, it is neces- 
sary to specify which aspects of perform- 
ance are improved or impaired. This study 
has been able to discriminate between varying 
effects of failure on both quantitative aspects 
of ongoing performance (output) and qualita- 
tive aspects (errors) and to specify some of 
the sources of these differences in reaction, 


SUMMARY 


The study begins with the Proposition that 
the individual’s mode of coping with anxiety 
and his ability to resist and overcome the in- 
terference of anxiety in cognitive functioning 
are important determinants of performance 
under stress. Subjects were divided into goal 
oriented and ego oriented groups and sub- 
divided into high and low interference prone 
groups. Various aspects of performance on a 
cognitive task preceding and following failure 
in a situation in which they were motivated 
to succeed were compared. 

The results supported the hypotheses (a) 
that the goal oriented subjects increase output 
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significantly following failure, whereas the 
ego oriented subjects do not, and (b) that the 
high interference prone subjects make more 
errors after failure than before, whereas the 
low interference prone subjects do not. 

Some of the implications of these results for 
the general problem of individual differences 
in performance under stress were discussed. 
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research has been devoted to studies of 

mother-child relations. Psychoanalytic 
theory and most current theories of child de- 
velopment have stressed the crucial role 
played by early experiences in influencing 
cognitive processes and personality formation. 
The focus of the present study is on the fol- 
lowing three important phenomena that are 
believed to be influenced by mother-child 
relations: the child’s self-concept, the child’s 
concept of the mother, and the child’s food 
aversions. The purpose of the investigation 
is to study interrelations among these three 
factors in a group of normal children and in a 
group of hospitalized emotionally disturbed 
children, and to compare the two groups on 
each of these variables. 

More specifically, on the basis of psycho- 
analytically oriented theories of child de- 
velopment and previous empirical findings, 
the following hypotheses were formulated for 
examination in the present research. 

Hypothesis I: There is a positive association 
between the self-concept and the mother con- 

cept. 
_ Hypothesis II: There is a negative associa- 
tion between the self-concept and food aver- 
sions. 

_ Hypothesis III: There is a negative associa- 
tion between the mother concept and food 
aversions. 

Hypothesis IV: In comparison with dis- 
turbed children, normal children have a higher 
self-concept, higher mother concept, and 
fewer food aversions. 


METHOD 


[ recent years an increasing amount of 


Subjects 


There were two groups of subjects in this study. The 
normal group consisted of 30 boys, ranging in age from 


1 The authors wish to express their appreciation to 
Charles Hanscomb, the director of the YMCA summer 
camp in Wakefield, Rhode Island, for providing the 
Subjects for the normal group. Gratitude is also ac- 
knowledged to Maurice W. Laufer and the staff of the 
Bradley Hospital in Riverside, Rhode Island, for 


9-2, to 12-9, with a mean age of 11 years, These boys 
were attending a YMCA summer camp, and were 
mostly representative of average middle class families 
with some from the lower classes. The disturbed group 
consisted of 25 boys, ranging in age from 10-0 to 12-4, 
with a mean of 11 years. They were undergoing resi- 
dential psychiatric treatment for various emotional 
disturbances, and most of these boys came from middle 
and lower class families. 


Measures and Procedure 


The following instruments were administered in 
individual testing sessions: 

Adjective check list. An adjective check list was 
used, consisting of 20 direct questions concerning the 
self. Each adjective was preceded by the question, 
“Are you usually ?” and the child answered 
“yes” or “no.” Ten of the adjectives were positive, or 
socially desirable (e.g., happy, helpful, popular, etc.), 
and 10 were negative, or socially undesirable (e.g., 
selfish, hostile, fearful, etc.). Scoring of responses to 
this instrument consisted of adding the number of yes 
responses to the positive adjectives and the number of 
no responses to the negative adjectives, which provided 
a measure of positive self-concept. The number of 
yes responses to the negative adjectives and the number 
of no responses to the positive adjectives were summed 
to provide a measure of negative self-concept. With this 
procedure, the highest positive score could be 20 and 
the highest negative score could be 20. In order to arrive 
at a single self-concept score for each child, the negative 
score was subtracted from the positive score, and this 
score, which could be either plus or minus, was used in 
the statistical analyses. y 

Picture self-rating. Another self-rating device 
consisted of an 834” X 11” piece of cardboard on which 
were pasted 10 small circles. The child was told that 
the circles were supposed to represent the 10 boys in 
his tent (camp subjects) or the 10 boys in his living 
unit (institutionalized subjects). The circles were 
numbered from 1-10 and, in reference to 8 different 
traits, the child was asked to indicate his relative 
position in the group. In other words, for each of the 8 
traits, of which 4 were positive (e.g., popular, capable, 
etc.) and 4 were negative (e.g-, aggressive, dependent, 
etc.), the child was told that Circle Number 1 repre- 
sented the boy who was lowest on the trait and Circle 
10 the boy who was highest on the trait, and he was 
asked to indicate his relative standing in the group. 
The child’s ratings from 1-10 on each of the positive 
traits were added to provide a measure of positive 
self-concept, and the ratings on the negative traits were 


facilitating the study of the emotionally disturbed 


2 Now at Northwestern University. 
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added to provide a measure of negative self-concept. 
In order to arrive at a single over-all self-rating, the 
negative score was subtracted from the positive score, 
resulting in a final score that could range from a low of 
—36 to a high of +36, 

Food preference inventory. Food preferences were 
assessed by an extended version of Wallen’s (1945) list 
of foods. The child was asked to indicate his feeling 
toward each of the 50 foods on the list. The rı 
were in terms of the following categories: (a) like; (6) 
never tasted, but think I would like; (c) dislike; (d) 
never tasted, but think I would not like; and (e) re- 
fuse to eat. Scoring of this inventory simply entailed 
counting the number of responses in each of the five 
categories. Scores in categories a and b were summed to 
provide a total “like” score, and the sum of the re- 
maining three categories provided a total “dislike” score. 

Blacky test. The children were presented with six 
cards from the Blacky test developed by Blum (1950). 
These cards, Numbers 1, 4, 7, 8, 9, and 11, were selected 
on the basis of clinical judgment that they provided 
scenes likely to elicit material indicative of mother- 
child relations. In response to each picture, the child was 
asked to answer a series of questions from the Blacky 
test inquiry. The answers were scored to provide meas- 
ures of the following characteristics in the child: ag- 
gression, anxiety, dependency, pessimism, security, 
independence, and optimism. For each stimulus card, 
each variable was scored along a five-point scale, run- 
ning from a slight implication of the variable to very 
strong indication of the variable. The total score for 
the variables of security, independence, and optimism 
was computed to provide a measure of positive self- 
concept, and the total score for aggression, anxiety, 
dependency, and pessimism served as a measure of 
negative self-concept. An over-all self-concept score was 
obtained by subtracting the negative score from the 
positive score. The answers to the inquiry were also 
scored to provide indices of the following characteristics 
in the mother-figure: aggression, anxiety, dominance, 
rejection, competence, nurturance, and permissiveness, 
For each stimulus card, each of these variables was 
scored on a five-point continuum. A positive mother 
concept score was obtained by adding the scores on 
competence, nurturance, and permissiveness, and a 
negative mother concept score resulted from summation 
of the scores on aggression, anxiety, dominance, and 
rejection. An over-all mother concept score was ob- 
tained by subtracting the negative score from the posi- 
tive score. 

Story completion technique. The Story Completion test 
was modified from a procedure employed previously by 
Barndt and Johnson (1955), and Davids and Parenti 
(1958). Briefly, the procedure consists of presenting the 
child with a sentence describing some event and asking 
him to make up a story using the sentence as a stimulus, 
In the present study, the following five sentences were 
used to elicit story completions: a lady and her little 
boy went to visit the zoo one day; a young boy brought 
a new friend home from school one day to meet his 
mother; when a certain boy woke up, he knew that 
something unusual was going to happen today; while a 
boy and his sister were playing, they knocked over the 
lamp that their father had given their mother for her 
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birthday; a boy wanted to go out and work in the garden 
with his daddy, but his mommy wouldn’t let him go. 
This procedure is designed to elicit fantasy material 
similar to that obtained from TAT stories, but enabled 
us to present stimuli that seemed likely to call forth 
fantasies concerning mother-child relations. The 
thematic content of the stories was scored for the same 
variables, and employing the same scoring scheme, as 
with the Blacky test. Thus, an over-all self-concept 
score and a mother concept score were obtained from 
this technique. 

Reliability of scoring. Two raters independently 
scored responses to the projective techniques. The 
interrater agreement, as measured by product-moment 
correlations, was .95 for the self-concept scores and .75 
for the mother concept scores derived from the story 
completion technique, and .90 and .92 for the self- 
concept and mother concept scores derived from the 
Blacky test. These cofficients, which are significant well 
beyond the .01 level, indicate a quite satisfactory degree 
of concordance in the scoring of this projective material, 

Statistical treatment of the data. Within the two 
groups, correlations among the self-concept, mother 
concept, and food aversions scores were computed by | 
the product-moment method. Differences between the 
two groups on each of these three variables were 
evaluated by Bartlett’s test for homogeneity of variance | 
and ¢ tests between means. Since the nature of associa- 
tions and direction of differences were predicted in 
advance, one-tailed tests of significance were employed. 


RESULTS 

Findings within the Normal Group 

As shown in Table 1, the results derived 
from both projective tests indicate significant 
positive association between measures of the 
self-concept and the mother concept. Further- 
more, the findings presented in Table 2 show a 
consistent tendency for a low self-concept to 
be associated with a high number of food dis- 
likes. Of the four measures of self-concept, 
the one derived from the adjective check list 
shows a significant negative correlation with 
food aversions. Moreover, there are two signifi- 
cant correlations between measures of low 
self-concept and relatively high number of 
foods that have never been tasted but are 


TABLE 1 


Propuct-MoMENT CORRELATIONS BETWEEN SELF- 
CONCEPT AND MOTHER Concept 


Measures Normal Gap oe, 
Blacky test s65** .34* z 
Story completions asses 50% 


* Significant at the .05 level for a one-tailed test. 
** Significant at the .01 level for a one-tailed test. 


SELF-CONCEPT, MOTHER CONCEPT, AND FOOD AvERSIONS 


TABLE 2 


PRODUCT-MOMENT CORRELATIONS BETWEEN SELF- 
CONCEPT AND FOOD AVERSIONS IN THE NORMAL 
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TABLE 3 


PRODUCT-MOMENT CORRELATIONS BETWEEN MOTHER 
CONCEPT AND FOOD AVERSIONS IN THE NORMAL 


Group GROOP 
(N = 30) W = 30) 
Food Inventory Food Inventory 
Self-Concept Measures ais Mother Concept Measures TE Neves 
oo Tested, | Like Dislike sted Like 

i A Blacky test —.12 —.21 .55** 
Adjective check list —.41* | —.35* | .82"* leti T T 
pe ratings # -15 25 2, 19 tory completions 16 .23 17 

Aa re ES J m the 01 

Story completions —129 | —.23 | .54** Significant at the .01 level. 


* Significant at the .05 level for a one-tailed test. 
** Significant at the .01 level for a one-tailed test. 


disliked. In this regard, the magnitude of the 
correlation (—.83) between the Blacky test 
measure of self-concept and the “never tasted, 
but dislike” responses on the food aversion 
list is particularly noteworthy. Comparison 
of the self-concept indices with the number of 
foods indicated as liked also reveals quite 
striking associations. Both the adjective check 
list (objective method) and the story comple- 
tion technique (projective method) reveal 
highly significant correlations (.82 and .54) 
between a high self-concept and liking a rela- 
tively great variety of foods. The findings in 
regard to mother concept and food aversions, 
contained in Table 3, also show consistent 
trends in the predicted direction. Moreover, 
there is a significant positive correlation be- 
tween high mother concept and number of 
foods indicated as liked. Thus, the results ob- 
tained from the group of normal children are 
in keeping with the predictions formulated in 
Hypotheses I, II, and III. 


Findings within the Disturbed Group 


As shown in Table 1, with measures derived 
from both projective methods, there is signifi- 
cant positive association between the self- 
concept and mother concept, with children 
who have a relatively high self-concept tend- 
ing to also have a high mother concept. In the 
_ disturbed group, the findings shown in Table 
4 are generally consistent with the predictions. 
Low self-concept, measured by the picture 
self-rating technique, is correlated significantly 
with both a high total number of disliked 
foods and high number of foods that have 


TABLE 4 
PRODUCT-MOMENT CORRELATIONS BETWEEN SELF- 
Concert AND FOOD AVERSIONS IN THE 
DISTURBED GROUP 
(N = 25) 


Food Inventory 


Self-Concept Measures 


* Significant at the .05 level for a one-tailed test. 
** Significant at the .01 level for a one-tailed test. 


TABLE 5 


* Significant at the .05 level for a one-tailed test, 


never been tasted but are indicated as disliked. 
The findings presented in Table 5 are not con- 
sistently in accord with the predictions and, 
in general, do not support Hypothesis TI. 
However, the one statistically significant 
coefficient indicates a relationship between a 
low mother concept, defined by the story 
completion technique, and a high number of 
foods that have never been tasted but are 
disliked. Thus, the results obtained from the 
disturbed children confirm Hypothesis I, lend 
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partial support to Hypothesis I, and fail to 
confirm Hypothesis III, although they are 
somewhat equivocal in this last regard. 


Comparison of Findings in the Two Groups 


In evaluating Hypothesis IV, it is necessary 
to compare the two groups on scores indicative 
of self-concept, mother concept, and food 
aversions. First, the results presented in Table 
6 show that, on the four different measures, 
the normal children have a higher self-concept. 
Three of these differences are statistically 
significant well beyond the .01 level. On the 
picture self-rating procedure, which is the 
only one not yielding a significant difference 
between means, Barlett’s test indicates that 
the disturbed children are significantly more 
variable in the self-concept scores obtained 
by this direct method of assessment. As shown 
in Table 7, in response to both projective 
methods employed in this study, the normal 
children have a significantly higher mother 
concept than do the disturbed children. 


TABLE 6 
COMPARISON OF SELF-CONCEPT SCORES IN THE Two 
Groups 
Group 
Normal Distur! 
Measures (V = 302) | N= 23) Pie 
Vari- Vari- 
M ance x ance 
Adjective check list | 11.9] 29.9 | 7.4) 37.7 | 1.26 |2,94"* 
Picture ratings 13.5] 84.7 | 11.6] 165.7 | 1.96*] .60 
Blacky test 1.1] 11.7 | —4.4| 6.1 | t.on*/4.119* 
Story completions 3.6| 31.2 | —1.2| 21.6 | 1.44 |3.53°* 


* Significant at the .05 level. 
** Significant at the .01 level. 
TABLE 7 


Comparison OF MOTHER CONCEPT SCORES 
IN THE Two GROUPS 


Group 
Normal Disturbed 
Measures (N = 30) (N = 25) F t 
Vari- Vari- 
M| ance | # | ance 
Blacky test S| 2.6 |—1.5| 2.6 |1.00|2.31° 
Story completions 2| 5.8 SR) 1.36 | 2.64** 


* Significant at the .05 level. 
** Significant at the .01 level, 
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TABLE 8 


COMPARISON OF FOOD AVERSION SCORES 
IN THE Two GRrouPs 


Group 
Normal | Disturbed 
Measures (N = 30) | (NV =25)) F i 

Vari- Vari- 

M | ance ance 
Total Dislike 10.2| 22.2 |12.1| 40.0 | 1.80 | 2.10% 
Refuse to Eat 3.3| 11.8 | 5.6| 18.8 | 1.59 | 2.14% 
Never Tasted, Dislike 3.3) 4.9 | 4.1] 10.6 | 2.18° | 1.00 
Like 37.7| 22.4 |36.4| 53.2 | 2.38% | 17 


* Significant at the .05 level, 


The group differences in regard to food 
aversions, shown in Table 8, are also consistent 
with the prediction. The two statistically sig- 
nificant differences indicate that, as a group, 
the disturbed children have more food dislikes 
and also refuse to eat more foods than do the 
normal children. The two nonsignificant differ- 
ences suggest a tendency for the disturbed 
children to indicate fewer foods as liked and 
more foods that have never been tasted, but 
are disliked. In regard to these trends, it is 
noteworthy that on both measures the dis- 
turbed children are significantly more variable 
than the normal children. 4 

Thus, viewed as a whole, the statistical 
findings confirm Hypothesis IV. % 


Discussion 


The results obtained in the present investi- 
gation lend empirical support to the theoreti- 
cal and clinical conviction that mother-child 
relations play a crucial role in the formation 
of the child’s self-concept. The case histories 
of the emotionally disturbed children who 
were studied in this research contain consider- 
able evidence of traumatic early experiences, 
with much turmoil in the child’s early home 
environment. Often the child experienced 
little emotional warmth and little psycho- 
logical security. As Erikson (1950) has empha- 
sized, if the child does not experience consist- 
ent mothering and does not receive loving 
gratification of his needs, he is likely to de- 
velop a basic mistrust of himself and the 
world around him. Instead of developing con- 
fidence, he becomes anxious, unhappy, and 
unsure of himself and others. In keeping with 
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this psychoanalytically oriented theory of ego 


development, in the present study it was’ 


found that emotionally disturbed children, 
the majority of whom had negative family 
experiences from very early childhood, had a 
lower self-concept and a lower mother concept 
than did a group of normal children. The 
present findings do not provide answers about 
cause and effect, but they do show that the 
more positive the child’s self-concept, the more 


positive is his concept of the mother. That is, / 


the children whose self-picture contains signs 
of such characteristics as competence, inde- 
pendence, optimism, and social facility, tend 
to view the mother figure as being competent, 
nurturant, and emotionally stable. 

In the group of normal children, both the 
self-concept and the mother concept are re- 
lated to measures of food aversions, with 
. children who have a more positive self-picture 
and more positive mother concept tending to 
have relatively few food aversions and greater 
liking for a variety of foods. In the disturbed 
group, the self-concept appears to be related 
to food aversions, with fewer aversions in 
those children who have a relatively high 
self-concept. In this group, however, the rela- 
tionship between mother concept and food 
aversions is not so clear-cut. In general, the 
| disturbed children evidence a greater number 
of food dislikes and refuse to eat a significantly 
| greater number of foods than do the normal 
children. Particularly noteworthy is the finding 
that the disturbed children indicate a greater 
number of foods that they have never tasted 
but feel they would dislike. The disturbed 
children are also significantly more variable 
| in their scores indicative of food likes, with 
some of them saying that they like practically 
every item on the list and others indicating 
very few foods that they actually like. 

At present, there is almost universal agree- 
ment among psychologists and psychiatrists 
that the mother figure, love, and food are 
interwoven in the child’s unconscious proc- 
Ta There is little doubt that the feeding 
relationship between the mother and child 
during the oral stage of psychosexual develop- 
© ment is of high importance in the youngster’s 
developing psychodynamics. The way in which 
the feeding situation and oral gratification 
are handled or mishandled can have last- 
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ing effects on the child’s intellectual and emo- 
tional development. In the group of institu- 
tionalized children, in whom the case records 
provide ample evidence of early feeding con- 
flicts, there are much more variable reactions 
to foods, and these reactions do not appear to 
be systematically related to the child’s con- 
ception of the mother as evidenced in his fan- 
tasies. 

If the developmental process has been more 
orderly, as in the case of the normal children 
in this research, then relations between the 
mother concept and food aversions should 
be more predictable and understandable. With 
severely disturbed children, on the contrary, 
the specific types of psychopathology and in- 
appropriate use of defense mechanisms should 
lead to more complex and equivocal relations 
that fit less well with predictions based on 
theories of normal child development. 

In the present study, the focus is on statis- 
tical relationships and group differences, not 
on individual cases. However, by way of point- 
ing to some of the complexities and perplexing 
phenomena that await greater clinical under- 
standing, let us describe briefly two extreme 
examples from the emotionally disturbed 
group of boys. One boy who was very obese 
and actually ate voraciously avowed that he 
liked an extremely small number of items on 
the food inventory. Further evincing his pre- 
occupation with foods and oral activities, he 
was in the habit of calling other people “fat 
turkey” and would tell various staff members 
that they ate too much and were too fat. 
Another boy, however, who was exceptionally 
underweight and actually looked like the 
emaciated people who were released from con- 
centration camps, indicated that he liked 
practically every item on the food inventory. 
This boy, who was often teased about his 
physique by the other children and was nick- 
named “Boney,” had an extremely ambivalent 
relationship with his mother from infancy into 
later childhood. She was overprotective yet 
demanding, seductive yet hostile. Thus, with 
cases such as these, there were ample grounds 
for inconsistencies between conscious and un- 
conscious concepts of self and mother, as well 
as inconsistencies in relations between these 
schema and avowed feelings about food. 

In conclusion, then, it should be emphasized 
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that in conducting research on such complex 
subject matter, with extreme degrees of psy- 
chopathology, neurotic interpersonal relations, 
unconscious conflicts, and confusions between 
fantasy and reality, one should expect am- 
biguities in experimental findings. However, 
in the present research, all hypotheses were 
confirmed with the group of normal children, 
and findings from the disturbed children were 
generally consistent with theoretical expecta- 
tions. Thus, the theoretical framework used 
here seems fruitful for research on the com- 
parative psychological development of dis- 
turbed and normal children. 


SUMMARY 


Measures of self-concept, mother concept, 
and food aversions were obtained from a group 
of 11-year-old normal boys and a group of 
11-year-old boys who were institutionalized 
for emotional disturbances. Indices of self- 
concepts and mother concepts were derived 
from direct methods and projective methods 
of assessment, and food aversions were as- 
sessed by an objective inventory. It was hy- 
pothesized that self-concepts and mother con- 
cepts are positively associated, and there are 
negative correlations between these measures 
and number of food aversions. These predic- 
tions were confirmed in the normal group and 
generally supported by findings from the 
emotionally disturbed group. However, there 
were more inconsistencies in the relations 
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found in the disturbed sample. As predicted, 
in comparison with the disturbed childre 
the normal boys evidenced a significant 
higher self-concept, higher mother concept, 
and fewer food aversions. : 

Some relations between these empirica l 
findings and theories of child development 
were discussed, and pathological early mothe: i 
child relations were considered in accounting 
for the findings from the institutionalized 
children. It was concluded that the results of 
this study are quite encouraging and that, in 
spite of the complexities involved in research 
on such subject matter, much can be learned ~ 
about the process of child development and 
personality formation from comparative stud- 
ies of normal and emotionally disturbed 
dren. 
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CHILDREN’S ATTITUDES TOWARD RATIONAL VERSUS 
INHIBITING PARENTAL AUTHORITY 


ANATOL PIKAS 
Uppsala University, Sweden 


LTHOUGH the work by Adorno, Frenkel- 
Brunswik, Levinson, and Sanford 
(1950) has shaped the concept of the 
authoritarian personality for social psycho- 
logical research, the book did not define the 
term, and, as has often been pointed out, it 
lacks a concise statement of the hypotheses of 
the research project as a whole (Christie & 
Jahoda, 1954; Smith, 1950). It has been ob- 
served that The Authoritarian Personality 
began as a study of the Fascist personality. 
The essential traits of the syndrome with 
which the book deals center around problems 
of ethnocentrism, and it seems likely that the 
book was not given its title until the final 
stage of the research (Christie & Jahoda, 
1954). The present author has been especially 
struck by the fact that in The Authoritarian 
Personality and still more in later research 
originating in the F Scale, there is little ex- 
plicit attention to the conception of authority 
(Gregory, 1955). 

Fromm (1941), considered to have intro- 
duced the concept of the authoritarian per- 
sonality in modern psychology (Bay, 1958), 
based his definition on the word authority: 


Authority refers to an interpersonal relation in 
which one person looks upon another as somebody 
superior to him, But there is a fundamental difference 
between a kind of superiority-inferiority relation which 
can be called rational authority and one which may be 
described as inhibiting authority. 


Both enactment of the role of inhibiting 
authority and the tendency to submit to this 
type of authority were included by Fromm 
(1941) in the “authoritarian personality” 
syndrome that became the subject of exten- 
sive treatment. On the other hand, neither 
enactment of the role of rational authority nor 
attitudes towards such authority were exten- 
sively treated by Fromm or by Adorno et al. 
(1950). The purpose of this study is to re-es- 
tablish the distinction between the two types. 
Attitudes of adolescent children towards 
parental authority are studied with respect to 
the crucial context in which the authority 
expresses restrictive norms and performs ac- 


tions that block or check the spontaneous ac- 
tivity of the adolescent. The basic assumptions 
underlying the variables and measures of the 
study are these: 

1. Norms and actions of the parents that 
hinder the spontaneous activity of the adoles- 
cent may, according to the kind of justifica- 
tion they are given, be a manifestation of 
either rational or inhibiting authority. 

Definition I. Attitudes of children towards 
those restrictive norms and checking actions 
of their parents that are accompanied by mo- 
tivations giving expression to the parents’ aims 
for the children’s welfare, are called the Ara 
variable (“A” for authority and “ra” for 
rational); children’s attitudes towards those 
restrictive norms or checking actions of their 
parents that are accompanied by motivations 
expressing egoistic domination or the desire 
for authority for authority’s own sake, are 
called the Ainh variable (“A” for authority 
and “inh” for inhibiting). 

Since measures of the two variables were 
constructed for this study in such a way that 
members of each pair of items were identical 
to each other except for ascribed parental 
motivations, the variable Ara should’ elicit 
more positive attitudes than the variable Ainh 
(Ara > Ainh), This assumption has an axio- 
matic rather than a hypothetical nature. If 
Ara > Ainh holds empirically, the validity of 
the test is supported. 

2. In these two variables a distinction is 
made between two types of items designated 
by (c) and (a). f f 

Definition IT. Restrictive norms or checking 
actions that have a relatively concrete charac- 
ter, as pertaining to given material objects, 
are represented by the letter c in parentheses, 
which is added to the variable symbol. The 
restrictive norms or checking actions that be- 
long to more abstract or general relations are 
represented by the letter a in parentheses. 

Thus a distinction is made between Ara(c) 
and Ara(a), likewise between Ainh(c) and 
Ainh(a). The distinction between (c) and (a) 
was introduced because pilot investigations 
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showed that restrictive norms and checking 
actions concerning relatively concrete objects 
or relations tended to evoke more negative 
attitudes. 

3. In order to stimulate the child’s imagina- 
tion, items to measure the Ara and Ainh varia- 
bles were given the form of unfinished impli- 
cation-sentences (if...then...). All the 
sentences were given in both direct and projec- 
tive form. In the latter case the children an- 
swered on behalf of an imaginary child of their 
own age. The variables measured projectively 
are represented by Pr and those directly ob- 
tained by Di. Thus so far eight variables have 
been defined: Ara(c)Pr, Ara(a)Pr, Ainh(c)Pr, 
Ainh(a)Pr, Ara(c)Di, Ara(a)Di, Ainh(c)Di, 
Ainh(a)Di. 

Attitudes of children towards the restrictive 
norms and checking actions of their parents 
should be more positive in the direct than in 
the projective form of the test (ADi > APr). 

4. Because of methodological interest in 
results yielded by the incomplete implication 
sentences and by more conventional attitude 
questions, two more variables were added to 
the system. 

Definition ITT. A(c) and A(a) are variables 
having the same content as Ara and Ainh 
except that the items have the form of atti- 
tude questions and lack the additions showing 
whether the parental authority is of a rational 
or inhibiting nature, 

The A(c) and A(a) variables occur only in 
the direct form. 


5. To permit exploration of how the varia- 
bles thus far defined correlate with the sub- 
* own responses in the role of rational or 
puoli, authority, a final variable was 


Definition IV. By variable Ora is meant the 
extent to which the adolescent prefers altruis- 
tic or task centred alternatives to authori- 
—_ ones in imaginary occupational situa- 
tions. 

The O variable was measured both directly 
and projectively in the same way as the Ara 
and Ainh variables. 

6. The investigation set out to study all the 
foregoing variables in relation to the chrono- 
logical age of the subject. The recognized fact 
that authoritarian scales correlate negatively 
with intelligence (Titus & Hollander, 1957) 
makes it important to control for intelligence 
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level. The battery of the tests was therefore 
supplemented by a short verbal intelligence 
test. The general line of thought here was that 
besides the emotional preferences that the 
attitude inventories were to measure, one must 
also reckon with cognitive factors. On the view 
that the intellectually less developed person 
is, in Lewin’s terms, not well structured cogni- 
tively, he should show a tendency to agree 
with the authoritarian items because these 
provide a simpler solution. According to this 
reasoning two basic assumptions are advanced 
which can also be considered as the chief hy- 
potheses of the study: 

a. Younger children, as cognitively less well 
structured than older children, show more ap- 
proval than do older children of parental 
checking actions that have purely authori- 
larian motivations: Ainh decreases with age. 

b. Variables that reflect the function of 
rational authority show the opposite relation: 
Ara increases with age. 

METHOD 
Subjects 

The study used 656 boys and girls from four different 
schools: two in Stockholm, one in Uppsala, and one 
from a community in central Sweden. The schools 
differed in social class and standard of intellectual 
ability. In order to obtain an equal number from each 
social group and sex on each of the five age levels, a 
stratified sample of 300 subjects was selected: 10 boys 
and 10 girls from each of the three social classes on cach 
age level, The results pertain partly to the total number 
of subjects, partly to the stratified sample. 


Instruments 


a atiy fy~ cheb arya as sumiratna 
1. Examples of items assigned to pa 
variables follow. + 

Items in the two A inventories have certain sim- 
ilarities in content. Two items from the A inventory of 
attitude are: T 


Item 45 in A(c) 
Should children go to bed at a set time? YES ? NO 
Ttem 52 in A(a) 


Do you think that parents should sometimes be 
strict? 


If his parents think that he should go to bed at a set 
so that they can be left in peace, he thinks 


RATIONAL VERSUS INHIBITING PARENTAL AUTHORITY 


317 


TABLE 1 
VARIABLES MEASURED BY THE Timer InvENTORIES 


‘of the 


Nature of I 
ure inventory v 


A inventory consisting 
of implication sen- 
tences 


A inventory consisting 


Dimensions 
of attitude questions ent in this inventory 


O inven Ora 
of paired altermatives sie ot task cen- 
resenting imagined tered motives 
Sopalin situations) 


ra and inh not pres- 


Oinh | Alternatives that reflect 
authoritarian motives 


Ara items of a more | Ara items of a more ab- 
stract nature = Ara(a) 


Ainh | Parental checking action | Ainh items of a more | Ainh items of a more 
as an expression of im- 


concrde nature = nature = 


abstract 
Ainh(a) 


Alternatives that reflect | Dimensions (c) and (a) are not present in the O 
altruistic inventory 


Note—All the variables with the exception of Alc) and A(a) were tested both projectively (Pr) and directly (DI), Each of 


the variables thus obtained has nine items, 


The A(a) item has in its turn two corresponding items 
in the implication sentence inventory: 

Item 4 in Ara(a)Pr 

If his parents are strict sometimes, because 


they want 
him to grow up to be a good person, he thinks 


t 
Item 23 in Ainh(a)Pr 


two for each subject, according to the rotation principle. 


are two examples from the Pr form for boys: 
As a judge, he feels 1, when he knows that no 


most satisfied criminal escapes pun- 
ishment. 
2 ye pad 
just as 
As the captain of a 1. when the ship carries its 
large merchant ship, cargo quickly to the 
he feels most satis- port of destination. 
fied 2. when he is allowed to 


BE 
i 
ti 
H 
i 


„erto PEPE 
iit 
zF 
ji; 

ii 

rf 
i 
E 

rii 


dl 
i 
4 


tudes with satisfactory reliability. Only 2% 
of the answers could not be classified (for only 
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one item was the value as high as 5%) and 
the phi coefficient between independent raters 
was greater than .95. 


Reliability and Homogeneity 

In 60 different item analyses (12 variables 
at five different age levels each), the differ- 
ence between the higher and lower halves was 
strongly in the expected direction for each 
item, with two exceptions: one item in Ara- 
(c)Pr showed a reversed direction in four of 
the age levels, and two other items in Ara- 
(c)Pr and Ara(a)Di showed no difference 
between the higher and lower halves in their 
respective age levels, 

The Kuder-Richardson reliability, calcu- 
lated for each variable in the age levels 11, 
13, and 15, was generally low, one reason being 
the shortness of the nine-item inventories, 
The average for the 36 K-R coefficients was 
54, 


Construct Validity 


The assumption that the subjects would be 
more positive towards parental restrictive 
norms and checking actions when these reflect 
the parental role as rational authority (Ara) 
than when they reflect purely authoritarian 
parental motives (Ainh) is empirically con- 
firmed (p < .005). Similarly, the assumption 
that more positive attitudes towards the re- 
strictive norms and checking actions of parents 
would be elicited by direct than by projective 
items was confirmed (p < -005) for the data 
from the entire sample. 


Developmental Trends 


The results for the selected sample equated 
for social class did not differ importantly from 
those for the total sample with respect to age 
trends. Only the general trends for the equated 
sample are considered here. The curves pre- 
sented in Figures 1-4 warrant the following 
general conclusions: 

1. All the Ainh variables decrease. In other 
words, positive attitudes towards parental re- 
strictive norms and checking actions dictated 
by authoritarian motives decrease with age 
(Figure 1). 

2. The Ara variables do not show uniform 
trends. When trends appear, namely, in 
Ara(c)Di and Ara(a)Pr they have an opposite 
tendency; (a) Ara(c)Di decreases and (b) 
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Ainh(c)Pr 


N 


Ainh(c)Di 
N 


Fic. 1. Developmental trends of the four Ainh 
variables. 


Ara(a)Pr increases (Figure 2). Or to put it 
another way: (a) Positive attitudes towards 
parental restrictive norms and checking ac- 
tions, accompanied by rational motivations, 
decrease with age when the items are of a 
more concrete nature and the attitudes have 
been expressed in the direct form of the sched- 
ules; (b) Positive attitudes towards parental 
restrictive norms and checking actions, accom- 
panied by rational motivations, increase with 
age when the items are of a more abstract 
nature and the attitudes have been expressed 
in the projective form of the schedules. 

3. The A variables consisting of simple 
attitude questions showa tendency to decrease, 
which is more clearly marked in A(c) than in 
A(a). That is, positive attitudes towards gen- 
eral restrictive norms of upbringing decrease 
with age, this tendency being more obvious in 
items of a more concrete content (Figure 3). 

4. The variable Ora shows a tendency to 
tise, that is to say, preference for altruistic 
alternatives over authoritarian ones in imagi- 
nary occupational situations tends to become 
greater (Figure 4). Ora is the only variable 
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Ara(c)Pr 
ee = 


p2] 


Ara(c)Di 
cS) S 


= 
© 


11 12 13 14 15 
Age 


1 12 13 14 15 
Age 


Fic. 2. Developmental trends of the four Ara 
variables. 


that shows distinct sex-differences. However, 
since the boys’ or girls’ inventories contained 
different occupations the frequency of the 
answers of the sexes cannot be compared. 


Correlation of Attitude Variables with Verbal 
Intelligence and with Preferences for Aliruistic 
or Authoritarian Alternatives (O) 


The correlational data of the study, com- 
Prising tetrachoric coefficients based on di- 
chotomizing the data as nearly as possible to 
the median, are presented in Table 2. The cor- 
relations with verbal intelligence and with 


Boys e——e 
Girlso--- 


11 12 13 1⁄4 15 


11 12 13 14 15 
Age 


Age 


Fic, 3. Developmental trends of the two A vari- 
ables, 
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BoySe——e Girlsoe--~-0 


Age 
Fic. 4, Developmental trends of the two Ora vari- 
ables. 


Ora are based on the smaller stratified sample. 
Correlations of the attitude variables with the 
chronological age (CA) are based on the entire 
larger sample. 

A study of the columns of the table for Ora 
correlations shows that although the individual 
correlation coefficients seldom attain signifi- 
cance, when regarded as a whole they exhibit 
a pattern that appears to justify two general 
conclusions: Ora correlates positively with 
Ara; and Ora correlates negatively with Ainh. 
One of the Ara variables is an exception to 
this, namely, Ara(c)Di which correlates nega- 
tively with Ora on one age level. In other 
words, children’s preferences for altruistic 
occupational motives as opposed to authori- 
tarian ones are positively related to their 
attitudes toward rationally motivated check- 
ing actions by their parents, and negatively 
related to their attitudes toward authori- 
tarianly motivated parental checking actions. 

Regarding the connection between the atti- 
tude variables and verbal intelligence, the 
preference for altruistic alternatives (rather 
than for authoritarian ones) in imaginary occu- 
pational situations (Ora) increases with rising 
verbal intelligence. 

In regard to the correlation of the A varia- 
bles with verbal intelligence, the general trend 
of the relationship agrees with the age trends 
already described. Ainh shows on the whole 
negative correlations, while the pattern of 
‘Ara correlations is not so clear. Because of the 
incompleteness of the table of correlations, it 
would be hard to draw more precise conclu- 


sions. 
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TABLE 2 


CORRELATIONS OF ATTITUDE VARIABLES WITH CHRONOLOGICAL AGE (CA), VERBAL INTELLIGENCE, AND THE 
Ora VARIABLE 


Content of Variable Symbol w es 


Verbal Intelligence Ora Variable* 


Age Groups Age Groups 


13 15 11 13 15 
(N=60) | (N=60) | (N=60) | (N=60) | (N=60) 


—.49** 
—.40** 


Conventional attitude ques- 
tions (concrete) 

Parental authoritarian 
checking (concrete and 
direct) 

Parental authoritarian 
checking (abstract and 
direct) 

Parental rational authority 
checking (concrete and 
direct) 

Parental authoritarian 
checking ty and 
projective: 

Parental authoritarian 
checking (abstract and 
projective, 

Conventional attitude ques- 

tions (abstract) 

Parental rational authority 
checking (concrete and 
projective) 

Parental rational authority 
checking (abstract and 
direct) 


Parental rational authority 
checking (abstract and 
projective) 

Preference for altruistic mo- 
tives to authoritarian 

rojective) 

Preference for altruistic mo- 
tives to authoritarian 


A(c) 


Ainh(c)Di 
Ainh(a)Di —.38** 
Ara(c)Di —.21%* 
Ainh(c)Pr —.18** 
Ainh(a)Pr —.14* 


A(a) 
Ara(c)Pr 


—.13* 


02 
Ara(a)Di -07 
Ara(a)Pr -14* 
OraPr Ghee 


OraDi -30** 


(direct) 


.45* 
—.38*| > 


.14 | —.45* | —.27 


—.21 


16 


-60* | —.27 | —.18 
—.15 


— .38* 


-40* 


+24 -23 


.39* .76* .32* 


.26 .55* aif 


a ADi variables have been correlated with OraDi; APr variables have been correlated with OraPr. 
Where the scatter diagrams failed to satisfy requirements of linearity or homoscedasticity, no coefficient is presented. 


* Significant at the .05 level. 
** Significant at the .01 level. 


Discussion 

As the present study seems to be the first 
empirical investigation to stress the distinction 
between rational and inhibiting authority, it 
has been difficult to suggest a precise system 
of hypotheses. The general hypothesis ad- 
vanced at the outset was that the variable 
Ainh, which could be considered to measure 
authoritarian attitudes, would decrease, and 
Ara, or positive attitudes towards rational 
authority, increase during the developmental 
stage of 11-15 years. In the same way, Ora— 
preference of altruistic to authoritarian alter- 
natives in imaginary occupational situations— 
would increase. 


This approach may be characterized as cog- 
nitive and formalistic. The distinction drawn 
between the two types of authority is based 
on the parental motives to which the parental 
restrictive norms and checking actions are 
ascribed in the situational questions presented 
to the children. Certain cognitive qualities 
are required of the subjects before they can at 
all understand the rational motives accom- 
panying the checking actions and give the 
appropriate answer. According to this view, 
the positive correlations of attitude variables 
with verbal intelligence should be regarded as 
the primary relationship and those with 
chronological age as secondary: the older chil- 


inadai 


RATIONAL VERSUS INHIBITING PARENTAL AUTHORITY 


dren give certain answers only because of their 
increased intellectual ability. 

Besides the cognitive factors, the investiga- 
tion also involves variables that may be identi- 
fied as conative, in the distinction between 
(c) and (a) elements. The (c) items, where the 
parental checking actions are given a concrete 
form, evoke, according to our assumptions, 
more ego involvement than the (a) items, 
which are more general and abstract. We have 
seen that the (c) elements contribute to the 
decrement and the (a) elements to the incre- 
ment of the positive attitudes toward parental 
checking actions and restrictive norms during 
the developmental stage of 11-15. These find- 
ings may be understandable in terms of ego 
development. With increasing age, the young 
adolescent has less need for the support of 
parental norms concerning concrete things, 
and parental restrictions of the parents 
therefore become more and more threatening 
to personal freedom. 

Similar conative factors may be assumed to 
be at work in the distinction between direct 
and projective items. As already noted, the 
relation ADi > APr is valid for the data as a 
whole, i.e., children’s attitudes towards paren- 
tal checking actions are shown to be more 
positive in the direct form. However, as the 
figures show, this relation is subjected to a 
characteristic change so that ADi > APr tends 
with age towards ADi = APr. The decrease 
of Di with age may be interpreted along lines 
similar to those invoked with (c): when the 
items are given in the Di form, older subjects 
feel more threatened by the restrictions than 
do younger ones. 


SUMMARY 


Starting with Fromm’s (1941) distinction 
between rational and inhibiting authority, the 
study investigates children’s attitudes toward 
their parents’ restrictive norms and checking 
actions, assigned to either rational or inhibit- 
ing authority according to their ascribed moti- 
vation. Further distinctions between checking 
actions of a more concrete versus a more ab- 
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stract nature, between direct and projective 
forms of inventories, and between different 
formulations of the items as implication sen- 
tences versus conventional attitude questions 
make a coherent system of variables that is 
applied according to the cross-sectional 
method to a sample of 656 Swedish children 
from 11-15 years of age. 

The results indicate that if parents’ restric- 
tive norms and checking actions are accom- 
panied by authoritarian motivations (inhibit- 
ing authority), the children’s positive attitudes 
towards these are negatively correlated with 
chronological age, verbal intelligence with the 
age factor eliminated, and preference for al- 
truistic alternatives to authoritarian ones in 
imaginary occupational situations, but that 
these correlations are greater for items con- 
cerning more concrete situations, and for in- 
ventories applied in a direct rather than pro- 
jective form. 

Tf the restrictive norms and checking actions 
are accompanied by rational motivations (in- 
dicating rational parental authority), the atti- 
tudes have a tendency to correlate positively 
with these three independent variables if they 
are given general and abstract formulations 
(as opposed to concrete) and the projective 
forms of inventories are employed. 
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schizophrenics, the motivation to avoid 

punishment is stronger than the motiva- 
tion to achieve reward, at least when both 
reinforcers are mild. When punishment follows 
any incorrect response or failure to respond, 
this differential leads to faster learning by 
schizophrenic subjects under punishment con- 
ditions than reward conditions, whereas the 
reverse is usually true for normal subjects 
(Bleke, 1955; Buss, Braden, Orgel, & Buss, 
1956; Cohen, 1956; Leventhal, 1959; Ullman 
& Forsythe, 1959). Only one relevant study 
reports the negative effects of punishment to 
be stronger than the positive effects of reward 
with normal subjects (Ferguson & Buss, 
1959). 

The two separate studies reported here 
(Atkinson, 1957; Robinson, 1957) were de- 
signed to test the generality of these findings 
with independent samples and to investigate 
the role of the interpersonal element in the re- 
inforcement. The present experiments differed 
from previous studies in that the learning task 
was designed to provide the correct answer 
mechanically so that the subject was not 
dependent on the experimenter for information 
during the learning process, and a control 
group was employed whose responses received 
no reinforcement of any kind. 

1 This report is based upon PhD dissertations sub- 
mitted in 1957 by Atkinson to the Department of 
Psychology, Indiana University, and by Robinson to 
the Department of Psychology, Stanford University. 
The experiments were conducted while the authors 
were trainees in the Clinical Psychology Service, 
Veterans Administration Hospital at Palo Alto, 
California. 

2 Special thanks are due Paul McReynolds, Chief of 
Research, Psychology Service at that hospital for his 
invaluable aid and encouragement, and to the many 
members of the staffs of the Veterans Administration 
Hospital and nearby Agnews State Hospital who pro- 
vided facilities and aided in the selection of subjects, 
The authors are also grateful for the assistance of their 
respective dissertation committees, D. C. Beier, J. P. 
Egan, A. M. Binder, and D. Williams at Indiana 
University, and C. L. Winder, Q. McNemar, and P. W. 
McReynolds at Stanford University. 
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A number of authors have stressed the im- 
portance of threats of punishment and recur- 
rent failure in the development of the schizo- 
phrenic syndrome. Dollard and Miller (1950), 
Mowrer (1950), and others have discussed 
neurotic and psychotic symptoms as habits 
developed through reduction of a fear drive, 
For them, escape from unpleasant or punish- 
ing situations serves to reinforce the appar- 
ently maladaptive behavior characteristic of 
psychotic individuals. Reward is seen as a 
weak reinforcer for these individuals because 
it has represented a conflictful response from 
the mother and cannot be “trusted.” The re- 
sponse to reward may take either the form of 
blocking or selective inattention. An excellent 
extension and elucidation of a similar view- 
point has been presented by Rodnick and 
Garmezy (1957); in particular, these authors 
give evidence for the importance of the schizo- 
phrenic’s early relationship with his parents in 
determining his later learning behavior. 

The theoretical framework employed here 
led to the following predictions: 

1. Compared with normal subjects, schizo- 
phrenics are more highly motivated to avoid 
punishments and less highly motivated to at- 
tain rewards. Thus, schizophrenic subjects 
learn more rapidly under conditions of punish- 
ment for incorrect responses and for failures 
to respond than under conditions of reward for 
correct responses. For normal subjects, the 
reverse situation holds; learning is faster under 
reward conditions. 

2. Schizophrenic subjects whose incorrect 
responses and failures-to-respond are punished 
learn faster than nonreinforced control sub- 
jects. 

3. Schizophrenic subjects whose correct re- 
sponses are rewarded learn no faster than con- 
trols and may, indeed, learn more slowly. 

4. A social reinforcement leads to intensifi- 
cation of the effects exhibited by a mechanical 
(impersonal) reinforcement. 
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METHOD 


The first experiment used four groups of 20 female 
subjects each, a total of 80. Two groups were selected 
from the female schizophrenic population at Agnews 
State Hospital to represent individuals in rather acute 
stages of illness. No subjects had had more than two 
previous hospital admissions or one year of total 
hospitalization, or had any received EST or tranquil- 
lizing drugs during the 4 months previous to testing. 
The other two groups consisted of normal female 
volunteers at the Palo Alto Veterans Administration 
Hospital. 

The second experiment used a total of 100 male 
schizophrenic subjects, randomly divided into five 
groups of 20, all drawn from the patient population of 
the Palo Alto Veterans Administration Hospital and 
representing those patients who were relatively more 
acute and in better contact than the average for that 
hospital. This sample was similar in age to the patient 
population in the first experiment (means of 32 years in 
each) but had an average of three hospitalizations, a 
longer period of continuous hospitalization, and less 
education (means of 12.8 and 11.6 school years, re- 
spectively). Approximately three-quarters of the male 
group were receiving tranquilizing medication, al- 
though none had had EST during the previous 6 months. 
The normal female volunteers on the average were 
older (40 years) and better educated (13.9 school years). 
No subgroup differed significantly from the mean of its 
respective sample in any of these factors. 

The experimental task consisted of learning three 
lists of 15 pairs of fairly similar adjectives, a pretest list 
and two test lists. The lists had been equated for dif- 
ficulty in a pilot study. In both experiments, List I 
(the pretest list) was the same for all subjects. In the 
first experiment, the master list of 30 pairs comprising 
Lists II and III (the test lists) was arranged to provide 
two lists of 15 pairs each. This master list was circular 
and was “rotated” with every third item designated as a 
possible starting position. Each subject was randomly 
assigned to one of the 10 starting positions with the 
restriction that each position was used equally often for 
a given experimental group. This complex pı lure, 
originally intended for the study of serial effects, was 
abandoned in the second experiment in the interest of 
simplicity. The 30-item list was split into Lists II and 
IN at a point which had provided equality of difficulty 
in the first study. 2 

Testing was done individually, the subject being 
seated in front of a Hull-type memory drum with the 
experimenter beside him. The drum revolved one place 
every 4 seconds. The first time through each list, the 
word-pairs appeared in the window, and the subject was 
required to say them aloud. During subsequent trials, 
only the stimulus word appeared, followed in 4 seconds 
by the pair, then by the next stimulus word, etc. The 
subject was required to respond before the pair ap- 
peared. In other words, the subject was jnformed by 


one errorless trial. During the first list, no group Te- 
ceived any reinforcement. Lists II and Til were run 
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under Teward, punishment, or control conditions, 
depending on the subject’s group assignment. List I, 
thus, served as a base line against which the effects of 
the reward or punishment conditions could be sub- 
sequently evaluated. Between Lists I and II and be- 
tween Lists II and III, the subjects sorted a series of 
cartoons according to their judgment of the degree of 
humor involved. This was done primarily to provide a 
rest period and to minimize interference between lists. 

In each case in which a form of “reward” served as 
the reinforcement, it followed correct responses and was 
administered as the drum turned to reveal the correct 
pair. In each case in which “punishment” served as the 
reinforcement, it followed either incorrect responses or 
failures to respond, and it, too, occurred as the drum 
turned to reveal the correct pair. 

The samples were divided as follows: one female 
schizophrenic group, one male schizophrenic group, and 
one female normal group were designated “personal 
reward” groups. Likewise, one female schizophrenic, 
one male schizophrenic, and one female normal group 
were designated “personal punishment” groups. One 
male schizophrenic group received “impersonal re- 
ward” and one male schizophrenic group received 
‘impersonal punishment.” The final male schizophrenic 
group received no reinforcement at all. 

Under personal reward conditions, one of the fol- 
lowing statements was made by the experimenter after a 
correct response to Lists II and III: “good,” “umhmm,” 
“right,” and “that’s right,” in random order. Similarly, 
under personal punishment conditions, following an 
incorrect response or a failure to respond, the experi- 
menter said, “no,” “wrong,” “that’s wrong,” or “that’s 
an error” in an even, quiet tone. No mention was made 
by the experimenter of the reinforcement. 

Subjects in the impersonal reinforcement groups 
were told at the beginning of List II that the procedure 
would be the same as on the first list except that the 
checking system would differ. An old-style Dalton add- 
ing machine was employed which made a loud sound 
when the lever was pulled. The reinforcement, the loud 
sound, was identical for these impersonal-reinforcement 
groups except that for the reward group it followed 
correct responses and for the punishment group it 
followed incorrect responses or failures to respond. It 
was made as the drum turned to reveal the correct pair. 

‘At the completion of the session, which averaged an 
hour in length, the male subjects were asked a series of 
questions designed to elicit their awareness of the Tein- 
forcement, their emphasis on the “rightness or 
“yrongness”” of their responses, and their conception of 


the experimenter’s role. 
RESULTS 


Analysis of the results was carried out in 
terms of trials arid errors on each list, and the 
differences between the subjects’ performance 
on List I and List III (change scores). In 

eral, the data for trials and errors were 
e consistent, although differences between 


quit e 
groups tended to be cumulative so that the 
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List II data were not significant. For the sake 
of brevity, only trials will be reported. 

For the female subjects, complex analysis of 
variance applied to the change scores produced 
an F ratio of 16.05 for the interaction term, 
while there were no statistically significant 
group or treatment effects. This F ratio, sig- 
nificant beyond the .001 level, indicates that 
the reward and punishment treatments had 
differential effects on the two diagnostic 
groups; the normal subjects improved under 
the reward condition and not under punish- 
ment, whereas the schizophrenic subjects 
showed an increase in trials to criterion under 
reward and a decrease under punishment. 

Simple analysis of variance technique ap- 
plied to the change data of the male groups 
yielded an F ratio of 4.04, significant beyond 
the .01 level. To compare each male group with 
every other male group, probabilities were 
calculated on the basis of one-tailed ¢ tests 
except for those comparisons for which the 
direction of difference had not been predicted 
as in the control group against each reward 
group. Table 1 presents the probabilities, thus, 
obtained. 

Table 1 indicates that the two punishment 
groups were clearly and significantly different 
from the two reward groups and the control 
group, but very similar to each other. The dif- 
ference between the reward groups and the 
control group failed to reach the .05 level of 
significance, however, and the impersonal re- 


TABLE 1 


Mean TRIALS PER List FoR EACH 
'AL GROUP 


(N = 20 per group) 


Trials 
Treatment 
List | List | List |List I- 
I I | IM |List m1 
Female schizophrenics 
Personal reward 6.95/7.15|7.75| —.80 
Personal punishment 6.65/6.10)5.05| 1.60 
Female normals 
Personal reward 6.35|5.95|5.05| 1.30 
Personal punishment '5.70|5.40/5.70}  .00 
Male schi; ics 
Personal reward 7.50/6.55|9.65|—2.15 
Impersonal reward 8.35/6.85/8.70| —.35 
No reinforcement 8.30/7.15/8.25| .05 
Impersonal punishment 8.80/6.70|7.20| 1.60 
Personal punishment 8.20/6.95|6.85| 1.35 
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ward group differed very little from the control 
group in mean change scores. The difference 
between the two reward groups failed to reach 
the .05 level of significance. 


DISCUSSION AND CONCLUSIONS 


The results suggest the following conclu- 
sions, especially when fortified by further 
analysis of the kinds of errors made by the 
subjects and the data from the ‘‘awareness”’ 
questionnaire administered following List III. 

1. Learning by schizophrenic subjects in 
this kind of experimental situation is more 
rapid under conditions of mild punishment 
(personal or impersonal) for incorrect responses 
and failures to respond than under conditions 
of mild reward (personal or impersonal) for 
correct responses. For normal subjects the 
reverse situation is true; learning is more rapid 
under reward conditions. 

2. In comparison with a procedure which 
furnishes no reinforcement, punishment ac- 
tually has a facilitating effect upon learning 
by schizophrenic subjects. Analysis of the 
types of errors suggests that the punishment 
may have facilitated performance by serving 
to break perseverative tendencies. 

3. Reward with schizophrenic subjects in 
this kind of situation does not function as a 
positive reinforcement in the sense of increas- 
ing the probability of the occurrence of correct 
responses. This is in opposition to the results 
found with normal subjects. 

4. The situation with respect to the possible 
decremental effects of reward is less clear. The 
t values were of such magnitude that it is not 
possible to say definitely whether the reward 
groups were clearly different from the control 
group, in which case the reward may have 
blocked the learning process in some manner, 
or similar to the control group, in which case 
it is probable that the subjects were inattentive 
to the reinforcement. The possibility remains 
that with larger groups of subjects, reliable 
differences would have become apparent. 

5. Concerning the personal element in the 
reinforcement, it is clear from the data that 
the two punished groups did not differ in 
change scores. In addition, these two groups 
showed no significant difference in any of the 
three categories included in the questionnaire 
on awareness of reinforcement. Thus, it ap- 
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pears likely that the two forms of punishment 
produce very similar effects. 

6. The two reward groups, on the other 
hand, differed in their change scores, with the 
personal reward group showing greater im- 
provement in learning from List I to List III 
than the impersonal reward group; however, 
the difference did not reach statistical signifi- 
cance. On the awareness questionnaire the 
personally reinforced group emphasized the 
factor of “being right” more frequently than 
did the impersonally reinforced group, and 
much more frequently verbalized the experi- 
menter’s role as that of an evaluator. 

It appears possible that in this situation, 
responses to punishment generalized to other 
impersonal cues but responses to reward did 
not, the two responses differing markedly in 
strength. The argument supporting this hy- 
pothesis would state that personal (parental) 
punishment, a clear-cut and strong stimulus, 
motivates a strong learned “fear” drive, a 
response which is called forth readily by other 
“punishing” cues. The schizophrenic is a per- 
son who has not learned to differentiate the 
danger inherent in environmental cues from 
that inherent in rejection by his parents, and 
who has, thus, withdrawn as best he can from 
all possible punishment in his environment. 
The difference between the two kinds of reward 
would be expected to appear because reward 
has not been learned as a strong motivator, 
and the generalization of effects from one kind 
of reward to another is not so marked. 

An additional word is in order regarding the 
apparent contradiction between the present 
studies and those which have employed less 
specific kinds of reinforcement with schizo- 
phrenic subjects. The latter have consistently 
found that punishment leads to a decrement in 
performance, accompanied by various kinds of 
task irrelevant behavior. Persons involved with 
the care of the mentally ill have consistently 
advocated therapeutic acceptance and kindly 
attention toward psychotics, at the same time 
discouraging punishment and failure experi- 
ences. The results of these experiments should 
not be interpreted as challenging this point of 
view. In the first place, this experimental 
situation was specialized in that every incorrect 
response or failure to respond was punished. 
In the usual circumstance, a wide range of 
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maladaptive responses is possible, only part of 
which will be overtly disapproved as “socially 
unacceptable.” In the typical experimental 
situation in which the subject is told that his 
previous performance was in general “poor,” 
he may withdraw from the situation or make 
task irrelevant responses which decrease his 
vulnerability to punishment but also produce 
a decrement in performance. 

In the second place, hospitalization is not 
aimed at producing rapid learning, as such, but 
rather with getting people well. Perhaps “get- 
ting well” for the schizophrenic involves re- 
orientation with respect to those reinforcers 
which are effective and his developing the 
ability to seek rewards and avoid punishment 
in a constructive manner. 


SUMMARY 


Two separate paired-associate learning stud- 
ies, using a total of 180 subjects drawn from 
male and female schizophrenic and normal 
populations, suggest that: 

1. For schizophrenic subjects, unlike normal 
subjects, the motivation to avoid mild punish- 
ment is stronger than the motivation to achieve 
mild reward, so that in a situation in which 
giving a correct response is the only way to 
avoid punishment, punishment has a facilitat- 
ing effect on performance. 

2. Reward with schizophrenic subjects in 
this kind of situation does not serve as positive 
reinforcement and may actually have some 
debilitating effects. 

3. The two kinds of punishing reinforce- 
ment, both with and without the personal 
element, have similar effects; the two kinds of 
rewarding reinforcement were not so clearly 
similar. Differences in the generalization effects 
of reward and punishment may be involved. 
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HE disconcerting vulnerability of peo- 

ple’s convictions in forced exposure 

situations provides a provocative con- 
trast to the relative ineffectiveness of the 
voluntary exposure persuasion campaigns con- 
ducted over the mass media (Hovland, 1959). 
The high effectiveness of forced exposure situa- 
tions is seen in laboratory experiments, face-to- 
face discussion, and political indoctrination 
of captive audiences (United Kingdom Minis- 
try of Defense, 1955; United States Senate, 
1956). The various election studies (Berelson, 
Lazarsfeld, & McPhee, 1954), on the other 
hand, indicate the relatively small effects of 
indoctrination campaigns depending on the 
voluntary exposure of the audience. The 
student of persuasive communication is in- 
clined to attribute this contrast, at least in 
part, to the fact that people characteristically 
defend their convictions by avoiding exposure 
to counterarguments. This postulate of self- 
selected exposure to arguments so as to avoid 
receipt of information at variance with one’s 
already formed opinions has been too widely 
discussed in the recent literature to need devel- 
opment here (Festinger, 1957; Janis, 1957; 
Klapper, 1957). 

While defense-by-avoidance is likely to be 
highly effective for belief maintenance so long 
as the person can adequately regulate his own 
exposure to arguments, it has the disadvantage 
of leaving him poorly prepared to resist coun- 
terarguments should he be involuntarily ex- 
posed to them. Living in an ideologically 
monolithic environment, the person tends to 
underestimate the vulnerability of his beliefs 
and the likelihood of their being attacked. 


1 This study is one of a series on developing resistance 
to persuasion which are being supported by grants from 
the University Research Board (University of Illinois) 
and the National Science Foundation. 

H. C. Triandis and C. E. Osgood made many helpful 
suggestions after reading an earlier version of this report. 


Hence, he will have had little motivation or 
practice in developing supporting arguments 
to bolster his belief or in preparing refutations 
for the unsuspected counterarguments. This 
postulate of defense-by-avoidance, and its 
implication of a deficiency in both practice 
and motivation for developing belief resistance, 
suggested several hypotheses regarding the 
immunizing effectiveness of alternative types of 
prior belief-defense, before forced exposure to 
massive doses of counterarguments. 

The first of these hypotheses is suggested by 
the closely analogous health problem of the 
person who has been brought up in so aseptic 
an environment that he has failed to develop 
resistance to infection and, hence, although 
appearing in very good health, proves quite 
vulnerable when suddenly exposed to a massive 
dose of an infectious virus. The disease resist- 
ance of such a person might be raised by either 
of two procedures; he might be given suppor- 
tive therapy—good diet, exercise, rest, etc:— 
designed to better his physical condition; or 
alternatively, he might be given an inoculation 
of the infectious virus itself (in a weakened 
form) such as would stimulate, without over- 
coming, his defenses. With respect to develop- 
ing immunity to specific diseases, the inocula- 
tion procedure is generally more effective. 

Pursuing this medical analogy into the area 
of resistance to persuasion, it is hypothesized 
that the “supportive therapy” approach of 
pre-exposing a person to arguments in support 
of his belief has less immunizing effectiveness 
than the “inoculation” procedure of pre- 
exposing him to weakened, defense stimulating 
forms of the counterarguments. This prediction 
is made primarily for the case of widely ac- 
cepted cultural beliefs, since the person’s 
inclination to avoid exposure to counterargu- 
ments is likely to have been successful with 
such uncontroverted beliefs. However, even 
with regard to a culturally controversial issue, 
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it would be predicted on the basis of the 
postulate of defense-by-avoidance that a 
person would tend to choose his associates so 
as to remain within an ideologically homoge- 
neous subpopulation. Studies of political be- 
havior (Berelson et al., 1954) demonstrate this 
social homogeneity even with respect to so 
controversial an issue as presidential preference 
during the quadrennial United States cam- 
paigns. 

The hypothesized superiority of pre-exposure 
to weakened counterarguments runs counter 
to recommendations frequently repeated in 
investigations of prisoner-of-war behavior 
during the Korean conflict (Kinkead, 1959; 
United States Senate, 1956). The readiness of 
American personnel under prison camp condi- 
tions to surrender their prior convictions 
regarding proper behavior has been attributed 
to the failure of our formal and informal 
instructional institutions—home, school, army, 
etc.—to provide these soldiers with sufficient 
knowledge of “American” ideals and the 
reasons for maintaining them. Consequently, 
it was concluded, such vulnerability can be 
reduced in the future if our schools and other 
institutions teach more explicitly the reasons 
for maintaining these beliefs. The present 
hypothesis predicts that, on the contrary, pre- 

to the counterarguments themselves 
(in weakened form) is more effective as an 
immunizing agent than is the presentation of 
arguments supporting the cultural truism. 

A second hypothesis follows from the postu- 
late of defense-by-avoidance and its implica- 
tion that the person, due to a lack of practice, 
is inept at defending his beliefs when the 
necessity arises. In the case of such beliefs, the 
requirement that the person actively partici- 
pate, without guidance, in the defense should 
have a deleterious effect on the amount of 
immunity to later persuasion that the defense 
confers. Very likely the person cannot employ 
such an opportunity for participation effec- 
tively and the time could be more efficiently 
used in presenting him with already developed 
defensive material. Furthermore, such required 
participation should tend to be so difficult a 
task as to interfere with the reception of any 
defensive material presented, and to bring 
home to him forcibly how unable he is to defend 
his belief, thus tending to weaken rather than 
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strengthen his confidence in it. Other theo 
cal formulations leading to this second 
pothesis are considered in later discussion. A 
to be considered later are some theore' 
implications of the forced compliance studies 
and of studies involving active recitation dur- 
ing learning that might seem to lead to the 
opposite prediction. 

A third hypothesis involves an interaction 
effect between the two main variables, type 
of defense and amount of participation. The 
requirement of active participation should be 
less detrimental with the supportive type of 
defense than with the type involving pre- 
exposure to counterarguments. This prediction 
was based on two considerations, one having — 
to do with motivation, the other with the 
danger of overexposure to counterar guia f 
during the defense. 

Although the second hypothesis points to a 
net negative effect from a requirement of active 
participation under the present situation, one 
component of the participation effect is likely 
to be beneficial, namely, its effect on motiva- 
tion. By postulation, the person is not only 
unprepared, but also unmotivated to defend 
the supposedly unassailable cultural truisms ` 
prior to any attack. Hence, having to make 
an observable contribution during the defen- 
sive session should tend to supply some useful 
extrinsic motivation for working seriously at 
the defensive task. The lack of intrinsic motiva- 
tion (and the consequent beneficial contribu- 
tion of the requirement for active participa- 
tion) is likely to be more pronounced in the 
supportive defense condition, which is likely 
to appear to involve belaboring the obvious. 
The pre-exposure to (weakened) counterargu- 
ments involved in the other defense suggests 
that the “truism” might indeed be assailable, 
and, hence, provides more intrinsic motivation 
for working seriously at the defensive task than 
does the supportive defense. 

A second consideration leading to the pre- 
dicted interaction effect involves the possibility 
that the active participation condition, being 
less under the experimenter’s control, might 
overexpose the person to the dangerous mate- 
rial involved in the weakened counterargu- 
ments. This possible basis of the predicted 
interaction effect (as well as theoretical con- 
siderations leading to the prediction of an 
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opposite interaction effect) are discussed more 
fully in connection with the results obtained. 


METHOD 


General Procedure 


‘The experiment was represented to the subjects as a 
study of the relationship between reading and writing 
skills. Each subject took part in two one-hour experi- 
mental sessions, The first was designed to give the sub- 
ject various types of belief-immunizing treatments; the 
second, 2 days later, exposed him to strong counterargu- 
ing messages attacking the beliefs. 

First session. In the first, or “immunizing,” session, 
each subject was given defensive treatments on two be- 
liefs. The first defensive treatment involved assigning 
him a belief that is almost universally held by his peers 
and instructing him (under experimentally varied con- 
ditions) to write a 20-minute essay defending that be- 
lief. 

‘After the 20 minutes were up the subjects were given 
the second immunizing treatment, which involved read- 
ing for 5 minutes a 1,000-word essay (of an experimen- 
tally varied type) on another cultural truism and then 
answering a series of comprehension questions about 
the material read. 

‘The subject’s final task in the first session was to 
complete an attitude questionnaire designed to measure 
strength of belief in the cultural truisms. The subject 
was told to indicate his own belief, regardless of the 
positions taken in the essays that had been assigned to 
him, purportedly to allow us to check on whether the 
person’s beliefs about the particular subject matter used 
in the test had any effect on his performance level. The 
belief scores obtained on the immunized issues in this 
first session questionnaire furnished a measure of the 
direct effect of the defensive treatments themselves, 
prior to the second session presentation of the strong 
counterargument messages. 

‘Second session. The subject took part 48 hours later 
in a second session which involved presentation of three 
1,000-word essays containing strong counterarguments 
against each of the two previously defended, and one 
additional belief. After being allowed 5 minutes for 
reading each of these three messages, the subject was 
required to answer a series of multiple-choice questions 
on their contents. Finally the subject was once again 
given the opinion questionnaire just d ibed, and told 
to indicate his present personal belief, whatever might 
have been argued in any of the previous essays. The 
beliefs which the subjects indicated on these question- 
naires on the immunized and unimmunized beliefs 
provided the measure of the effectiveness of the various 
immunizing conditions. 

‘At the end of the experiment considerable effort was 
expended to inform the subject of the nature of the 
study and of the deceits used. Particular stress was put 
on the fact that the immunization and counterargu- 
ment messages were both constructed for 
purposes and that no weight should be given to any 
argument simply because it had been included in one 
of these propaganda messages. 
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Independent Variables 

Amount of active participation in the defense. The 
main manipulation of the degree of active, unguided 
participation was the already described reading vs. 
writing variation, the writing condition, of course, being 
defined as requiring more active participation than the 
reading. Within each of these conditions there was a 
further manipulation of the variable. There were passive 
and active subtypes of the reading condition; half of the 
subjects were told simply to read the material silently 
for 5 minutes, so as to be able later to answer questions 
testing their comprehension of it, the other half were 
instructed in addition to select and underline the one 
sentence in each paragraph most fully and succinctly 
summarizing that paragraph. The passages read were 
identical in active and passive subtypes, cach consisting 
of a 1,000-word, five-paragraph essay defending one of 
the cultural truisms. The first h summarized 
four arguments and each of the following paragraphs 
developed one of these in detail.? Within the writing 
condition also, there were two variations—guided and 
unguided writing. Half the subjects were given an out- 
line summarizing four defensive arguments that could 
be used, and the other half received no such guiding 
outline. 

Type of defense. Two types of defensive material 
ployed. In half the treatments, the prior defense 
involving presenting the subject with arguments sup- 

the cultural truism to which he initially ad- 
hered, in the other half, it involved exposure to possible 
counterarguments against the truism together with 
refutations of these counterarguments. In the reading 
conditions both of these types of defenses took the form 
of 1,000-word, five-paragraph essays, each essay pre- 
senting four supporting arguments, or mentioning and 
then refuting four counterarguments. In the writing- 
from-outline participation condition, the two t; of 
defense involved presenting the subject either with one- 
sentence synopses of each of four supporting arguments 
or with four two-sentence synopses, each mentioning A 
counterargument and an argument refuting it, In the 
writing-without-outline participation condition, the 
subjects in the supportive conditions were instructed 
to write an essay giving arguments in support of the 
truism, and those in the refutational, an essay mention- 
ing and refuting possible counterarguments against the 
truism. 

tal condition 

Each subject served in an experiment on 

each of four cultural truisms, On one, he received 


neither the defensive treatment in the first session nor 
the strong counterargument treatment in the second 


American 
Institute. Order Document No. 6745 from ADI Auxili- 
Publications Project, Photoduplication Service, 
Library of Congress; Washington 25, D.C., remitting in 
advance $1.75 for microfilm or $2.50 for photocopies. 
Make checks payable to: Chief, Photoduplication Serv- 
ice, Library of Congress. 
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second (to provide a measure of the effect of the attack 
in the absence of prior defense). On the other two be- 
Mieis, he received both the prior defensive and subse- 
it strong counterargument treatment, The pair of 

b peaini treatments given any one subject was not 
assigned completely at random: cach subject received 
one reading and one writing defense, (Since 20 minutes 
were allowed for writing and only 5 for reading, this 
restriction was imposed to assure that all subjects served 
the same amount of time in the immunization session.) 
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first (immunization) session, but did not for 
the second (counterargument) session are omitted from 
the analysis, Since the rhetoric course is required of 
almost all freshman students who enter any college of 
the university, the subjects probably constitute a rep- 
resentative (as regards age, sex, intelligence, 
etc.) of first-semester freshman students in all colleges 
of this large state university. 


ResuLTS 
General Effects 


The control data from the first session indi- 
cate that the initial beliefs were extreme and 
homogeneous, amounting, as intended, to 
cultural truisms. Even though the beliefs were 
already at the high initial level of 13.26 points 
on the 15-point scale (see Table 1), the eight 
defensive conditions did further strengthen 
them, moving them up to about 50% of the 
remaining scale distance (p < .001), 

Despite their unqualified initial acceptance 
the beliefs proved highly vulnerable to the 
massive counterarguments in the second ses 
sion, which reduced them from the 13.26 point 
level of the first session to 6.64 points (p < 
.001) when not preceded by any immunization 
treatment, Even with prior immunization, the 
strong counterarguments lowered the beliefs 
considerably (see Table 2) though the mean 
effect of the eight immunization treatments 
was to attenuate their effect by about one-third 
(p < 001), These eight treatments differed 
considerably among themselves in regard to 
both direct strengthening effects (see Table 1) 
and immunizing effects (see Table 2). We turn 
now to this question of comparative effects. 


Comparative Efficacy of Supportive and Refuta- 
tional Defense 


ve supportive defense (writing-with- 
) was superior to no-defense only on the 
and the poorest 
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TABLE 1 
Briers AFTER IMMUNIZATION WUT BEFORE EXPOSURE TO STRONO COUNTERARGUMENTS 
disagreement) 


TABLE 2 
BELIEYS AFTER DOTH IMMUNIZATOIN AND EXPOSURE TO THE STRONG COUNTERAROCMENTS 
(15 = absolute agreement; 1 = absolute disagreement) 
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tudes (which can be seen in Tables 1 and 2) fall 
short of conventional significance levels. 


Interaction between Type of Defense and 
Participation 

The interaction between type of defense 
( ive vs, refutational) and participation 
( vs. writing) is in the predicted direc- 
tion but of a magnitude that attains only 
borderline significance. The superiority of 
reading over writing is greater with the refuta- 
tional than with the supportive defense, in 
regard to both direct strengthening effect (p = 
.10) and immunizing effect (p = .08), as can 


regard to which there is a double reason for 
= the predicted interaction—with the 
other three participation conditions, This con- 
dition has the greatest immunizing effective- 
ness of the four with the supportive type of 
defense and the least of the four with the 
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likely ever to be exposed to the subsequent 
massive dose of counterarguments. The data 
indicate, however, that no such direct “boom. 
erang” effect occurred. On the contrary, the 
scores presented in Table 1 show that the 
refutational treatments had the immediate 
effect s strengthening the belief by 0.65 points 
(p = 02). 

The finding of a reversed relationship be- 
tween tbe immediate direct strengthening 
effect and the immunizing effectiveness of the 
refutational as compared with the supportive 
defense, raises the question of how well the 
immunizing effectiveness of a treatment can 
be predicted from its immediate apparent 

hening effect. The results over the eight 


strengt 
defensive conditions indicate that immuniza- — 


tion is very poorly predicted by the direct 
effect: over all eight defensive 

conditions, the rank order coefficient of correla» 
tion between immediate strengthening and 
subsequent resistance is actually negative 
(p = —0,38), Hence, the initial strength of 
belief or the amount of induced strengthening 
is a very poor indicator of conferred resistance 
to subsequent strong counterarguments. Some 
treatments (¢.g., the refutational defenses) that 
appear to leave the beliefs relatively weak 
actually confer on them hidden reserves of 
resistance, while other treatments (e.g, the 
supportive defenses) that seem to have much 
strengthened the beliefs have actually left them 
The 5 refutational manipula- 

su vs. uta 

pat: ep nda omnes fap aro 
two-sided argument pulation in previous 
studies (Hovland, & Sheffield, 
1949; Lumsdaine & Janis, 1953), Their one- 
sided condition is much the same as our suppor- 
tive; their two-sided condition is, however, & 
less close analog to our refutational. While 
their two-sided defense involved the supportive 
arguments unsupported or refuted, our 
refutational defense involved no supportive 
arguments, only some mention of counterargu- 
ments with detailed refutations thereof. The 
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Interaction Effect 


The interaction hypothesis received the 
least impressive confirmation of the three 
predictions. However it received its most sub- 
stantial support when the writing-from-outline 
participation condition, in which both the 
postulated mechanisms (lack of need for 
extrinsic motivation and danger of overexpo- 
sure to the counterarguments) are operative, is 
contrasted with the other three participation 
conditions. This finding suggests that both 
mechanisms contribute to the effect, since the 
condition in which only one mechanism (lack 
of extrinsic motivation) is operative, writing- 
without-outline, falls between this condition 
and the conditions (reading) in which neither 
mechanism is operative, in regard to the 
superiority of refutational over supportive 
defense. 

The interaction effect predicted and ob- 
tained in this study cannot be generalized to 
situations that fail to present the refutational 
material in such wise that the subject perceives 
it as a defense of his belief, and not an attack 
on it. (There is always some danger, which 
must be minimized for the present interaction 
effect to occur, that the refutational defense, 
since it mentions the counterarguments at the 
outset, may be misperceived by the subject as 
an attack on his belief.) The basic postulate of 
defense-by-avoidance underlying the present 
study would indicate that where the refuta- 
tional defense is likely to be initially misinter- 
preted by the subject as an attack on his 
belief, the subject should tend to avoid expos- 
ing himself to it (Janis, 1957). Hence, in such 
ambiguous situations the opposite direction of 
interaction would be predicted, namely, that 
the extrinsic pressure to expose oneself to the 
defensive material, in the requirement of overt 
participation, would be more important with 
the refutational than the supportive defense. 
Such interaction should be increasingly pro- 
nounced as the subject’s confidence in his 
belief declines (Brodbeck, 1956; Festinger, 
1957). 


SUMMARY 


It was postulated that people characteris- 
tically defend their beliefs (particularly cul- 
tural truisms) by avoiding exposure to con- 


WILLIAM J. McGuire AND DEMETRIOS PAPAGEORGIS 


tradictory information, with the result that in 
forced exposure situations these untried beliefs 
prove highly vulnerable to persuasion. Since 
the person is unmotivated to develop a defense 
of his belief to the extent that he considers it 
invulnerable, it was hypothesized that such 
beliefs are more effectively immunized against 
persuasion by pre-exposure to counterargu- 
ments (in a weakened form that stimulates, 
without overcoming, the receiver’s defenses) 
than by pre-exposure to arguments supporting 
the belief. Second, since the person is un- 
practiced in the defense of such beliefs, it was 
predicted that the immunizing pretreatments 
would lose effectiveness to the extent that they 
required the person to participate actively, 
without guidance, in the defense. Finally an 
interaction effect was predicted such that the 
detrimental effect of requiring active participa- 
tion is greater with the defense involving pre- 
exposure to the counterarguments than with 
the supportive defense. 

The subjects were 130 college freshmen who 
took part in two experimental sessions. In the 
first, they received immunizing treatments on 
various strongly held health beliefs. Type of 
defense (pre-exposure vs. supportive) and 
amount of participation in the defense were 
varied orthogonally. The subjects’ beliefs were 
then measured to ascertain the direct effects of 
the defensive treatments. In the second session, 
the subjects were exposed to strong counter- 
arguments on both immunized and nonim- 
munized beliefs and then the final beliefs were 
measured to ascertain the comparative im- 
munizing effectiveness of the defensive treat- 
ments. 

Both of the predicted main order effects were 
confirmed at the .001 level. Some support (p = 
.08) was also found for the predicted interac- 
tion effect. The defensive treatments, besides 
conferring immunity against subsequent coun- 
terarguments, also had an immediate direct 
strengthening effect on the beliefs (p = .01), 
but the amount of direct strengthening was not 
predictive of the latent resistance to counter- 
arguments conferred by these treatments. 
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AFFECTIVE TONE DURING ENVIRONMENTAL 
REGULARITY AND CHANGE! 
SALVATORE R. MADDI 
University of Chicago 


changed one. That the explanatory value of — 
McClelland’s position has not been unequivo- 
cally established is indicated by results such as 
those reported by Kenny (1955). His study of 
appreciation of joke endings as a function of 
their unexpectedness is only partially consist- 
ent with this position. 

McClelland et al. (1953) also state that — 
when expectancy and occurrence do not differ, 
there is no affective arousal and no behavior 
that can be characterized as approach or 
avoidance, unless the behavior is caused by 
some other affect-arousing property of the 
situation. However, completely expected situa- 
tions at least sometimes seem to lead to bore- 
dom, anger, or avoidance. For example, Kar- 
sten (1928) reported that the effects of con- 
tinued repetition of the same activity include 
an increase in emotional outbursts, the tend- 
ency to vary the task and concentrate on 
something else, and the attractiveness of other 
activities. It is proposed that both greatly 
unexpected and completely expected situations 
arouse negative affect, while moderately un- 
expected situations arouse positive affect. This 
Statement would apply most clearly in situa- 
tions in which rewards and punishments, as 
usually conceptualized, are at a minimum. 


A series of events is a group of stimuli that 
form, or are arranged in, an order according to 
some temporal principle. A repetitive series 
consists of events that are similar to, and have 
the same sequence as, those of a previous 
series. The occurrence of two or more succes- 
sive series that do not differ constitutes em- _ 
vironmental regularity. A nonrepetitive series is 
one that differs from the previous series in 
order of events, content, or both. The occur- 
rence of a nonrepetitive series following en- 
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vironmental regularity is called environmental 
change. The degree of change constituted by a 
nonrepetitive series refers to the difference 
between it and the repetitive series it follows. 
An expectancy is the cognitive representation 
of a series of events, which permits prediction 
of the order in which the events occur. 

In the experiment reported below, an ini- 
tially novel series of events was repeated a 
number of times (environmental regularity). 
As a series is repeated, its unexpectedness 
should decrease, The first hypothesis to be 
tested is that as a series is repeated, affective 
tone is initially negative, becomes positive, and 
then becomes negative again. After a number 
of series repetitions, a final series of events was 
presented, This final series was repetitive (no 
change) for some subjects, and nonrepetitive 
(environmental change) for other subjects. Of 
the latter subjects, some a rela- 
tively large degree of change, while others 
experienced a relatively small degree of change. 
The larger the degree of change present in the 
final series, the more unexpected it should be. 
The second hypothesis to be tested is that 
conditions of no change, and of relatively 
large degree of change are associated with 
negative affect, while a relatively small degree 
of change is associated with positive affect, 
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Environmental Change 


Following the first eight series, all subjects received a 
final series. Each group was subdivided into Subgroups 
A (8 subjects), B (16 subjects), C (16 subjects), and D 
(8 subjects). Each subgroup in a group received a dif- 
ferent version of the final series (treatment). Treatment 
O was a series consisting of a number, a number, and a 
stem, in that order. Subgroup A of each group received 
this treatment. It constituted no change (a continuation 
of regularity) for the subjects of the Prediction and No 
Prediction groups, because its events had the same 
order as, and were similar to, those of previous series. 
Treatment 1 consisted of either a number followed by a 
stem, or of three numbers and a stem in succession (i.e., 
number, number, number, stem). In each group, one- 
half of Subgroup B received the former series, while 
the other half received the latter series. Treatment 1 
constituted one degree of change for the subjects of the 
Prediction and No Prediction groups, because the stem 
was displaced either one position forward or backward 
from its usual position, Treatment 2 consisted of either a 
stem alone, or of four numbers and a stem in succession, 
In each group, one-half of Subgroup C received the 
former series, while the other half received the latter 
series. This treatment constituted two degrees of change 
for the subjects of the Prediction and the No Prediction 
groups, because the stem was displaced either two 
positions forward or backward from its usual position. 

In the Prediction and No Prediction groups, Sub- 
groups A, B, and C experienced, respectively, 0, 1, and 
2 degrees of change in order of events. Although the 
comparable subgroups of the No Regularity group 
received the same treatments during the final series, 
they did not experience environmental change as it has 
been defined because they had not previously experi- 
enced regularity. 

Treatment 3 consisted of two numbers and a stem in 
succession, but the stem was printed in much larger 
type than had been the case previously. Included for 

oratory purposes, this treatment was experienced 

Subgroup D of each group. While the subjects of the 

rediction and No Prediction group who received this 

reatment experienced no change in order of events, 
there was a change in size of one of the events. 


Stimuli 


The specific numbers presented to each subject were 
selected at random from those between 10 and 57, with 
the restriction that no subject should experience any 
number more than once. The stems utilized in the first 
eight series were “In winter,” “The stranger,” “When 
he heard,” “I feel,” “Americans,” “He thought,” “My 
memories,” and “When John met.” The order of presen- 
tation of the stems was determined by an 8 X 8 latin 
square with the property that each stem was followed 
by each other stem only once throughout the eight 
possible orders. Each subject in a group was assigned 
one of the eight orders in such a manner that every 
subject experiencing a particular final series had re- 
ceived a different order. The stem “Bill seemed” was 
utilized in the final series for all subjects. 

During the experiment, the experimenter sat behind 
the subjects and timed the exposure of stimuli with a 
stopwatch. The subjects of the No Prediction and No 
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Regularity groups were given the signal to turn the page 
after 30 seconds in the case of stems, and after 10 sec- 
onds in the case of numbers. The subjects of the 
Prediction group were given an additional 5 seconds per 
stimulus in which to predict. 


Minimization of Rewards and Punishments 


As discussed earlier, the effects of the unexpectedness 
of the series upon affective tone should be most apparent 
when the other affect-arousing properties are at a 
minimum. Potential rewards were minimized in the 
following manner: the experimenter did not converse 
with the subjects, nor did he mention such matters as 
the importance of either their participation or of the 
experiment itself. The $1.00 each subject received, while 
it was a reward, cannot be considered to have had dif- 
ferential effects upon the various aspects of the subject’s 
performance. Potential punishments were minimized by 
ensuring the subject’s anonymity, by putting his partic- 
ipation essentially on a voluntary basis, and by testing 
him as part of a group. In addition, the instructions were 
not particularly ego involving, and an attempt was 
made to select nonthreatening stems. Nonetheless, one 
source of reward and punishment was the possible 
significance of a particular stem for a particular subject. 
The control of this factor will be discussed later. 


Measure of Unexpectedness 


It has been assumed that a series becomes less un- 
expected as it is repeated, and that a nonrepetitive 
series is unexpected to the degree that it differs from 
the repetitive series it follows. In this experiment, in- 
accuracy of prediction can therefore be considered an 
index of unexpectedness. Such a measure can be ob- 
tained for the prediction group as a whole by computing 
the percentage of inaccuracy in predicting the events 
comprising a particular series. For any series, this 
percentage is based on the proportion of inaccurate 
predictions to the total number of predictions made by 
the group. 

No such measure was available for the No Predic- 
tions and No Regularity groups. However, a very indi- 
rect check on the assumptions was available, Following 
the experiment, all subjects were asked to indicate 
whether they had found it possible to anticipate when 
there would be a stem on the next page, and to state, if 
possible, how many numbers preceded the stem in 
each series but the last. 


Measure of Affective Tone 


The sentence completions were used as indicants of 
affective tone. This usage appears reasonable, as Bous- 
field (1950) has shown that pleasant and unpleasant 
moods determine the affective value of word associa- 
tions. It was assumed that the affect evidenced in 4 
given sentence completion would be a function of the 
significance of the particular stem for that subject and 
the degree of unexpectedness associated with the series 
of which the stem was a part. Investigation of the sec- 
ond factor alone was permitted by the counterbalancing 
of stem position, the assignment of stem orders to 
subjects at random, and the analysis of data by group 
and subgroup only. 


AFFECTIVE TONE 


Affective tone was considered to vary along a five- 
point scale, with 1, 2, 3, 4, and 5 representing, respec- 
tively, strongly negative affect, slightly negative affect, 
neutral affect, slightly positive affect, and strongly posi- 
tive affect. On the basis of pilot study data, four such 
bipolar, five-point dimensions of affectively-determined 
content—to be described—were isolated. Each sentence 
completion was scored for affect on the dimensions 
relevant to it, and the resulting scores were combined 
into a composite affect score for that completion. These 
composite affect scores were used in testing hypotheses. 

The composite affect score was assigned as follows: 
A completion to which none of the dimensions was 
relevant received a composite affect score of 3. If only 
one dimension applied to a completion, the completion 
was given the affect score obtained on that dimension, 
However, a completion could occasionally be scored 
toward both the positive and negative poles of the one 
dimension applying to it. If the intensity involved in 

~ both scores was the same (i.e., 1 and 5, or 2 and 4), the 
completion was given the composite affect score of 3. 
If the positive and negative scores did not agree in 
intensity (e.g., 1 and 4), the composite affect score was 
determined in the direction of the stronger of the two 
(in the example given, the final score would be 2). 
Occasionally, two dimensions were relevant to a par- 
ticular completion. These cases were treated, in effect, 
as if the completion had been scored toward both the 
positive and negative pole of one dimension. 

The dimension of expected outcome (negative pole is 
pessimism, and positive pole is optimism) was relevant 
whenever some future event or condition was referred 
to, or definitely implied in the completion, regardless of 
the unit of time involved. Completions scored on this 
dimension include “Z feel that the world will end in 
destruction” (affect score of 1) and “Americans will 
triumph over the forces of Russia” (affect score of 5). 
The dimension of evaluation (unfavorable evaluation- 
favorable evaluation) was relevant whenever the com- 
pletion indicated a point of view concerning a situation, 
event, or person. Completions scored on this dimension 
include, “He thought the experiment was nonsense” 
(affect score of 1), and “Bill seemed to be in favor of 
American foreign policy” (affect score of 5). The di- 
mension of interest (indifference-enthusiasm) was 
relevant whenever there was some indication of the 
subject’s involyement in the task, or in the subject 
matter of the completion. Completions scored on this 
dimension include, “J feel bored” (affect score of 1), 
and “Jn winter I am very active in work and play” 
(affect score of 5). The dimension of description (agree- 
able description-disagreeable description) was relevant 
whenever the completion described persons, conditions, 
or events, whether real or hypothetical. Completions 
scored on this dimension include “The stranger had a 
big knife” (affect score of 1), and “In winter the snow 
swirls through the streets covering everything with a 
blanket of lovely whiteness” (affect score of 5). The 
dimension of expressed feeling (unpleasant feeling- 
pleasant feeling) was relevant whenever the subject 
referred to his own feelings, or to those of others. 
Completions scored on this dimension include, “He 
thought the anxious, sinking feeling would never leave” 
(affect score of 1), and “7 feel happy” (affect score of 
5). In keeping with usual interpretation, expressive 
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disturbances, such as failure to complete the stem, 
erasures, and cross-outs, were considered indications of 
strongly unpleasant feelings,* 

The composite affect scores attributed to comple- 
tions by the experimenter were used to test hypotheses. 
While scoring a particular completion, the experimenter 
did not know the group or subgroup to which it be- 
longed, or (with the exception of completions involving 
the stem “Bill seemed,” which was only used during the 
fina] series) during which of the nine series it had been 
written. In order to check reliability, the completions 
were scored by another person who, although trained 
by the experimenter, had no knowledge of the design of 
the experiment or of the hypotheses being tested. 


RESULTS 
Interscorer Reliability 


The product-moment correlation between 
the composite affect scores attributed to the 
separate sentence completions by each of the 
scorers was .91, indicating that the scoring 
system is sufficiently objective to be employed 
effectively. 


Environmental Regularity 

The percentage of inaccurate predictions 
made by the prediction group in each of the 
first eight series in succession was 76, 47, 33, 
16, 16, 10, 10, and 6. The decrement in inac- 
curacy that occurred during this period of 
regularity is significant at the .001 level 
(x? = 62.88; df = 7). These results indicate 
that in this group the unexpectedness of the 
first series decreased as it was repeated. 

Comparable data are not available for the 
other two groups. However, an indirect check 
on the assumption that regularity reduces 
unexpectedness can be made. Forty-one of 48 
subjects in the prediction group correctly an- 
swered the question concerning how many 
numbers had immediately preceded each stem 
but the last (x? = 22.68; p < .001). In the No 
Prediction group, 37 of 48 subjects answered 
correctly (xè = 13.02; p < .001). These results 
suggest that the conditions of the experiment 
were sufficient to establish an expectancy for 
both of these groups. This question is not an 
adequate basis for comparison of the groups 
experiencing regularity and the No Regularity 
group, because the subjects of the latter group 
could not answer correctly by stating any 
single number. But the question concerning 
the ability to anticipate stems does provide 

3 For a more complete description of the scoring 
system, including examples, see Maddi (1959). 


342 


some basis for comparison. In the No Regu- 
larity group, 8 of 48 subjects felt that they 
could anticipate stems. In contrast, 42 of 
48 subjects in the Prediction group, and 38 of 
48 subjects in the No Prediction group felt 
that they could anticipate stems. The corre- 
sponding 2 X 2 contingency table yields a chi 
square value of 62.33 (p < .001), suggesting 
that conditions of the experiment were not 
sufficient to establish an expectancy in the 
No Regularity group. Taken together, these 
results suggest that in both the Prediction 
and No Prediction groups environmental regu- 
larity reduced the unexpectedness of the series 
that was repeated. 

An assumption implicit in the design of the 
experiment is that the period of regularity 
lasted long enough to reduce unexpectedness 
to a near-minimal value. As the inaccuracy of 
prediction on the eighth series was 6%, this 
assumption seems warranted for the Prediction 
group. There are no comparable data for the 
No Prediction group. However, there was 
only a small difference between these two 
groups in the number of subjects who could 
correctly answer the question concerning how 
many numbers had immediately preceded 
each stem but the last. The 2 X 2 contingency 
table comparing the Prediction and No Pre- 
diction groups with respect to correctness of 
prediction yields a chi square value of 0.62 
(p > .75), suggesting that the assumption is 
warranted for the No Prediction group as well. 

Figure 1 shows the relationship between 

ean affect scores and number of series ex- 

erienced. The affect curves provide a test of 
the first hypothesis, which states that as the 
series is repeated, affective tone is initially 
negative, becomes positive, and then be- 
comes negative again. The curves of both 
groups experiencing regularity are consistent 
with this hypothesis, while the curve of the 
No Regularity group is not. 

The procedure followed in determining 
whether the curvilinearity shown in the data 
of the groups experiencing regularity could 
have occurred by chance was essentially that 
of comparing the two highest points on each 
curve with the two points on either side of 
them.‘ For the Prediction group, the sum of 
each subject’s affect scores on the third and 


4 The author wishes to thank Frederick Mosteller for 
suggesting this procedure, 
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—— PREDICTION GROUP 
—--—NO PREDICTION GROUP 
—-— NO REGULARITY GROUP 


MEAN AFFECT SCORES 


SERIES 


Fic. 1, Mean affect scores as a function of number 
of series experienced, (These series constituted environ- 
mental regularity for the Prediction and No Prediction 
groups.) 


fourth series (called his Sum B) was compared 
with the sum of his scores during the first and 
second series (called his Sum A), and with the 
sum of his scores on the fifth and sixth series 
(called his Sum C). Each time Sum B was 
larger than were both Sum A and Sum C, the 
subject was given a comparison score of 2. 
Whenever Sum B was greater than one, but 
not both of the other sums, the subject 
was given a comparison score of 1, and when- 
ever Sum B was the smallest, a score of 0 
was recorded. If Sum B was tied for largest 
with one other sum, a score of 1.5 was given, 
and if Sum B was tied for smallest, a score of ” 
0.5 was recorded. The same procedure was 
followed for subjects of the No Prediction 
group, except that Sum B was computed from © 
the individual’s affect scores during the fourth 
and fifth series. Accordingly, Sum A involved 
the second and third series, and Sum C the 
sixth and seventh series. 2 
The mean comparison score-that would be 
expected by chance is 1.00, since an individ- 
ual’s affect curve could take any shape, and, 
hence, he was as likely to obtain one of the 
possible comparison scores as another. For the 
Prediction group, the mean and standard 
deviation of comparison scores were, respec- 
tively, 1.28 and 0.68. The No Prediction group 
mean was 1.26, with a standard deviation of 
0.86. The mean is significantly greater than 
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chance expectancy in the Prediction group 
(t = 2.80; df = 40; p < .01) and the No 
Prediction group (¢ = 2.17; df = 40; p < .05). 
These results indicate that in both groups, 
the curvilinearity could not very likely have 
occurred by chance alone. 

The peak of the Prediction group’s affect 
curve occurred in the third series, during 
which the group was inaccurate in 33% of its 
predictions. While inaccuracy of prediction 
decreased to 6% during the next five series, 
affective tone went from positive to negative. 
While inaccuracy of prediction decreased from 
76% to 33% during the first three series, 
affective tone went from negative to positive. 
The affect curve of the group can be under- 
stood in terms of the proposed effects of de- 
creasing levels of series unexpectedness upon 
affective tone. 


Environmental Change 


Treatments 0, 1, and 2 in the ninth series 
constituted 0, 1, and 2 degrees of change in 
order of events for the Prediction group. The 
percentages of inaccurate predictions made 
during Treatments 0, 1, and 2, were, 8, 42, and 
64, respectively. This trend, significant at the 
.001 level (x? = 16.40; df = 2), is consistent 
with the assumption that the series is more 
unexpected the larger the degree of change 
it constitutes. 

The most direct test of the second hypothe- 
‘sis involves the mean affect scores associated 
with Treatments 0, 1, and 2 in the final series. 
But in such an analysis each mean would be 
determined from the data of different sub- 
groups which had not been equated for level of 
affect prior to the final series. A more accurate 
estimate of the effects of the various final 
series treatments involves partialing out the 
effects of prior level of affect. This was done 
by subtracting each subject’s affect score on the 
eighth series from his score on the final series. 
When the result was a positive number, 
affective change toward the positive pole of 
the affective tone dimension had occurred; 
when the result was a negative number, 
negative affective change was inferred. 
. In order to establish the relevance of af- 

fective change scores, some implications of the 
second hypothesis will be made explicit. A 
precondition for the applicability of this 
hypothesis is that the period of regularity has 
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lasted long enough to reduce unexpectedness 
to a near-minimal value, Following such a 
period of regularity, affect should be in the 
negative range. The occurrence of another 
repetitive series (i.e., no change) should pro- 
duce negative change in affect. The occurrence 
of a moderate degree of change should produce 
positive affective change, while a large degree 
of change should produce negative affective 
change. 

Figure 2 shows the relationship between 
mean difference scores and Treatments 0, 1, 
and 2, which represent, respectively, 0, 1, and 
2 degrees of change in order of events for the 
Prediction and No Prediction groups. The 
difference between the means associated with 
0 and 1 degrees of change was significant in the 
Prediction group (¢ = 2.09; df = 22; p < .05), 
and in the No Prediction group (¢ = 2.09; 
df = 22; p < .05). The difference between the 
means associated with 1 and 2 degrees of 
change was not significant in the Prediction 
group (t = 1.44; df = 30; p > .05), or in the 


4.0 
—— PREDICTION GROUP 
——--NO PREDICTION GROUP. 
—"— NO REGULARITY GROUP 
2.0 


MEAN DIFFERENCE SCORES 


o 1 2 
TREATMENTS 


Fic. 2. Mean difference scores associated with 
Treatments 0, 1, and 2. (The treatments represent, 
respectively, 0, 1, and 2 degrees of change in order of 
events for the Prediction and No Prediction groups.) 
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No Prediction group (¢ = 0.33; df = 30; 
p > .50). Although the curves seem somewhat 
consistent with the second hypothesis, part of 
the data could have occurred by chance. 
Possibly, the predicted curve might have been 
demonstrated more conclusively if greater 
degrees of change had been included in the 
experimental design. The interpretation of the 
results as somewhat consistent with expecta- 
tion is perhaps strengthened by the lack of 
curvilinearity shown by the affective change 
curve of the No Regularity group. 

In the Prediction group, 1 degree of change 
was associated with positive affective change, 
and with inaccuracy of prediction of 42%. 
The treatment constituting 2 degrees of 
change was associated with inaccuracy of 64%, 
and with negative affective change. The no 
change treatment was associated with inac- 
curacy of 8%, and with negative affective 
change. The group’s affective change curve 
can be understood in terms of the proposed 
effects of increasing levels of series unexpected- 
ness upon affective tone. 

Treatment 3, which constituted a change in 
physical characteristics of the series, was pre- 
sented to the subjects of Subgroup D in each 
group. The mean difference scores for Sub- 
group D in the Prediction, No Prediction, and 
No Regularity groups were 1.38, 1.25, and 
—0.25, respectively. Each of these means was 
compared to the Subgroup A mean in its 
group. The resulting differences are significant 
in the case of the first mean (¢ = 2.26; df = 14; 
p < .05), and the second mean (t = 2.17; 
df = 14; p < .05), but not in the case the 
last mean (¢ = 1.67; df = 14; p > .10). 


Discussion 


The results obtained during environmental 
regularity are consistent with the proposition 
that both greatly unexpected and completely 
expected situations arouse negative affect, 
while moderately unexpected situations arouse 
positive affect. The results obtained during 
environmental change are less clearly in sup- 
port of this position. Additional investigation 
involving a wider range of change would be 
useful. 

An alternative explanation of the results of 
this experiment would involve the assumption 
that the task of completing sentences may 
have initially been experienced as threatening, 
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and therefore unpleasant. The individual may 
have feared that he would not function ade- 
quately, or that he would disclose embar- 
rassing aspects of his personality. Some experi- 
ence with the task may have led to a feeling of 
mastery, and even enjoyment. After complet- 
ing even more stems, the task may have be- 
come tedious, and no longer challenging. While 
such a position would account for the affect 
curves of the Prediction and No Prediction 
groups during regularity, it does not explain 
the positive affective change that was asso- 
ciated in these groups with 1 degree of change. 
Presumably, completing the final stem ‘was no 
more challenging than completing the stem 
before the last. In addition, if such a position 
were tenable, the No Regularity group’s affect 
curve during the first eight series should have 
been similar to the curves of the other two 
groups. 

It will be recognized that the correct and 
incorrect predictions made by the Prediction 
group can be considered mild rewards and 
punishments, respectively. During environ- 
mental regularity, the frequency of rewards 
increased, while that of punishments decreased. 
That the curve of affect did not remain in the 
positive range once having become positive, 
indicates the difficulty of explaining the results 
of this experiment in terms of reinforcement 
theory. Of course, the rewards and punish- 
ments were probably very mild. If accurate 
prediction had been more important to the 
subjects, the group curve might not have 
become negative again after so few repetitions 
of the series. 

Although it has been assumed that the 
effects of the unexpectedness of the situation 
are most clear when rewards and punishments, 
as usually conceptualized, are at a minimum, 
it need not be concluded that the unexpected- 
ness factor is trivial in determining affective 
tone. Results reported by Spector (1956) sug- 
gest that the positive affective tone produced 
by a rewarding event can be enhanced if the 
event is also relatively unexpected. Using a 
procedure in which small groups were given a 
task to perform, he found that among the 
subjects who received “promotions,” those 
who had perceived this rewarding event as low 
in probability responded with higher morale 
than did those who had perceived it as high in 
probability. The nature of the interaction 
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between the affect produced by the unexpected- 
ness of the situation and by its reward value 
deserves further study. 


SUMMARY 


This experiment concerned affective tone 
during environmental regularity and change. 
Two groups experienced environmental regu- 
larity, which consisted of eight presentations 
of an initially novel series of events. Following 
this, the various subgroups within each group 
experienced different versions of a final series 
of events. Each of the versions constituted a 
different degree of change in order of events. 
A control group received the same number of 
series as did the other two groups, but did not 
experience environmental regularity in that 
the order of events in each successive series 
was different from that of the preceding series. 

The last stimulus in each series was a sen- 
tence completion stem. As there were nine 
series in all, each subject perceived nine sen- 
tence stems. The subjects were instructed to 
complete the stems whenever they encoun- 
tered them in the course of the experiment. 
The completions were used as indicants of 
affective tone. 

The results indicated that as the series was 
repeated (environmental regularity), affec- 
tive tone was initially negative, became posi- 
tive, and then became negative again. The 
clearest finding concerning the effects of 
change in order of events was that the rela- 
tively moderate degree of change was asso- 
ciated with affective change in the positive 
direction. The results are somewhat consistent 


with the proposition that greatly unexpected 
and completely expected situations arouse 
negative affect, while moderately unexpected 
situations arouse positive affect. 
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THE EFFECTS OF FRUSTRATION AND ANXIETY 
i ON OVERT AGGRESSION: 


JACK E. HOKANSON 
Florida State University 


Te study is aimed at investigating some 
of the direct, physical resultants of 
experimentally aroused anger. For this 
purpose three major variables are considered: 
the degree of frustration, the retaliatory powers 
of the frustrator, and subject differences in 
apparent readiness to express aggression. Each 
of these variables have been dealt with by 
Dollard, Doob, Miller, Mowrer, and Sears 
(1939) in their original formulation of the 
frustration aggression hypotheses, and the 
empirical findings with respect to these factors 
have been recently reviewed by Berkowitz 
(1958). 

While most investigations support the 
hypotheses put forth in Frustration and Ag- 
gression, a general limitation of these studies 
involves the measuring devices employed. 
Typically, “overt aggression” is measured by 
means of questionnaire responses, ratings, or 
interviews concerning feelings of unfriendli- 
ness toward the frustrator, and a number of 
indices representing displaced aggression— 
eg. doll play and TAT content. Such ques- 
tionnaires and ratings are vulnerable to the 
subject’s conscious control over his responses 
in a socially desirable direction. Further, 
unfavorable ratings of the experimenter, or 
aggressive TAT protocols, may express merely 
a negative evaluation of the frustrator, not 
“aggression” in the strict sense required by 
the specification (Dollard et al., 1939) that the 
goal response of the instigation to aggression 
involves inflicting injury to an organism. 

The present research attempts to overcome 
these difficulties by using the application of 
direct, physical pain to the frustrator (elec- 
tric shocks) as measures of aggression. Fur- 
thermore, through the use of a specially 
designed apparatus, the physical characteris- 
tics of a subject’s aggressive behavior can be 
observed along three major dimensions— 
number of aggressive acts, average duration of 
aggressive act, and average intensity of ag- 


1 This study was conducted as part of the research 
program sponsored by Grant M-1540, National Insti- 
tute of Mental Health, United States Public Health 
Service, Leonard Berkowitz, principal investigator. 


gressive act. This type of approach should 
provide more direct tests of frustration aggres- 
sion hypotheses as they apply to overt, physi- 
cal hostility, and permit a more detailed 
analysis of the physical dimensions of aggres- 
sive behavior. 


METHOD 


The aim of this study was to establish, in a labora- 
tory setting, some of the conditions affecting the 
expression of overt, physical aggression. Subjects were 
presented with the following series of events: They were 
led to believe that this was an experiment on the 
physiological concomitants of intellectual performance;? 
informed that if they did not fully cooperate, or dis- 
pleased the experimenter in any other way, the experi- 
menter could punish them by administering an electric 
shock (retaliatory anxiety manipulation) ; degraded and 
insulted by the experimenter for a presumably poor 
performance on an intellectual task (frustration ma- 
nipulation); given an opportunity to hurt the experi- 
menter physically by giving him electric shocks. 

The manipulations of anxiety and frustration have 
control conditions in which the subjects were not so 
treated, and in the overall design, these two variables 
are orthogonal to each other. A third, subject variable is 
orthogonal to these. Subjects were divided into two 
groups depending on whether they scored low or high on 
a number of previously administered psychometric 
measures of hostility (High or Low Test Hostility). 
Thus there are two levels of each of the three variables 
arranged in a factorial design: High or Low Test Hostil- 
ity subjects, high or low retaliatory anxiety manipula- 
tion, and high or low frustration. 


Subjects 


Eighty white male undergraduate students at the 
University of Wisconsin selected from a pool of 375 
males taking an introductory or intermediate course in 
psychology served as subjects, Subjects were chosen on 
the basis of their scores on three psychometric instru- 
ments: Siegal’s (1956) Manifest Hostility Scale (MHS), 
three TAT protocols (SBM, 3BM, 2) scored for hostile 
content, and first impression ratings of two photographs 
in terms of the degree of friendliness the subject felt 
toward the persons portrayed. Subjects who con- 
sistently scored above the median on each test distri- 
bution (High Test Hostility subjects) and subjects who 
consistently scored below the median on each distribu- 
tion (Low Test Hostility subjects) were considered 
eligible to participate. These subjects were subsequently 


2This paper is part of a larger research project 
involving physiological responses to anger and fear 
provocation, the details of which are reported elsewhere 
(Hokanson, 1959). 
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contacted by phone, with approximately 85% of those 
contacted agreeing to participate in the research. The 
selection procedure thus provided 40 Low and 40 High 
Test Hostility subjects. Details of the selection pro- 
cedure are available elsewhere (Hokanson, 1959), 


Procedure 


The experimental session, an individual meeting 
lasting approximately 20 minutes, was divided into a 
number of phases. The experimenter introduced the 
experiment as one involving measurement of the phys- 
iological concomitants of routine intellectual work. 
‘The subject was then placed in the blood pressure and 
GSR recording apparatus and allowed to rest for 8 
minutes. After the adaptation period the subjects who 
had been randomly assigned to the high anxiety condi- 
tion were told that on subsequent tasks it was highly 
important that they cooperate fully and devote maxi- 
mum energy to their performance, and that if the 
experimenter felt they were not doing this he could 
administer an electric shock to them, (Actually this 
was never done, but merely served as a threat of physi- 
cal pain to the subject.) Subjects in the low retaliatory 
anxiety condition were also told that cooperation and 
maximum energy were important, but that they would 
probably do better if they just relaxed and did the best 
they could, 

‘The first “intellectual task” of this session was then 
introduced, The subject was asked to count backwards 
from 100 to 1 by threes as quickly as possible, Subjects 
assigned to the high frustration condition were then 
exposed to repeated interruptions, insults, and sarcastic 
remarks by the experimenter, In the low frustration 
condition the experimenter omitted these remarks and 
said “Good” when the subject reached zero, The 
task took approximately 1 minute. 

Ratings of subjective feelings of anxiety and anger 
were obtained at several points in the procedure in 
order to check the success of the experimental manipula- 
tions, Each scale consisted of three items to which the 
subject was to respond by placing a check mark at an 
appropriate point along a 160 centimeter line. Only the 
end points of each line were anchored with a verbal 
description of an extreme subjective feeling. A typical 
item on the anxiety questionnaire was “How keyed up 
do you feel at present?” The hostility items asked such 
things as “What were your general feelings toward the 
experimenter during the experiment?” The subject's 
score consisted of the sum of the distances his check 
marks were from the positive end of each line for the 
three items. 

In evaluating the success of the retaliatory anxiety 
manipulation, the postmanipulation anxiety rating was 
subtracted from the score obtained during the initial 
adaptation period. Similarly, in evaluating the success 
of the frustration manipulation, the postfrustration 
aggression rating* was subtracted from the same rating 


* This rating was postponed until after the subject 
had an opportunity to aggress physically against the 
experimenter. While it would have been preferable to 
have the subject rate his feelings immediately after 
frustration, focusing the subject's feelings at that time, 
and having these feelings expressed to the experimenter 
indirectly via the rating, might have contaminated the 
subsequent behavioral measures of aggression. 


347 


obtained prior to frustration. The analyses of these 
data, presented in an earlier report (Hokanson, 1959), 
indicate that both the retaliatory anxiety and the 
frustration manipulations had their desired effects, 

Measurement of overt aggression, The method for 
measuring overt, physical aggression was one originally 
suggested by Hokanson and Gordon (1958) in which 
the subject had the option of administering electric 
shocks to the experimenter, Following the manipula- 
tion of frustration, the experimenter introduced the 
situation with the following instructions: 

The next part of this session is something quite 
different. As you are probably well aware, one of the 
biggest sources of error in the interpretation of psy- 
chological tests is the interpreter himself, For in- 
stance, if I were to interpret your tests [taken some 
3 weeks earlier as part of the screening battery] such 
factors as being in a bad mood or feeling physically 
ill would certainly affect my interpretation. This is 
the problem I am attempting to investigate in this 
part of the session. I am going to read out loud some 
items from a personality questionnaire and try to 
predict how I think you would answer each item, 
If my prediction is wrong I want you to give me an 
electric shock through this apparatus, If I am cor- 
rect in my prediction, simply don’t do anything and 
I will go on to the next item. In this way I will feel 
physical discomfort when my predictions are wrong 
and I would like to see how this affects my subsequent 
predictions, 

The questionnaire was one developed to measure 
attitudes toward dating at the University of Wisconsin. 
The items are rather straightforward, innocuous 
statements that appear to keep personal involvement 
at a minimum, Thus either to agree or to disagree with 
a particular item should make relatively little difference 
to the subject. His decision to shock or not shock 
should therefore be determined mostly by his readiness 
to express hostility overtly, The number of times the 
subject disagreed with the experimenter’s prediction 
(i.e, number of times the subject shocked the ex- 
perimenter) was thus used as one measure of overt 
hostile expression. 

The questionnaire consisted of 15 items on which at 
least 80% of the standardization groups of University 
of Wisconsin male students gave the same response. 
Each “prediction” was made in a direction opposite 
from that which the normative population indicated to 
be the prevailing attitude, In this way, each subject was 
faced with 15 predictions about himself with which he 
most likely disagreed. Situated in front of the subject 
was an elaborate “shock apparatus” with electrodes 
connected to the experimenter’s hand, 

The subject was thus given 15 opportunities to 
administer electric shocks to the experimenter by 
depressing a plunger situated on top of the apparatus, 
(The experimenter never actually received a shock, but 
merely behaved as if he had.) The subject’s responses 
with respect to the plunger were recorded along three 
major behavioral dimensions: the number of shocks 
administered, the average duration per shock, and the 
average maximum pressure exerted per shock, 

The “shock” apparatus was constructed especially 
for this research in order to provide these measures, 
(The plunger that presumably activated the shock was 
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actually connected directly to a spring scale with a 
maximum reading of 6 pounds, When the subject de- 
pressed the plunger a direct reading from the large 
face of the scale (visible only to the experimenter) was 
made and recorded. In addition, the base of the plunger 
consisted of a small metal contact that acted as one 
pole of a switch in an electric timer circuit, Depression 
of the plunger with one fourth pound of pressure closed 
the circuit, which remained closed until the plunger 
returned to its resting position. A cumulative record of 
the time that the subject was actually shocking the 
experimenter was thus produced. 

As part of a larger battery of physiological measures, 
systolic blood pressure was recorded at several points 
during the procedure: at the end of the initial adapta- 
tion period, after the anxiety manipulation, after 
frustration, during the expression of aggression to the 
experimenter, and during a final rest period 1 minute 
after aggression towards the experimenter. By com- 
paring a particular blood pressure level with the 
immediately preceding level, systolic changes at each 
of the experimental steps could be calculated. These 
measures are included in the present report as informa- 
tive correlates of the behavioral measures of aggression, 


RESULTS 


Table 1 presents an analysis of variance of 
the number of shocks administered to the 
experimenter. The significant main effect for 
frustration indicates that the subjects who were 
in the high frustration condition administered 
a greater number of shocks to the experimenter 
than did the subjects in the low frustration 
condition. Subjects under high frustration 
administered an average of 9.25 shocks while 
those under low frustration shocked the experi- 
menter an average of 7.87 times. Also of note 
in this analysis is the absence of significant 
differences between the subjects in the high 
and low retaliatory anxiety conditions and 
between the subjects High and Low in Test 
Hostility. 

Table 2 presents an analysis of variance of 
the scores for average duration per shock 
(subjected to square root transformation). 
None of the terms in this analysis is significant. 

The analysis of variance of the scores for 
average pressure per shock is shown in Table 
3. Note the significant Frustration X Anxiety 
interaction. The cell means and the results of a 
Duncan range test, presented at the bottom of 
Table 3, indicate that the mean of the Low 
Frustration-Low Anxiety condition (2.78) is 
just short of being significantly lower than the 
means in the Low Frustration-High Anxiety 
(3.78) and High Frustration-Low Anxiety 
(3.80) conditions (p = .06). Thus there is 
suggestive evidence that under conditions of 


_experimental condition because of the small 
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TABLE 1 
NUMBER OF SHOCKS ADMINISTERED TO 
THE EXPERIMENTER 


Source df | ss MS F 

Test Hostility 1 0.3 | 0.3 
Frustration 1| 37.8 | 37.8 | 6.30* 
Anxiety 1 4.5/ 4.5 
Test Hostility X Frus- | 1| 10.5 | 10.5 

tration 
Test Hostility X Anx- | 1 = al OE 

iety 
Frustration X Anxiety 1| 20.0 | 20.0 
Test Hostility X Frus- | 1 L220) 22 

tration X Anxiety 
Residual 72 | 432.3 | 6.0 

Total 79 | 515.7 

i 
*p <05. 
TABLE 2 


AVERAGE DURATION PER SHOCK ADMINISTERED TO 
THE EXPERIMENTER (in seconds) 


Source df SS MS 
ne SS ES Nt eet 
Test Hostility 1 .59 59 
Frustration 1 -06 06 
Anxiety 1 01 01 
Test Hostility X Frustra- 1 -70 70 

tion 
Test Hostility X Anxiety 1 15 15 
Frustration X Anxiety 1 -67 67 
Test Hostility X Frustra- 1 31 31 
tion X Anxiety 
Residual eye GRGUR C 27 
Total 79 21.94 


Note.—No significant F was obtained. 


low retaliatory anxiety, high frustration leads ‘ 
to increased pressure per shock, while under 
low frustrating conditions, being subject to 
high retaliatory anxiety also leads to increased 
pressure per shock. However, under both high 
frustration and high retaliatory anxiety, the ” 
pressure per shock is not significantly differ- 
ent from that occurring under Low Frustra- 
tion-Low Anxiety. 

Table 4 presents a summary of the Pearson 
product-moment correlations obtained between 
the three behavioral measures of aggression 
and the systolic blood pressure and subjective 
feeling changes of the 40 subjects in the high 
frustration condition. Individual correlations 
were not routinely computed for each separate 


number of cases available. However, where a 
significant 7 was obtained for all 40 subjects, 
individual cell scattergrams were plotted and 
inspected for possible differences among con- 


TABLE 3 
AVERAGE PRESSURE PER SHOCK ADMINISTERED TO 
THE EXPERIMENTER (in pounds) 
Source af SS MS F 

Test Hostility 1 61 61 
Frustration 1 DSL| 1.51 
Anxie 1 1.30 | 1.30 
Test Hostility X Frus- | 1 -13 13 

tration 
Test Hostility X Anx- | 1 17 -17 

iet; 
Frustration X Anxiety | 1 | 11.12 | 11.12 |9.11** 
Test Hostility X Frus- | 1 1.05 | 1.05 

tration X Anxiety 
Residual 72 | -87.51 | 1.22 

Total 79 | 104.00 


Cell Means in the Frustration X Anxiety Interaction 


Low Anxiety | High Anxiety 
Low Frustration 2.78 3.78 
High Frustration 3.80 3.31 


Note.—There are no significant differences among the cell 
means by Duncan range test. 
"*d< 01. 


TABLE 4 
PEARSON Propuct-MoMENT CORRELATIONS OF 
AGGRESSION MEASURES WITH SysTOLIC BLOOD 
PRESSURE AND SUBJECTIVE FEELING 
CHANGES 
(N = 40 subjects in high frustration condition) 


Change 
Change during | Shock | Gut nel 
Frustration the Ex- Final 
Measure lie 
AnECE Isystolic|Systolic|Systolic 
Number of Shocks —.23 |—.02 |—.22 |—.13 
Duration per Shock —.03 |—.15 |+.10 |—. 19. 
Pressure per Shock — .32*|—.38*|-+ .39*|— 42 
* p< .05, 


ditions. Note that the four significant correla- 
tions all occur with the scores for average 
pressure per shock, previously thought to be a 
reliable indicator of a presumed instigation to 
direct, overt aggression. 

The r of —.32 between pressure per shock 
), and changes in subjective ratings of anger 
| appears to be somewhat paradoxical. It will be 
recalled however, that the postfrustration 
anger rating was actually obtained after the 
“Shock the experimenter” portion of the pro- 
cedure, Thus this negative y indicates that the 
| greater the vigor of the subjects’ responses 
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while aggressing against the experimenter, 
the less they subsequently report having felt 
angry at the experimenter. Inspection of the 
four individual condition scattergrams re- 
vealed no major differences in this trend among 
the cells. 

The r of —.38 between pressure per shock 
and systolic changes during frustration re- 
quires further elaboration. Because of the 
obvious differences in the scatterplots of each 
condition, individual correlations based on an 
N of 10 in each cell were computed. The cor- 
relations for Low and High Test Hostility 
subjects under low retaliatory anxiety condi- 
tions were +-.15 and +-.03, respectively, while 
under high anxiety conditions they were —.71 
and —.77, respectively. These latter two sig- 
nificant correlations seem to indicate that 
when there is an objective possibility of 
physical punishment, the greater the increase 
in systolic blood pressure (very likely a relia- 
ble indicator of anxiety) during frustration, 
the less the vigor of the subjects’ subsequent 
aggressive behavior. 

The positive correlation of .39 found between 
pressure per shock and systolic increases during 
“Shock the experimenter” indicates that the 
greater the vigor of aggressive responses, the 
greater the increase in systolic blood pressure 
during the expression of overt aggression. By 
the same token, the —.42 correlation between 
pressure per shock and elevation of systolic 
blood pressure at the final rest period suggests 
that the more vigorous the overt behavior 
while shocking the experimenter, the greater 
the reduction in systolic pressure after the 
aggressive responses are completed. 


Discussion 

The present study focuses upon three di- 
mensions of direct, physical aggression: the 
number of aggressive acts, the duration of such 
acts, and the intensity of physical exertion 
during these acts. Apparently, these three 
aspects are not equally sensitive as indicators 
of the frustration produced instigation to 
overt hostility, assuming the validity of the 
classical analysis by Dollard et al. (1939), 
While number of aggressive acts did reliably 
increase with frustration, judging from the 
number of significant relationships with each 
measure, the vigor with which the hostile act 
is carried out seems to be at least an equally 
sensitive index of this instigation. Frustration 
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aggression theorists maintain that overt hos- 
tility is inhibited by threat of punishment for 
aggression, but that as itself is frustrating, the 
blocking of the hostile behavior only further 
increasing the strength of the instigation to 
aggression. Consistent with this, there was the 
strongest pressure per shock (i.e., the most 
vigorous direct hostility) in the High Frustra- 
tion-Low Anxiety and Low Frustration-High 
Anxiety conditions (Table 3). The similar 
results under these two treatments suggest that 
retaliatory anxiety did indeed serve to heighten 
the instigation to overt aggression. 

However, if both anticipation of punishment 
and the experimental frustration increase this 
instigation, the question arises as to why, 
under a combination of both high frustration 
and high retaliatory anxiety, there was not an 
even greater pressure exerted per response, 
rather than the reduced pressure actually 
found. An application of Miller’s “‘approach- 
avoidance” conflict model suggests that the 
subjects in the High Frustration-High Anxiety 
condition may have been so strongly angered 
that they recognized the danger of the experi- 
menter’s retaliation, and therefore inhibited 
their hostile acts. In other words, the strength 
of the instigation to aggression had to exceed 
a certain minimum before the inhibition could 
take place. This minimum level may not have 
been passed in the High Frustration-Low 
Anxiety or the Low Frustration-High Anxiety 
conditions and the subjects more readily ex- 
pressed their hostility towards the experi- 
menter. 

There are several studies in the literature 
offering evidence that high levels of aggressive 
drive can be anxiety arousing. Schachter (1957) 
found a trend in his data indicating that the 
subjects rated as being moderately angry after 
frustration manifested a nor-epinephrine-like 
physiological response, whereas the subjects 
rated as being either mildly or intensely angry, 
showed an anxiety type of physiological reac- 
tion (epinephrine-like). Along similar lines, 
Hokanson (1959) demonstrated that the sub- 
jects who admit to strong feelings of anger on 
personality tests manifest more of an anxiety 
type of physiological response following frustra- 
tion than do the subjects who get low aggres- 
sion scores on the tests. 

The study provides further evidence in 
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support of the contention that anxiety results 
in reduced manifestations of overt hostility 
There is ample evidence suggesting that 
crease in systolic blood pressure is a reflectio 
of anxiety (Ax, 1953; Schachter, 1957); thus 
the significant negative correlation betw: 
systolic increases during frustration and sı 
sequent pressure exerted per shock in 
high anxiety condition indicates that 
greater the physiological manifestations 
anxiety during an instigation to aggression, 
less the vigor of postfrustration hostile ac 
Another line of indirect evidence pertaining to _ 
the reduction in aggressive tendencies, pre- 
sumably because of the arousal of anxiety, 
comes from the significant negative correla- 
tion between pressure exerted per shock and 
ratings of anger felt during frustration. It will 
be recalled that this anger rating was obtain 
after that subject had an opportunity to a 
minister shocks to the experimenter. 
negative relationship thus indicates that 
greater the vigor of the subject’s respon: 
while he was aggressing against the frustrato 
the less his subsequent rating of how angry 
he was during frustration, as if the previous 
anger were now being denied. This findin 
conforms to Berkowitz’ (1958) contention th 
high intensity of aggressive behavior 
provoke guilt and/or aggression-anxiety whit 
in turn tends to reduce subsequent man 
tations of aggression. í, 
Of added interest are the correlations be- 
tween pressure per shock and systolic changi 
occurring during and after the expression 
aggression towards the experimenter. The 
pattern of changes indicates that the vigor 
response is positively correlated with in 
in systolic blood pressure during the a 
expression of aggression, but negatively co 
related with elevation of blood pressure wh 
the aggressive behavior is completed. These 
findings suggest that while overt aggression 
may momentarily increase certain physio- — 
logical activities, it is also associated with a 
later reduction in those activity levels. Worchel — 
(1957) suggested a similar cathartic-like redu 
tion in “tension” following the expression of - 
aggression in his study of frustration produced | 
interferences with intellectual performance. 
The present findings are only suggestive, 


however, owing to the possible operation of 
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sequence effects and homeostatic controls on 
the course of blood pressure during the latter 
part of the experiment. 

The overall results of this study suggest that 
both number of aggressive acts and the vigor 
with which aggressive behavior is carried out 
may be reliable indicators of hostile motivation. 
While neither measure provided entirely con- 
sistent data, their potential usefulness in 
studying the relationship between physiologi- 
cal and behavior processes in the area of 
aggression is indicated. 


SUMMARY 


This study investigates some of the be- 
havioral concomitants of experimentally 
aroused anger. An orthogonal design, with 80 
undergraduate males, varied test hostility, 
level of experimental frustration, and level of 
retaliatory anxiety, the latter variable being 
manipulated by leading half of the subjects 
to believe that the experimenter had the poten- 
tial for inflicting physical pain to them via 
electric shocks if they did not cooperate fully 
during the experiment. Following these experi- 
mental manipulations the subjects were given 
a number of socially sanctioned opportunities 
to aggress against the experimenter by ad- 
ministering electric shocks to him. The sub- 
jects’ behavior with respect to the plunger 
activating the shock was analyzed along three 
dimensions: the number of shocks adminis- 
tered, the average duration per shock, and 
the mean pressure exerted while activating 
the shock. 

The major findings in this study are: 

1. Subjects who were frustrated administer 
significantly more shocks to the frustrator 
(the experimenter) than the subjects in the low 
frustration condition. 
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2. Subjects exposed to either high frustra- 
tion-low retaliatory anxiety or high retaliatory 
anxiety-low frustration manifest a strong trend 
toward greater vigor of aggressive responses 
than the subjects under low frustration-low 
retaliatory anxiety conditions. 

3. Among frustrated subjects under high 
retaliatory anxiety conditions there is a signifi- 
cant negative correlation between the pre- 
sumed degree of anxiety aroused during frus- 
tration (systolic blood pressure increases) and 
the vigor of subsequent aggressive behavior. 

4. Among frustrated subjects there is also a 
significant negative correlation between the 
vigor of aggressive responses and subsequent 
ratings of the degree of anger felt toward the 
experimenter during frustration, 
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THE PERFORMANCE OF PSYCHOPATHIC INDIVIDUALS UNDER 
CONDITIONS OF POSITIVE AND NEGATIVE 
PARTIAL REINFORCEMENT? 


DONALD H. PAINTING? 
United States Public Health Service Hospital, Lexington, Kentucky 


HILE there is general agreement 

among clinical observers regarding 

the distinctive characteristics of 
psychopathic behavior, disagreement exists 
concerning the variables which control this 
behavior. One group (Cleckley, 1955; Hender- 
son, 1947; Karpman, 1948) interprets the poor 
judgment and need for immediate gratifica- 
tion of the psychopath as manifestations of an 
insufficient internalization of the mores of 
society. Thus, the probability of responses 
in violation of legal and moral codes is higher 
as these responses are not followed by anxiety 
and guilt. A second group (Aichhorn, 1935; 
Lindner, 1944, 1946; Pennington, 1954) takes 
an opposing view. The seeming inability of 
the psychopath to profit from experience 
(differential rewards and punishments) springs 
from overly severe standards of self-conduct. 
According to this latter conception, asocial 
behavior serves to secure punishment and, 
hence, to mitigate guilt. Some attempt has 
been made to reconcile this conflict (Karpman, 
1948; McCord & McCord, 1956) by the sugges- 
tion that two subdivisions exist within psycho- 
pathy, the primary or “‘ideopathic” type and 
the secondary or “symptomatic” type, where 
the basic personality is neurotic in nature. 

Unfortunately, relatively little experimental 
evidence is available concerning these issues. 
Fairweather (1954), using a rote serial learning 
method, has demonstrated that psychopathic 
criminals do not perform in an inferior fashion 
under appropriate motivating conditions. 
Doubt is therefore cast on the assertion that 
psychopathy is invariably accompanied by a 
basic deficiency in learning capacity. Experi- 
mental support for the poor judgment ob- 

1 This study is part of a dissertation submitted in 
partial fulfillment of the requirements for the degree of 
Doctor of Philosophy to the Graduate School of the 
University of Kentucky. 

Appreciation is gratefully expressed to John W. 
Donahoe for his critical suggestions and helpful assist- 
ance in this project. 

2 Now at Veterans Administration Hospital, Coates- 
ville, Pennsylvania. 


served clinically in the psychopath has been 
obtained, however. A motor skill task was em- 
ployed by Lonstein (1952) in a study using 
level of aspiration as the dependent variable. 
The goal setting behavior of the psychopath 
was found to be unrealistic and largely un- 
modified by success or failure in the task. 

The present investigation was a further 
effort to explore the validity of clinical ob- 
servation in a more controlled environment. 
A two-choice partial reinforcement procedure 
was used, both because of the greater similar- 
ity to naturally occurring reinforcement 
schedules (Ferster & Skinner, 1957) and the 
comment by many writers concerning the ab- 
normal risk taking behavior of psychopathic 
individuals (Noyes, 1955). Negative as well 
as positive reinforcement was used to deter- 
mine whether the behavior of psychopathic 
individuals was differentially affected by a 
correct response’s being followed by the at- 
tainment of a reinforcing stimulus or the 
avoidance of an aversive one. Three proba- 
bility learning sequences, in which a correct 
response on trial (n) was determined by a 
preceding stimulus of successively greater 
distance from trial (m), were employed. This 
served to provide a test of the contention that 
the psychopath is less capable of perceiving 
the relationships between past events and the 
consequences of present actions. 


METHOD 
Subjects 


On the basis of internalization ratios (IR) (Welsh, 
1952) derived from the MMPI (Hathaway & McKin- 
ley, 1951), two experimental groups of 36 subjects each 
were selected from among Negro male postnarcotic 
drug addicts hospitalized at the United States Public 
Health Service Hospital, Lexington, Kentucky. The 
IR was designed to indicate the direction in which 
anxiety is expressed, either toward the self, IR > 1.00, 
or toward the environment, IR < 1.00. One group, the 
primary psychopaths (PP), obtained a mean IR of 
0.79, while the other group, the neurotic psychopaths 
(NP), received a mean IR of 1.02. By this device, two 
psychopathic groups (both had MMPI Pd scores 
greater than 70) were formed which appeared to cor- 
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TABLE 1 
Meran T Scores on MMPI Scares 
L F K Hs D Hy På uf Pa Pt Se Ma 
Normal 46 53 53 52 53 53 60 53 47 56 59 65 
NP 50 58 53 65 77 64 79 63 59 69 63 63 
PP: 53 53 57 49 56 56 74 59 50 50 53 65 


respond to the suggestion made by Karpman (1948) 
and McCord and McCord (1956) regarding the subdi- 
visions within psychopathy. The complete MMPI scores 
for all groups are presented in Table 1. The control (C) 
group, consisting of 36 Negro male college students, had 
a mean IR of 0.90. The mean of C differed by one 
standard deviation from the means of both psychopathic 
groups, indicating a minimum of overlap. Shipley-Hart- 
ford vocabulary scores (Shipley, 1940) were obtained 
from all subjects to insure that differences in treatment 
effects did not arise from original differences in intel- 
lectual ability, The mean vocabulary scores were 26.08, 
26.53, and 26.75 for PP, NP, and C, respectively. In 
no comparison was there a statistically significant 
difference. 


Apparatus 


The apparatus used in the experiment was similar to 
that employed by Grant, Hake, and Hornseth (1951). 
Two white lights were located to the left and right of 
the center of a horizontal panel. The subject sat facing 
the lights, while the experimenter sat behind the panel, 
inaccessible to the subject’s vision. A small red signal 
light located between the white lights served to alert 
the subject before each trial. Two push-button switches 
were mounted on the base of the panel, one below each 
of the white lights. A hinged door and a shelf were 
placed in the center of the panel below the signal light. 


Procedure 


In order to determine which response alternative, 
right (R) or left (L), was correct on a given trial, three 
sequences were constructed according to a method 
described by Smith (1955). In Sequence I, the correct 
alternative on trial () was independent of all pre- 
ceding trials. In Sequence II, an alternative repeated 
itself with p = .25 and changed to the other alternative 
with p = .75. Thus, trial (n) was dependent on trial 
(n-1). For Sequence II, trial (n) was independent of 
trial (n-1) but dependent on trial (n-2). That is, if R 
was correct on trial (n-2), R was correct on trial (m) with 
p = .25. In all three sequences, the probability of 
occurrence of both alternatives was .50 in a series of 
200 trials. 

The three groups (PP, NP, and C) were partitioned 
into three equal subgroups, each of which received one 
of the sequence treatments. For one-half of each sub- 
group, 5 correct responses resulted in a reward of 2 
cigarettes. For the other half, 5 incorrect responses 
resulted in the loss of 2 cigarettes from an original 80 
given to the subject at the beginning of the experiment 
by the experimenter. Poker chips, which could be ex- 
changed for cigarettes at the end of the experiment, 
were used as the medium of exchange. The chips were 


given to or taken from the subject through the hinged 
door in the apparatus. All subjects, after having been 
told the manner in which cigarettes could be won or 
lost, were given 200 trials on the appropriate sequence. 
The response on each trial was recorded. 


RESULTS 


Sequence I 


Performance under Sequence I, in which 
the conditional probabilities approximated the 
overall probability of an alternative, served 
both as a baseline against which Sequences II 
and III might be compared and as an index of 
behavior under conditions which precluded a 
strategy insuring greater than 50% reinforce- 
ment. The number of instances in which the 
subject selected a response opposite to the 
response which was correct on the preceding 
trial served as the dependent variable, The 
median test was used throughout to evaluate 
the data. No differences were found between 
groups under positive reinforcement; however, 
in the condition in which a correct response 
served to avoid the loss of a cigarette, signifi- 
cant differences between groups emerged 
(p < .02). Whereas PP made fewer alterna- 
tions than either NP or C (p < .01 in both 
cases), NP and C did not differ significantly 
from one another (see Table 2). 


Sequence II 

Optimum performance on Sequence II re- 
quired that the subject learn to choose that 
response which was incorrect on the preceding 
trial. Under avoidance conditions, all groups 
approximated the proportion of actual alterna- 
tion (.75) in the last 100 trials (see Table 2). 
Under the reward condition, group differences 
were observed (p < .05). Again, NP and C 
did not differ significantly, while PP alter- 
nated more frequently than did NP (p < .01) 
and C (p < .05). Thus, PP alternated in 
excess of the actual conditional probability, a 
strategy superior to that adopted by either 
of the other groups. 
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TABLE 2 
PROPORTION OF ALTERNATED RESPONSES DURING THE 
FINAL 100 TRIALS OF THE SEQUENCE 


Avoidance 


PP | NP c PP | NP c 


Sequence I +50 | .38 | .42 | .33 | .49 | .47 
Sequence IL -80 | .69 | .74 | .75 | .72 | .75 
Sequence III -67 | .73 | .76 | .64 | .67 | .73 


TABLE 3 
Cur SQUARE VALUES OBTAINED FROM THE MEDIAN 
Test COMPARING PERFORMANCE ON SEQUENCE 
II anp SEQUENCE IIT 


Reward Avoidance 
chi square $ chi square $ 
NP 2.23 <.20 > 2,27" "<.20 
PP 8.82 <.01 4.74 <.05 
Control 3.4 | <.1 0.9 <.50 
Sequence ITI 


In Sequence II, adequate performance re- 
quired the subject to alternate his responses 
on trial (n) in accordance with the correct 
response on trial (n-2). Group differences 
verged on significance under avoidance (p < 
-10) and attained significance under reward 
(p < .02). The difference between the extreme 
groups, PP and C, was significant (p < .01). 
Here, PP alternated Jess than did C. When 
performance on Sequences II and III were 
compared, only PP showed a significant de- 
crease in the frequency of alternation, the per- 
formance of NP and C remaining on the same 
level in the more difficult sequence (see Table 
3). PP alternated less often in Sequence III 
under both reward (p < .01) and avoidance 
(p < .05). In short, PP performance deterio- 
rated generally under conditions necessitating 
a temporally remote determination of re- 
sponse. 


Discussion 


The data clearly demonstrate that psycho- 
pathy, when defined solely in terms of a 
deviant Pd score on the MMPI, is not a 
homogeneous behavioral entity. Rather, a sub- 
division of this group on the basis of the Welsh 
IR led to the formation of two subgroups, 
neurotic psychopaths (IR > 1.00) and pri- 
mary psychopaths (IR < 1.00), whose re- 
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sponses in the partial reinforcement situation 
were quite different. With manipulation of se- 
quence and reinforcement conditions, the per- 
formance of the neurotic psychopath was in- 
distinguishable from that of the normal, 
whereas the behavior of the primary psycho- 
path was significantly affected by both treat- 
ment variables. This finding is compatible 
with the Karpman (1948) and McCord and 
McCord (1956) hypothesis and is supported by 
the work of Hill, Haertzen, and Glaser (1960) 
and Painting (1954) with the MMPI. 

The frequency with which the primary psy- 
chopath alternated his responses under the 
various experimental conditions provides an 
interesting parallel to clinical observation. 
Where the reinforcement schedule disallowed 
any regularity in stimulus-response contin- 
gency (Sequence I), the primary psychopath 
group tended to repeat that response which 
had been correct on the preceding trial. To 
the extent that the alternation measure re- 
flects an attempt to secure an appropriate 
strategy, the behavior of the primary psycho- 
path became stereotyped and rigid in display- 
ing a strong positive recency effect. 

With a schedule permitting the control of — 
reinforcement by responses (Sequence II), alll 
groups approximated the (m = 1) dependency 
with the primary psychopath tending to maxi- 
mize his gains in the reward condition. It is _ 
relevant to note that while normals generally 
match event probabilities, maximizing be 
havior has been demonstrated with an experi- 
mentally induced “gambling set” (Goodnow, 
1955). These data suggest that the gambling ` 
set is the customary attitude of the primary — 
psychopath, an interpretation which is in 
agreement with that of many clinical ob- 
servers. Notice that the performance in Se- 
quence II is not at all compatible with the 
position that the psychopath is incapable of 
profiting from experience. If anything, when 
the stimulus and response are in close tem- 
poral proximity, the level of psychopathic 
performance is superior to that of normals. 

Under a more remote stimulus-response cor- 
relation (Sequence III), the performance of 
the primary psychopath deteriorates, however. 
This deterioration tends to be general and not 
restricted to a particular reinforcement opera- 
tion. It would appear, therefore, that with 
remote determination of the correct response; 
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the most common naturally occurring relation- 
ship, the primary psychopath is, in truth, less 
proficient than the normal. The origins and 
stability of this deficiency under different 
experimental conditions are, of course, un- 
known. These findings do indicate, however, 
the danger in generalizations concerning the 
psychopath which do not take into account 
the heterogeneity of the group as convention- 
ally defined and situational variables such as 
response consequences and contingencies. 


SUMMARY 


A total of 108 subjects (36 primary psycho- 
paths, 36 neurotic psychopaths, and 36 nor- 
mals) were studied with a two-choice partial 
reinforcement method. Three probability 
learning sequences of differing conditional 
probabilities were used: independent, (m = 1) 
dependent, and (n = 2) dependent. These se- 
quences were used in combination with two 
response consequences, reward and avoidance. 
In all comparisons, the behavior of the neu- 
rotic psychopaths was indistinguishable from 
that of the normals. Performance of the pri- 
mary psychopaths, however, tended to become 
inflexible in the independent sequence under 
avoidance conditions and to deteriorate gen- 
erally with the more remote stimulus-response 
contingencies. The implications of these find- 
ings for statements based upon clinical ob- 
servation were discussed. 
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reports that anticipated exposure to a 

painful external stimulus determines the 
degree to which persons wish to affiliate with 
each other: the greater the anticipated pain, 
the stronger the desire to await the onset of 
that pain in the company of others in the 
same predicament. In attempting to account 
theoretically for this finding, Schachter men- 
tions such motivational forces as the subjects’ 
needs for reassurance, distraction, escape, and 
information. However, among the various 
possible explanations, Schachter appears to 
favor one derived from Festinger’s (1954) 
theory of social comparison processes. Adapting 
that theory to the phenomena under investiga- 
tion, Schachter postulates that the arousal of 
any strong emotion evokes a need for compari- 
son. Emotions are assumed to be quite un- 
specific states of affect. Hence, persons can 
only evaluate the quality, intensity, and ap- 
propriateness of their emotions properly by 
comparing their own reactions with those of 
others. Moreover, novel emotion producing 
stimuli should induce a greater tendency to 
affiliate than familiar stimuli. By definition, a 
novel stimulus is one that is more difficult to 
fit into a person’s established frame of refer- 
ence for emotive states. Accordingly, the in- 
dividual is more obliged to seek out others in 
order to define the emotional effects of novel 
stimuli. 

The explication of Schachter’s (1959) re- 
sults in terms of the theory of social compari- 
son processes is appealingly parsimonious. 
However, it requires the assumption that all 
emotive states have the same effect on affilia- 


I his recent monograph, Schachter (1959) 


1 The authors are indebted to the Yale Communica- 
tion Research Project, directed by C. I. Hovland, for 
its material support of this study, and to C. I. Hovland 
for his encouragement and helpful suggestions. Our 
thanks are extended to Jacob Rabbie and Harold 
Gerard for their contributions to the study. We also 
wish to acknowledge the expert and enthusiastic partici- 
pation of our research assistants, Ted Sheldon, Sally 
Whitcher, and Ira Grushow. 
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tive behavior. Thus, Schachter, like many con- 
temporary psychologists, does not deal with 
the possible conceptual distinctions between 
fear and anxiety. Yet, it seems to us that, by 
adopting an alternative assumption about the 
psychological properties of emotions, to be 
presented briefly below, it is possible to formu- 
late predictions concerning affiliative responses 
that could not have been derived from the 
theory of social comparison processes. Indeed, 
by employing Freud’s (1949a, 1949b) concep- 
tual distinctions between fear and anxiety, we 
are led to predict a tendency toward social 
isolation—rather than affiliation—as a con- 
sequence of certain conditions of emotional 
arousal. 

The present experiment was, thus, under- 
taken with two objectives: to assess the em- 
pirical validity of conceptual differentiation 
between fear and anxiety, and to evaluate the 
extent to which the theory of social comparison 
processes may be applied to the relationship 
between all emotions and affiliative behavior. 
In order to implement these objectives, we 
have conducted an experimental investigation 
of the differential effects of fear and anxiety 
upon social affiliation. 


Functional Relationship between Emotions and 
Motives 


The guiding assumption of our experiment 
holds that all emotions are consciously experi- 
enced epiphenomena of motives.? When our 
motives are aroused, we experience subjective 
reactions to which we learn, over time, to at- 
tach commonly agreed upon labels that signify 
the various emotions. 

Motive, on the other hand, is defined as a 
tension producing stimulus that provokes be- 
havior designed to reduce the tension. Each of 
our motives (innate or learned) requires the 


2 The concept of motivation which we have chosen 
to employ has been elaborated elsewhere (Sarnoff, 
1960a). 
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performance of a different response for the 
maximal reduction of its tension. 


Fear and Anxiety Viewed as Motives 


The motive of fear (which Freud called ob- 
jective anxiety) is aroused whenever persons 
are confronted by an external object or event 
that is inherently dangerous and likely to pro- 
duce pain. Only one type of overt? response can 
maximally reduce our fear: separation from 
the threatening aspects of the feared object, 
accomplished by flight from the object, at one 
extreme, and conquest, at the other. In the 
case of fear, then, one’s energies are mobilized 
toward dealing with the external stimulus; to 
eliminate, through some mode of escape or 
attack, the threat that is clearly and objec- 
tively present in the stimulus. 

If we examine the consequences of anxiety 
(which Freud termed neurotic anxiety), we see 
no such correspondence between the internal 
disturbance of the person and an objectively 
harmful environmental stimulus. Instead, anx- 
iety is typically aroused by stimuli which, ob- 
jectively considered, are innocuous. For ex- 
ample, in the case of the classical phobias, 
harmless objects possess a special motivational 
significance for certain people. These objects 
activate some motive other than fear, and this 
other motive, in turn, arouses the consciously 
perceived motive of anxiety. Hence, the emo- 
tional reaction of the anxious person is inap- 
propriate to the inherent characteristics of the 
external stimulus. 

Regardless of their content, the motives 
whose arousal evokes anxiety share a common 
property: they are all repressed. These re- 
pressed motives continue unconsciously to 


3 Space limitations do not permit a consideration of 
the two types of covert (ego defensive) responses, 
denial and identification with the aggressor, which 
persons may employ in their efforts to cope with external 
threat. A full discussion of these ego defenses is pre- 
sented by Sarnoff (1960a). 

4In fact, since anxiety arousing stimuli are often 
related to unconscious libidinal motives, they may be 
regarded by most people as intrinsically pleasurable, 
rather than in any way painful. For example, owing to 
the manner in which their heterosexual motives have 
been socialized, some men may tend severely to repress 
their sexual cravings for women. Hence, when such 
men are shown photographs of voluptuous nudes, 
stimuli which might be quite evocative of pleasurable 
fantasies among most of their fellows, they are likely to 
experience anxiety (Sarnoff & Corwin, 1959). 
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press for the reduction of their tensions; and 
anxiety signals the threat of possible expression 
of these repressed motives. Consequently, the 
person develops a number of additional ego 
defenses that function to safeguard the initial 
effects of repression. If the ego defenses do 
their work effectively, the motives are kept 
under repression, the inner danger passes and 
the individual’s anxiety is reduced, 


Implications of the Motives of Anxiety and Fear 
for Affiliative Behavior 


It follows from the foregoing discussion that, 
when their anxieties are aroused, people are 
more inclined to become preoccupied with the 
reassertion of inner self-control than with 
modes of dealing with the anxiety evoking ex- 
ternal object. Because the anxious person tends 
to be aware of the element of inappropriateness 
in his feelings, he is loath to communicate his 
anxieties to others. To avoid being ridiculed 
or censured, he conceals anxiety aroused by 
stimuli which he guesses do not have a similar 
effect upon others, and which, he feels, ought 
not so to upset him, Thus, when anxiety is 
aroused, a person should tend to seek isolation 
from others. On the other hand, when fear is 
aroused and he is unable to flee from the 
threatening object, he welcomes the opportu- 
nity to affiliate. Since the usual responses to 
fear, flight and fight, are restricted in the ex- 
perimental situation, the subject seeks other 
fear reducing responses. Therefore, the proba- 
bility of affiliation increases because it mediates 
fear reduction through the potentiality for 
catharsis and distraction as well as the emo- 
tional comparison offered by interpersonal con- 
tact. 

We are led, therefore, to the hypothesis that 
the motives of fear and anxiety should influence 
social affiliation behavior differently: the 
greater the fear aroused, the more the subjects 
should choose to be together with others while 
they await actual contact with the fear arous- 
ing object. Conversely, the greater the anxiety 
elicited, the more the subjects should choose 
to be alone while they await contact with the 
anxiety arousing object. 


METHOD 


The experiment was presented to the subjects as a 
physiological investigation of the cutaneous sensitivity 
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of various parts of the body. A 2 X 2 design was used 
in which two levels of fear and of anxiety were experi- 
mentally aroused. The dependent variable of social 
affiliation was measured by having the subjects state 
whether they preferred to spend an anticipated waiting 
period alone or in the company of others. 
Subjects 

The subjects were 72 unpaid, male undergraduate 
volunteers from six introductory psychology classes in 
Yale University. An additional 36 subjects were used 
to pretest the manipulations and measuring devices, 
and an additional 13 subjects were excluded from the 
analyses because they did not qualify as acceptable 
subjects, i.e., were friends, misunderstood the instruc- 
tions, did not believe the rationale. 


Procedure 


Background information was collected by an 
accomplice alleged to be from the counseling program 
of the Student Health Department. A questionnaire 
was designed to obtain background information on 
the subjects and also their preferred mode of defense 
mechanism, The latter data were in response to four 
Blacky cards, As in a recent experiment by Sarnoff 
(1960b), each card was accompanied by three alterna- 
tives that were to be rank ordered according to the 
subjects’ reaction to the theme of the card (sibling 
rivalry, achievement, and two of sucking). The alterna- 
tives reflected predominantly an acceptance of the 
motive, projection of the motive upon others, or a 
reaction formation against the motive. 

About one month later, the experimenter was in- 
troduced to the psychology classes as a physiological 
psychologist studying physiological responses to sensory 
stimuli. The subjects were subsequently recruited in- 
dividually, and randomly assigned to the four experi- 
mental treatments. The specious purpose of the experi- 
ment and of the conditions of waiting were further 
established by marking the experimental room “Sen- 
sory Physiology Laboratory”and two nearby rooms 
“Waiting Room A” and “Waiting Room T.” Because 
of absentees, the size of the groups tested varied from 
three to five, and was usually composed of four sub- 
jects. In order to avoid the development of superficial 
friendships during the experiment, and eliminate the 
possibility that the subjects might react to cues from 
each other or from the experimenter, the subjects were 
isolated in adjacent cubicles, no communication was 
allowed, and the tape-recorded instructions were 
presented through earphones. 

The experimental conditions and instructions 
common to all subjects will be presented first. After 
rolling up their sleeves, removing their watches from 
their wrists, and gum or cigarettes from their mouths 
(“They interfere with the recording electrodes”), the 
subjects were told: 

Our experiment falls in the general area of physio- 
logical psychology. As you may know, one branch of 
physiological psychology is concerned witł the re- 
actions of the sense organs to various kinds of stimu- 
lation. Our present experiment deals with the skin 
{or mouth] as an organ of sensation. We are interested 
in studying individual differences in response to 
particular stimuli applied to it. 
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There has been a good deal of controversy about 


the relative sensitivity of the fingertips [lips] as — 


compared to the palms [tongue], and upper surface 
of the hand [palate]. Our experiment will help to 
provide data upon which we may be able ultimately 
to draw a detailed map of the cutaneous sensitivity of 
the human hand [mouth]. 

In order to measure your physiological reactions, 
we are now going to attach some instruments to 
your arm and finger [corner of your mouth]. These 
instruments are electrodes which are connected to a 
machine which records exactly the strength of your 
response to each stimulus. . . . Electrode jelly will be 
applied first to the area to insure that we get a good 
electrical contact. (The electrodes were then at- 
tached by a female laboratory assistant of middle 


age.) 

In order to provide a reasonable basis for asking the 
subjects to wait in other rooms (and, thus, for making 
the choice of affiliation or isolation), the subjects were 
told that it was necessary to assess their basal rates of 
responding prior to the application of the actual stimuli. 
They were led to believe that their individual sensitivi- 
ties were being recorded while they viewed a series of 
slides of a typical subject who had participated in the 
experiment. They anticipated that a waiting period 
would come after the slides, and then in the second— 
and purportedly major—part of the experiment their 
direct reactions to the actual stimuli would be meas- 
ured. Accordingly, they were told: 

Now that your basal rates have been recorded on 
our polygraph recorder, it will take us about 10 
minutes while we tally the data and reset our measur- 
ing instruments so that they will be geared to your 
individual basal rates as you are run one at a time 
through the rest of the experiment. While we are 
doing these things, we are going to ask you to wait 


in other rooms which are available to us. We will — 


come to get you when it is your turn to go through 
with the experiment. Incidentally, we have found 
that some of our subjects prefer to do their waiting 
alone, while others prefer to wait together with other 
subjects. Therefore, we are going to give you your 
choice of waiting alone or with others. In either case, 
you will be ushered to a comfortable room furnished 
with adequate reading material. 

After indicating their preference of waiting alone 
or together with others, the subjects also indicated the 
intensity of this preference on an “open-ended” scale 
in which 0 represented a very weak preference and 100 
a very strong preference. On this relatively unstructured 
scale there was as much as 175 points of difference 
between subjects (from “75-alone” to “100-together”)- 

Presentation of the slides during the experiment 
served two purposes in addition to the one previously 
mentioned. The content of the slides (appropriate to 
each experimental treatment) served to reinforce the 
subjects’ differential expectations of the nature and 


— 


severity of the stimulus situation. Furthermore, the — 


subject seen in the slides became a focal point for 
measuring the effectiveness of the experimental manipu- 
lations. It was assumed that a direct attempt (by 
means of a scaled question) to appraise the level of the 
subjects’ fear or anxiety would be likely to: sensitize 
them to the true purpose of the experiment; yield 


ÅNXIETY, FEAR, AND SOCIAL AFFILIATION 359 


unreliable results since the subjects might neither be 
consciously aware of, nor able to vexbalize, their anx- 
iety reaction; and evoke resistance since some subjects 
might not want to admit to being anxious or fearful, 
calling their masculinity into question. 

Therefore, it was necessary to use an indirect, 
disguised measure to evaluate whether the experimental 
inductions had actually aroused two levels of both fear 
and anxiety. Immediately after the slides had been 
shown (but before the affiliation choices had been made), 
the subjects were told: 

As you may know, an individual shows his physi- 
ological reaction in a variety of behavioral forms. We 
are interested in seeing whether it is possible to 
estimate how ill-at-ease or upset individuals are at 
the prospect of receiving the stimulation in this 
experiment. Recalling the subject whom you just 
saw in the slides, how upset or ill-at-ease did he seem 
to you? Please assign a number anywhere from zero to 
100 to indicate your feeling. (Zero = unconcerned, 
at ease; 100 = extremely concerned and ill-at-ease.) 
Since the subject in the slides was a posed model 

instructed to remain poker faced throughout, it was 
assumed that there was no objective difference in his 
expression. Thus, any systematic difference in ratings 
between groups should reflect a projection of the sub- 
jects’ own motives upon this screen. 

However, because the content of the slides was not 
identical for every group but rather “tailored” to each 
specific treatment, it was possible that the model may 
have actually looked more fearful in the slides shown to 
the subjects in the High Fear than in the Low Fear 
condition. As a control check on this possibility, four 
additional introductory psychology classes (V = 108) 
served as judges. They were told that the slides were of 
a typical subject in a recently completed experiment, 
and their task was to estimate how illat-ease and 
concerned he appeared (on the same scale used by the 
experimental subjects). Two of the classes saw only the 
face of the model (the rest of the slide was blacked out) 
and were told only that he was a subject in a physio- 
logical experiment in which stimuli were applied and 
responses measured. The other two classes saw the 
entire stimulus field of the slides and were given the 
same complete description that the experimental sub- 
jects received. Since each class of judges rated the slides 
for all four experimental treatments, the order of 
presentation was counterbalanced. 

After the projective measure of motive arousal and 
the measure of affiliation, the electrodes were removed 
and a measure taken of the subjects’ reasons for choosing 
to affiliate or be isolated. This was done with the ration- 
ale that a social psychologist had become interested in 
the fact that some of our subjects preferred to be to- 
gether while others preferred to be alone, and he had 
asked us to get some information for him about the 
reasons underlying this preference. 

The questionnaire, designed by Gerard and Rabbie 
(1960), contained both open-ended and structured 
questions asking for reasons for the affiliation choice. 
Finally, the subjects noted whether or not they wished 
to continue in the experiment. Only one subject (in 
the High Fear condition) refused to remain for the 
“stimulation” part of the experiment. 

The true purpose, hypothesis, design, and reasons 


for the various deceptions (and, at a later time, the 
results) were explained fully to each subject. 


High Fear 

A high level of fear was induced by leading the 
subjects to anticipate a series of painful electrical 
shocks. Although they expected to endure each of the 
shocks for 2 minutes, the subjects were assured that the 
shocks would not cause damage or injury. 

The female assistant (dressed in a white lab coat, as 
was the experimenter) then attached electrodes to each 
subject’s arm and fingertip and strapped his arm onto 
a cotton-padded board. The leads from the electrodes 
appeared to go to a polygraph recorder, which also was 
seen inthe series of slides of the typical subjects. Another 
slide showed an enormous electrical stimulator, and the 
implication was that it was behind a curtain in the 
experimental room. It was called to the subjects’ at- 
tention that: 

The four dials shown in the upper right-hand 
corner of the stimulator enable us to regulate auto- 
matically the frequency, duration, delay, and in- 
tensity of the shock you will get. 

The other slides portrayed the subject with earphones 
and electrodes attached (like the subjects themselves), 
“listening to the instructions”, and then “about to 
receive his first painful shock,” administered by the 
experimenter, who could be seen in the background 
manipulating the dials on the stimulator. A final 
situational factor that may have enhanced the effec- 
tiveness of the High Fear manipulation was that the 
experimental room housed electrical generators which 
made a continuous buzzing sound, a cue interpreted 
by the High Fear subjects as the electrical stimulator 
“warming up,” but unnoticed or quickly adapted to 
by the other subjects. An unobtrusively posted sign 
reading “Danger/High Voltage,” present only for the 
High Fear subjects, gave further credence to this 
notion. 


Low Fear 

In the Low Fear condition the word “shock” was 
never used, and all cues in the situation associated with 
shock, fear, or pain were removed; i.e., no white lab 
coats, arms not strapped to boards, etc. The expecta- 
tions of these subjects were guided by instructions 
stating that our methodology was to apply a 10-second 
stimulus of very low intensity that would be just suf- 
ficient to elicit a measurable physiological response. 

In the series of slides viewed by these subjects, the 
imposing electrical stimulator was replaced by a small 
innocuous looking apparatus (actually a voltmeter), and 
the experimenter was seen not in the active role as an 
agent of pain, but in the passive role of recording data 
from the polygraph recorder. 


High Anxiety 

Anxiety was manipulated by arousing a motive 
that was assumed to have been repressed by most of 
the subjects. In Freudian terminology, the motive 
might be called “oral libido,” a desire to obtain pleasur- 
able gratification by sucking on objects that are clearly 
related to infantile nursing experiences. The female 
breast is, of course, the prototype of such objects, but 
others include nipples, baby bottles, and pacifiers. 
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Thus, to arouse this oral motive and, hence, the anxiety 
that should follow its arousal, subjects in the High 
Anxiety condition were led to believe that they would 
have to suck on a number of objects commonly associ- 
ated with infantile oral behavior. They were told that 
their task would be to suck on these objects for 2 min- 
utes while we recorded their physiological responses 
from the skin surfaces stimulated by the objects. In 
clear view in front of the subjects were the following 
items: numerous baby bottles, oversized nipples, 
pacifiers, breast shields (nipples women often wear 
over their breasts while nursing), and lollipops. 

The same variety of stimulus objects was shown 
arrayed in front of the subject in the slides. He could 
be seen, tongue hanging out, lips puckered, about to 
suck his thumb (as one of the objects of stimulation) or 
one of the other objects. Subjects were told that the 
contact taped to the mouth recorded the direct reac- 
tion to the oral stimulation, while the arm contact 
recorded peripheral reactions. 


Low Anxiety 


The instructions to the Low Anxiety subjects did not 
mention “suck,” nor any stimulation that they would 
receive from putting the objects in their mouths. 
Moreover, they were led to believe that they would 
keep each object in their mouths for only 10 seconds. 
The stimulus objects were not in immediate proximity 
to the subjects while their electrodes were being at- 
tached. The stimulus objects which they anticipated 
putting in their mouths were shown in the slides: 
whistles, balloons, “‘kazoos,” and pipes. Since these ob- 


jects do not require sucking (but rather, in general, 
blowing), the model’s tongue was not seen as he pre- 
pared to use the stimuli in the slides. 


RESULTS 


Evidence of the Effectiveness of the Experimental 
Manipulations 


In using the subjects’ estimates of the degree 
to which the model seen in the slides was upset 
by the prospect of receiving the stimulation in 
the experiment, it was assumed that the sub- 
jects would tend to project their induced level 
of fear and anxiety. Table 1, which presents 
the mean projection scores for each experi- 
mental treatment, offers evidence that this 
assumption was valid and the manipulations 
effective. The High Arousal subjects perceived 
the model to be significantly> more upset, con- 
cerned, and ill-at-ease than did the Low 
Arousal subjects. 

Our theoretical distinction between fear and 
anxiety, and the way these concepts were 
operationally defined in this experiment, lead 
to the prediction that, assuming similarity of 


6 All p values reported throughout the paper are 
based on two-tailed tests of significance. 
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TABLE 1 
MEAN PROJECTION SCORES FOR EACH 
NTAL TREATMENT 


Level of Arousal 
Motive 
Low High 
Fear 24 42 <.01 f 
Anxiety 14 318 <.01 
ns ns 


Note.—The larger the score, the greater the degree of projec 
tion. 

^ Variance greater than in High Fear group, p < .10; SD fo 
High Anxiety = 24, for High Fear = 16. i 


past experience, persons facing the sa 
clearly, objectively present threat should re 
in a relatively homogeneous fashion. This 
correspondence between stimulus and respo 
is not assumed to hold for anxiety. We hi 
already noted that a stimulus that prodi 
anxiety for some persons is not an anxi 
producing cue for many others. Since the s 
nificance of the stimulus depends upon 
symbolic and generally idiosyncratic associ 
tions, one would expect that a stimulus wh 
elicited anxiety for persons with relevant p 
dispositions (repressed motives) would 

less effect on those who had more adequat 
resolved the conflict over the expression ol 
same motives. Thus, one way of determi 
whether our experimental manipulations pi 
duced two different motives, fear and an 
(rather than only two levels of one motivi 


The heterogeniety of response in the Hig 
Anxiety group is, as predicted, greater than 
the High Fear and the Low Arousal condi 
The same difference in response variab' 
between the High Anxiety group and all oth 
groups is manifested as well in the depen 
variable of social affiliation. The questionn 
data to be presented in a later section 
further support to the distinction between fea 
and anxiety. ; 

Before presenting the major results, it is 
necessary to account for two possible sources 
of artifact in the just reported data on pri 
jection. They are: by chance sampling, the Hi 
Arousal groups could have contained mo 
subjects who characteristically used projectio 
as a mechanism of defense than the Lo 
Arousal groups; and the subject seen in th 
High Fear and High Anxiety slides was ob- 
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jectively more upset and concerned than he 
was in the Low Fear and Low Anxiety slides. 
Tf either of these alternatives were true, then 
the projection measure would not be a reflec- 
tion of differences due to the experimental 
arousal of levels of fear and anxiety. 

The pretest data of the subjects’ mode of 
defense preference on the Blacky Projection 
test show no initial significant difference be- 
tween any of the groups in their tendency to 
use projection. 

Among the groups of neutral judges who 
evaluated all the slides shown in the study, 
from 68%-98% reported perceiving either no 
difference in the degree to which the model 
appeared upset, or a difference opposite to that 
reported in Table 1. This result holds for both 
fear and anxiety, and regardless of the order of 
presentation or amount of the stimulus field 
seen (model’s face only or entire slide). Thus, 
it appears that the projection measure can be 
used as an index of the efficacy of the experi- 
mental conditions and manipulations. 


Effects of Fear and Anxiety on Social Affiliation 


The results bearing upon the hypothesis of 
the study are presented in Table 2, where for 
each condition the mean intensity of desire to 
affiliate, as well as the number of subjects 
choosing to affiliate and to be alone, are pre- 
sented. It is evident that there is a strong 
positive relationship between fear and the in- 
dex of affiliative tendency, but a strong nega- 
tive relationship between anxiety and affilia- 
tion, so that as fear increases affiliation also 
increases, while an increase in anxiety results 


TABLE 2 
RELATIONSHIP OF MOTIVE TO SOCIAL AFFILIATION 
Number of Subjects 
Mean Affiliation Choosing 
frengts Together Alone or vo- 
F 
Tow 34.0 12 3 
High 51.0 19 1 
Anxiet; 
Dee 27.0 11 4 
High 8.0 10 12 


garacHon: (Motive X Level) p < .05, 
2.30, df = 68. 


^ The larger the score, the greater the affiliation tendency; iso- 
lation intensity score subtracted from affiliation intensity score. 
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in a decrease in affiliation. Thus, our prediction 
of an interaction between kind of motive and 
level of arousal is clearly supported by the 
data. While some 95% of the High Fear sub- 
jects chose the “together” alternative (with 
more than 0 intensity), only 46% of the High 
Anxiety subjects chose to wait together. The 
marked mean difference between these groups 
in intensity of choice (51.0-8.0) is significant 
well beyond the .01 level (¢ = 3.63). The large 
mean difference in affiliative tendency between 
the High and Low Fear groups (p < .07, 
t = 1,96) represents a replication of Schachter’s 
(1959, p. 18) results. While the mean difference 
between High and Low Anxiety was even 
larger than that between the Fear conditions, 
it only approached significance (p = .16, t = 
1.46) due to the marked heterogeniety of vari- 
ance of the High Anxiety group. 


Reasons Given for Affiliation Choice 


The final measure taken was a questionnaire 
that explored the reasons the subjects gave 
for choosing to wait together with others or to 
wait alone. The 11 structured items on the 
questionnaire each presented a possible motive 
for affiliation; and each was accompanied by a 
70-point scale on which the subject indicated 
how important he thought the motive was in 
determining his choice. The highly significant 
interaction between experimental treatment 
and questions (p < .001, F = 3.74, df = 
30.570) on a repeated-measurement analysis of 
variance justified a search for those questions 
(motives for affiliation) that differentiated the 
groups. 

Since there were too few subjects choosing 
the alone condition, the analysis is limited to 
those wanting to affiliate. The motives for 
affiliation that were most important for the 
High Fear subjects and most distinguished 
them from the Low Fear subjects were (the 
lower the mean, the greater the importance; 
10 = extremely important): 


1. Lam not sure whether I am reacting in the same 
way as the others to the prospect of getting shocked 
and would like to compare my reactions to theirs. 
[Emotional comparison] High Fear = 38, Low Fear 
Š = 54, p < 001. 

2. I feel worried about getting shocked and would 
like to know to what extent the others are worried too. 
[Extent of comparison] High Fear = 40, Low Fear 
ž = 61, p < .001. 
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3. I want to be distracted in order to take my mind 
off the prospect of getting shocked. [Distraction] High 
Fear X = 44, Low Fear x = 59, p < .01. 

4, I am worried about the prospect of getting shocked 
and felt that talking with someone about it would get 
it off my chest. [Catharsis] High Fear = 50, Low Fear 
ž = 59, p < .05. 

The reasons for affiliation given spontane- 
ously to a single open-ended question also re- 
flect the importance of these same considera- 
tions. Among High Anxiety subjects choosing 
to be alone, the major reason given spontane- 
ously and supported by the scaled questions is 
the desire “to be alone to think about personal 
affairs and school work.” 

Curiosity as to “what the others were like” 
was important, but equally so across all con- 
ditions. Of least importance among all sub- 
jects are the following motives for affiliation 
(“oral stimulation” substituted for “shock” for 
Anxiety groups): 


“Tt would be clearer in my own mind as to how I 
feel about getting shocked if I could express my reac- 
tions to someone else.” “I anticipated that the others 
would offer reassuring comments.” “I want to be with 
others to reassure myself that I am not the only one 
who was singled out to be shocked.” “I feel that per- 
haps together we could somehow figure out a way to 
avoid getting shocked.” 


There are several large differences between 
the High Fear and High Anxiety groups; with 
the former finding the following motives as 
significantly more important: emotional com- 
parison, extent of comparison, distraction, 
catharsis, and the physical presence of others 
(p < .05 in each instance). Similarly, an in- 
ternal analysis of the High Fear group reveals 
these same motives (especially catharsis and 
emotional comparison) to be more important 
for those subjects who chose to affiliate most 
strongly than for those below the group median 
in affiliation strength. 


Ordinal Position and Its Relation to Affiliation 


While the reasoning used in the planning of 
the present study did not include predictions 
of the effects of ordinal position upon affilia- 
tion, data relevant to this question was never- 
theless obtained, to check on Schachter’s 
(1959) finding that affiliation tendencies in- 
creased with emotional arousal only among 
first- and only-born children. This finding is 
duplicated in the present study. First-born 
children want to affiliate significantly more 
than later-borns under conditions of high fear, 
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but not when the level of fear is low. 
the mean affiliation intensity for the first-bor: 
High Fear subjects was 62, it was only 23 
the later-born High Fear subjects (p = 
t = 2.10). This same general finding holds fo 
the High Anxiety group, but again the wit 
group variability does not permit the larg 
mean difference obtained (16 for first-bor 
and —3 for later-borns) to be statistically 
nificant. 
Discussion . 
Since our basic hypothesis has been sup- 
ported, our results lend credence to the p 
ously drawn conceptual distinction betw 
fear and anxiety. In view of the fact that 
anxiety arousing stimulus was specifically 
signed to tap only one kind of repressed moi 
it of course remains an empirical ques 
whether or not the evocation of other type 
of presumably repressed motives also leads t 
social isolation. il 
In order to predict the consequences of thi 
arousal of a motive, therefore, it is nece 
to know which responses are required to reduc 
its tension. The probability of the social 
parison response is, thus, a function of 
kind of motive aroused, the intensity of 
motive, the degree of novelty of the emo 
experience, the response hierarchy associated 
with the specific motive, and certain attrib’ 
of those with whom the person is to affili 
We do not question the assumption that 


However, we feel that the need for self-evalua 
tion is not the most salient motive aroused 
the experimental situations that Schacht 
(1959) and we employed. We do not view the 
cognitive need to structure a vague emotioi 
state as the primary motive in these e 
ments; we see social comparison not as an 
in itself but merely as one of the se 
responses that are instrumental in reducing the 
tension associated with the situationally moré 
salient motives of fear and anxiety. 

Strict application of the theory of emotio1 
comparison Processes to the present experi! 
tal situation should lead one to predict greai 
affiliation tendencies for the High Anxi 
subjects than the High Fear subjects, sin 
the Anxiety situation was more unusual th 
that of Fear, and the emotion aroused w. 
probably more novel and vague. The oppos 
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prediction, supported by the results, demands 
an approach, such as the one followed here, 
that specifies the probability of the response 
alternatives evoked by the dominant motives 
aroused, 

As the emotional experience becomes very 
novel and unusual, the need for comparison of 
one’s reactions with others should increase, 
and, hence, intensify affiliation tendencies. The 
induction of esoteric states of consciousness by 
“anxiety producing drugs” (being studied pres- 
ently by Schachter) may be the kind of situa- 
tion in which emotional comparison theory 
offers the best explanations and predictions. 
Under such circumstances, it may be possible 
to create emotional states that are epiphe- 
nomena of motives whose neurophysiological 
bases had never previously been set into 
motion. A more natural counterpart of this 
novel emotional experience occurs the first 
time a person experiences the emotions asso- 
ciated with the death of a loved one. 

The predictive importance of knowing the 
specific responses appropriate to the motive 
aroused is clearly illustrated by the following 
examples. If a person’s guilt is aroused, his 
response to feelings of guilt should be to seek 
out others only if they could be expected to 
punish him and, thus, to expiate his guilt, but 
not to affiliate with individuals perceived as 
unable to fill this role. Similarly, if repressed 
homosexual anxieties are aroused, isolation 
should generally be preferred to affiliation, as 
with oral anxiety in the present study. How- 
ever, affiliation tendencies should increase if 
the subject is allowed to wait with females, 
but not if he can wait only in the company of 
males. 

While our questionnaire data offer support 
for the importance of emotional comparison, 
they also point up the role of other motives 
such as need for catharsis and distraction. The 
marked difference in the importance of the 
reasons given for affiliation between the High 
Fear and High Anxiety groups is perhaps the 
most substantial evidence that the experimen- 
tal manipulations have indeed led to the 
arousal of two quite different motives. 

A final point of interest concerns the data 
about ordinal position. The finding that first- 
born children show greater affiliation tenden- 
cies than later-born children when either fear 
or anxiety are aroused supports Schachter’s 
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(1959) results. Theoretical and experimental 
attempts to uncover the dynamics underlying 
this “static” variable should prove interesting 
and fruitful. 


SUMMARY 


This experiment tests the utility of the 
psychoanalytic distinction between fear and 
anxiety for making differential predictions 
about social affiliation. It also assesses the 
breadth of generalization of Schachter’s (1959) 
empirical finding of a positive relation between 
emotional arousal and affiliation. Seventy-two 
subjects were randomly assigned to four ex- 
perimental treatments in which low and high 
levels of fear and anxiety were manipulated. 
The success of these inductions was established 
by a projective device and questionnaire data. 
The dependent variable of social affiliation was 
measured by having the subjects choose to 
await the anticipated exposure to the stimulus 
situation either alone or together with others. 

The results show that, while the desire to 
affiliate increases as fear increases (a replica- 
tion of Schachter’s, 1959, results), the oppo- 
site is true for anxiety; as anxiety increases the 
desire to affiliate decreases. Thus, as predicted, 
our findings lend empirical support to the 
theoretical distinction between fear and anx- 
iety. At the same time, our results suggest that 
the theory of social comparison processes may 
not be adequate to account for the general 
relationship between emotions and affiliative 
tendencies. 
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berg, and Budin (1958) hypothesized that 

normals under time pressures and schizo- 
phrenics under the usual word association con- 
dition would give fewer microgenetically “ma- 
ture” and more “immature” responses to 
stimulus words than normal controls, Such re- 
sponse categories as Synonym, Supraordinate, 
and Subordinate were assumed to reflect 
later microdevelopmental phases of cognition 
because of their logical “secondary proc- 
ess” character, while Distant, Completion, 
Perseveration, and other categories were as- 
sumed to represent earlier or abortive phases 
because of their superficial, contextual, or af- 
fect-mediated connection to the stimulus word. 
Differences between schizophrenics and nor- 
mals were significant and in the predicted 
direction for 5 of the 15 categories scored, and 
it was concluded that so far as word associa- 
tion is concerned, microdevelopmental imma- 
turity is accurately characterized by some but 
not all of the attributes assigned to it. The 
present study reports the results of micro- 
genetic analysis of word associations in a sec- 
ond sample of schizophrenic patients and in a 
sample of brain damaged patients. This latter 
group also presumably gives expression to con- 
crete or “immature” responses that normals 
usually suppress in favor of higher-level con- 
ceptual responses (Goldstein, 1943; Werner, 
1948). 


I a recent study Flavell, Draguns, Fein- 


METHOD 


The subjects were 32 schizophrenic patients and 30 
brain damaged patients on the psychiatry and neurology 
wards of several VA hospitals and 32 normal controls. 
These male groups were equated on age, IQ (as meas- 
ured by the Vocabulary subtest of the Wechsler- 
Bellevue Form I), and education. Means and SDs 
are reported in Table 1, none of the group differences 
approaching significance. Subjects were administered 50 
words from the Kent-Rosanoff list with instructions to 
respond with the first word that came to mind. Tests 
were given individually and responses recorded by 
hand. The protocols were scored by the author in terms 
of Flavell’s (Flavell et al., 1958, pp. 2-3) criteria as illus- 
trated below: 


1 Now at the Nebraska Psychiatric Institute. 
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A. Immature Categories 

1. Completion: Completes a common word or 
phrase (whistle-“stop”). 

2. Distant: Unrelated or distantly related (music- 
“hospital”). 

3. Perseveration: Distantly related to its stimulus 
word, but obviously related to or identical with a 
stimulus or response word occurring earlier in the 
series. 

4. Perseveration-meaningful: Closely related to its 
stimulus word, but identical to a stimulus or response 
word occurring earlier, 

5. Clang: Related only by sound (earth-“worth”), 

6. Emotional: Connotes affective judgment (health- 
“wonderful”), 

7, Repetition: Repetition or grammatical variant 
(deep-“depth”). 

B. Mature Categories 

1. Synonym: (chair-“seat”). 

2. Supraordinate: Denotes class of which stimulus 
word is member (butterfly-“insect”). 

3. Subordinate: Denotes member of class signified 
by stimulus word (sickness-“flu”). 

C. Categories of Indeterminate Microgenetic Phase? 

1. Attribute: Modifies or is modified by stimulus 
word (beautiful-“daughter” or stove-“hot”). 


2. Verb: Response is verb and stimulus word a 


noun or vice versa (cheese-“eat” or eating-“food”). 
3. Contrast-Coordinate: Antonym or comple- 
mentary (dark-“light” or salt-“pepper”). 
4. Multiword: (chair-to sit on”), 

The word list® used contained nine words in common 
with Flavell’s (Flavell et al., 1958) list and was com- 
Posed of nouns, adjectives, verbs, and other grammati- 
cal classes in roughly the same proportions as his. 
Identical word lists would make the results of the two 
studies more comparable, but positive results with a 
different list would lend greater support to the micro- 
genetic hypothesis, It was recognized that the frequen- 
cies of responses in the categories described above are 
partly a function of the parts of speech of the stimulus 
words. Nouns generally elicit other nouns, adjectives, 
and verbs; verbs rarely elicit other verbs or adjectives.’ 


* Blocking or the failure to respond within 10 seconds 
was scored in Flavell’s, but not in the present study. 

* Word association list and instructions to the sub- 
jects may be obtained without charge from Norman A. 
Milgram; 602 South 44th Avenue; Omaha, Nebr., of 
from the American Documentation Institute. Order 
Document No. 6744 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Congress; 
Washington 25, D. C., remitting in advance $1.25 for 
microfilms or $1.25 for photocopies. Make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress. 

* Personal communication from Roger Brown. 
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In this important respect, the two lists were compar- 
able. 
. 


RESULTS AND DISCUSSION 


The microgenetic hypothesis that schizo- 
phrenic and brain damaged patients would 
respond with fewer “mature” and more “im- 
mature” associations than normals found little 
support. Of the significant group differences 
obtained, as many were in the direction oppo- 
site to prediction as in the predicted direction. 
Table 2 indicates that normals gave fewer 
Perseveration (p < .05) and Repetition (p < 
.001) than schizophrenics. On the other hand, 
they also gave fewer Supraordinate (p < .001) 
and more Perseveration-meaningful responses 
(p < .01). In addition, of the indeterminate 
categories that are considered concrete and 
possibly immature, normals gave more Verb 
and Attribute and fewer Multiword responses 


TABLE 1 
AGE, IQ, AND EDUCATION OF THREE GROUPS 


(p < .001 for each). The brain damaged group 
gave fewer Perseveration (p < .05), more 
Perseveration-meaningful (ġ < .01) and more 
Verb responses (p < .01) than schizophrenics, 
and more Multiword (p < .001) and fewer 
Attribute responses (p < .01) than normals. 

In general, the brain damaged and schizo- 
phrenic groups were more similar to one 
another than was either to the normal controls. 
Rank order correlations of the frequencies of 
13 response categories yielded coefficients of .66 
and .77 for schizophrenics-normals and brain 
damaged-normals, respectively, and .95 for 
schizophrenics-brain damaged. 

In retrospect, several of the underlying as- 
sumptions of the microgenetic approach to 
word association appear untenable. For one, it 
was assumed that the occurrence of a verbal 
response logically defined as mature was nec- 
essarily contingent upon the occurrence of an 
analogous mature cognitive process in the sub- 
ject giving the response. Goldstein and Scheerer 
(1941, pp. 11-12), Yacorzynski (1941), and 


Cae ool ict sae Rapaport (1945, p. 149) have illustrated this 
non sequitur and have contrasted apparent 
Schizophrenic M 43.1 114.9 10.8 stability of conceptual word usage and actual 
ne Sp a a 4 ree 2 deficits in conceptual thought processes in these 
rein damaged SD 15.2 17.7 3.2 patient groups. The word association task may 
Normal M 44.5 115.8 11.6 be a poor choice for the illustration of micro- 
SD p dA wg pio genetic phases of thought because it permits 
TABLE 2 
CATEGORY SCORES FoR THREE GROUPS 
Normal Schizophrenic Brain Damaged 
Category + 
M SD M SD M SD 
I t 
S Complntion 2.78 2.22 2.19 2.19 1.63 1.40 ns 
Distant 1.09 1.35 2.06 1.87 1.43 1.75 ns 
Perseveration 0.13 0.33 0.44 0.71 0.13 0.35 .05 
Perseveration-meaningful 4.91 1.97 3.81 2.31 5.60 2.85 <.05 
Clang? 0 0 0.07 0.24 0.03 0.17 ns 
Emotional 1.28 1.25 0.88 1.45 1.23 1.96 ns 
Repetition 0.13 0.33 0.97 1.26 0.43 0.90 <.001 
Mature iy 
S; 6.00 3.72 7.84 3.93 7.47 4.50 ns 
SUprasrdinate 1.06 1.14 3.38 3.10 2.53 2.42 <.001 
Subordinate 0.81 0.78 0.81 1.09 0.93 0.93 ns 
Lier 4.09 2.69 2.09 2.39 2.63 1.67 <.001 
Verb 4.19 2.18 ie Age oie rai! <.01 
-l i 10.41 5.80 i i % x ns 
oe 0.56 1.16 4.50 6.34 5.87 6.46 <.001 


a All tests were two-tailed. F and ¢ tests were used when the assumption of homogeneity of variance was accepted, x? and ¢ 


test with Snedecor’s correction when it was rejected (Bartlett’s test). z 7 
> The number of Clang responses was so few that the category was disregarded in subsequent comparisons. 
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subjects to fall back on well-practiced and over- 
learned verbal habits whose operations may 
circumvent the microgenetic route. 

Flavell et al. (1958) suggested that the 
microgenetic formulation was hardly rigorous 
enough in its present form to rule out alternate 
explanations in terms of the inability of schizo- 
phrenics to maintain a constant mental set in 
word association. This writer agrees but would 
question if even normal subjects adopt the 
same mental set as one another, much less 
maintain a constant mental set in word associa- 
tion. In a recent study, Milgram (1960) showed 
that the set to give “adult” associations re- 
sulted in an increase in abstractly related re- 
sponses and the reverse for “child” associa- 
tions. These findings were obtained in a 
multiple-choice word association task which is 
hardly equivalent to the usual open-ended 
word association task. Nevertheless, these 
findings suggest that when the subjects are 
not instructed to adopt a specific mental set, 
some sophisticated subjects may choose to 
associate “immaturely” as evidence of their 
imaginative and creative approach, while 
others may adopt a “mature” semantic set as 
evidence of their knowing what words mean. 
This possibility receives some support from 
Flavell’s own data on hospital aides and col- 
lege students, who were significantly different 
on five response categories in the direction 
suggested above. 


Mircram 


In summary, while the microgenetic ap- 
proach toward cognitive processes in normal 
and pathological groups appears provocative 
and fruitful, attempts to apply this approach 
to word association are handicapped because — 
of uncontrolled factors operating in the mental 
sets adopted in word association and because 
of a lack of necessary correspondence between 
the logical maturity of a verbal response and 
the actual maturity of the cognitive process 
which produced it. 
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(the subject) is ordinarily asked to make 

judgments about a particular person 
(hereafter called O) in a given situation. Typi- 
cally the Æ (experimenter) requires S to use 
fixed scales for specifying his judgments of the 
particular O who is the object of the perception 
or cognition. Often E probably requires S to 
make judgments on dimensions that S would 
not ordinarily use in “sizing up” a particular 
O in that situation. Interpretation of the re- 
sults of such studies becomes difficult because 
the “meaning” and “salience” of the scales for 
S may be unclear, atypical, or even absent. 
Although the fixed scale technique has the ad- 
vantage of easy calculations, a technique is 
also needed by which Æ could determine which 
scales are appropriate for the Ss, Os, and situa- 
tions of interest to him. Interpretation of ex- 
perimental results could be facilitated by using 
only those scales, in a fixed scale study, which 
were previously shown to be meaningful for 
the Ss, the Os, and the situation under con- 
sideration, 

In their discussion of the necessity for at- 
tempting to determine the relevant dimensions 
on which Ss make judgments of Os, Hastorf, 
Richardson, and Dornbusch (1958) “. . . urge 
a momentary pause in the search for the ‘trait’ 
of sensitivity in order to study the qualities of a 
person’s experience of others in terms of the 
verbal categories he uses in reporting that 
experience. The central characteristic of this 
type of methodology would be the eliciting of 
free and unrestricted descriptions of other 
persons. The classes of persons to be described 
would be specified by the experimenter” (p. 


N experiments in “person perception,” S 
I ‘Xp! pel perception, 
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with this report. 


56). One purpose of this paper is to demon- 
strate the feasibility of such a methodology. 

In addition to the questionable meaningful- 
ness of scales and other response measures used 
in research on person perception, the stimulus 
used for the O is often highly unrepresentative 
of the population to which results are to be 
generalized. The literature shows that the O 
who is to be the object of the perception or 
cognition on the part of S has been specified in 
many ways. One can use real people (eg, 
Newcomb, 1956), films (e.g., Stone, Leavitt, & 
Gage, 1954), recordings (e.g., Jones & de- 
Charms, 1958), handwriting (e.g., Huntley, 
1940), photographs (e.g., Cline, 1956), draw- 
ings of faces (e.g., Brunswik, 1956), stories 
(e.g, Cawthorne & Kogan, 1959), autobiog- 
raphies (e.g., Sperber, 1959), lists of adjectives 
(eg., Asch, 1946), abstract figures (e.g., 
Heider & Simmel, 1944), etc. Many of these 
studies use artificial or synthetic Os and at- 
tempt to generalize to the perception of real, 
living people. While in many cases the stimuli 
serve the purpose of the research in which they 
were used, their generalizability is somewhat 
tenuous. Ss are doubtless aware of the arti- 
ficiality of the situation and it is not improb- 
able that these conditions also influence the 
results obtained. 

When asking S to judge real people Æ has 
difficulty in knowing, manipulating, and con- 
trolling all of the variables that may be rele- 
vant from S’s point of view. Not only is it 
difficult to obtain a broad enough sample of Os 
to allow generalization (often the sample is 
restricted to classmates or fellow dormitory 
residents and the like) but Æ often does not 
know, for instance, how well the S and O know 
each other, how long they have been friends, 
O’s perceived status from S’s point of view, 
etc. 

S moves, daily, in a world inhabited by 
different persons having certain relationships 
to him; some are of the same sex, some of the 
opposite, some are well known to S, others are 
only undifferentiated figures with no clear 
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identity, some are very important to S, others 
of little concern to him, etc. If one were to 
select a number of these people one would 
probably find that S describes them differently 
and behaves toward them differently. Further, 
the requirements of the situation doubtless 
influence what dimensions or characteristics 
assume the greatest importance in S’s descrip- 
tions of these people. 

The general research question becomes, what 
kinds of dimensions are used by different kinds 
of Ss when describing different kinds of Os 
under different kinds of conditions? 

Kelly (1955) has provided the Role Con- 
struct Repertory Test as a method of investi- 
gating this question. This test was designed for 
individual administration; when it is group 
administered it is difficult for E to decide if two 
Ss mean the same thing by a given term or 
expression; a term needs to be embedded in a 
context to be adequately defined. In the in- 
dividual administration of the test Æ can get 
additional information from S that would aid 
in this decision. Further, the Role Construct 
Repertory Test assumes that all dimensions 
apply, in one way or another, to all Os de- 
scribed and that all dimensions are bipolar 
(Kelly, 1955, p. 268), assumptions that the 
present Es feel may be premature. 

The present study attempts to apply another 
method to provide information about the 
kinds of dimensions used by different kinds of 
Ss when describing different kinds of Os. The 
method is simply to sample the people from 
S’s world and obtain his impressions of them. 
But the Os sampled from different Ss’ worlds 
should be comparable so that some degree of 
generalizability can be attained. To guarantee 
that Ss make their judgments about real peo- 
ple, that these Os represent a rather broad 
selection of persons, and at the same time to 
assure some degree of comparability of Os over 
various Ss, a method somewhat similar to that 
used by Kelly’s Role Construct Repertory Test 
was adopted, in that specific classes of Os to be 
described were designated. S was asked to 
choose someone he knew who fit the class, and 
then to describe the person he chose using 
any terms or dimensions he wished. After this 
a content analysis* was performed on the ob- 


3 There is, of course, no reason to believe that the 
free response—content analysis technique reveals the 
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tained free response data in an attempt to 
discover relationships. These relationships m 
serve to support the feasibility of using 
method for assessing the dimensions used | 
the Ss and to show that dimension use relat 
to differences in the kinds of Ss doing tl 
describing and differences in the kinds of 
being described. 

The free response—content analysis a 
proach involves the following assumptions 
can give verbal descriptions of other peopl 
These descriptions employ dimensions that 
feels are relevant in the judging of O. (It m 
be argued that S may, for psychodyn 
reasons, be unable to verbalize on the mo 
important dimensions. But as a beginning, 
present procedure may prove meaningful, 
least for conscious judgments.) The relai 
frequency of use of different dimensions 
given S is a reflection of the relative sali 
or importance of the dimensions for that 5. 

Such considerations lead to expectation 
like the following: 

1. The content of free verbal descripti 
of people can be readily and reliably anal: 
This implies that a content analysis can | 
performed, and that all responses can be 
bly coded into a set of selected categories. 
aspects of the adequacy of the category system 
used can be assessed by the usual criteria € 
reliability such as interjudge agreement 
repeat scoring, and to a degree, by the ext 
to which meaningful and statistically sig 
cant results can be obtained when the cat 
system is used in evaluating various 
potheses. A related expectation is that a gi 
S’s use of categories for a particular kind 
and over various types of Os should be 
sistent over time. 

2. Ss “weight” dimensions differently wl 
describing different Os; category use va 
with the kind of O being described. That 
Os different in certain respects tend to 
described with emphasis on different cat 


“true” dimensions uncontaminated by the cont 
analyst; it does, however, help one gain a better 
spective on the variety of dimensions used “natu 
The investigator must make the decisions about 
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gories, with different evaluative tone, and with 
different amounts of information. 

3. Similarly, groups of Ss that differ in 
certain respects tend to differ in their “weight- 
ing” of categories when describing the same 
kinds of Os. Here too there should be differ- 
ences in evaluative tone, and different amounts 
of information. Expectations 2 and 3 state 
that descriptions vary as a function of S differ- 
ences, when certain O characteristics are held 
constant, and with certain O differences, when 
certain S characteristics are held constant. 

4. One might also expect an interactive 
effect between the two sets of variables, such 
that the same O differences produce different 
differences in the descriptions when different 
groups of Ss are used. 


METHOD 


Subjects 


Sixty-six students (30 female, 36 male) enrolled 
during the 1958 summer session in an introductory 
psychology course at the University of Colorado served 
as Ss. 


Object Persons 


As already indicated, the Os were designated by 
constructing classes of people and having the Ss think 
of persons who fit the class, The classes were formed by 
using all possible combinations of well known-not well 
known, higher-equal-lower status (relative to S), and 
same and opposite sex, to generate 12 classes of Os to 
be described. An example of a class designation used is: 
“Think of a person who is the same sex as you, whom 
you know well, and whom you consider of higher status 
than yourself.” 


Procedure 


During a regular 60-minute class meeting the Ss 
were given 844” X 514” booklets, the first page of 
which contained instructions stating that S would be 
asked to think of various people he knew and to give 
descriptions of them. 

At the top of each of the 12 following pages there 
was a short class designation like the one given above. 
Immediately below this S was asked to indicate O’s 
age in a space provided for this purpose; the rest of the 
page was to be used for S’s description of O (“Describe 
this person”). The age question was included to help 
assure that S actually thought of a real O. If S could 
not think of a person to fit the designation he was 
requested to leave the page blank. 

In addition to the 12 descriptions of Os, S was also 
asked to describe himself. A page for this self-descrip- 
tion occurred first in the booklet for 24 Ss (12 males, 
12 females), in the middle of the booklet for 18 (12 
males, 6 females), and at the end for 24 (12 males, 12 
females). In order to minimize possible systematic 
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order effects, all but the instructions and final page 
(which asked for information such as S’s age, his major, 
etc.) in the book were arranged in different orders in 
different booklets, with no more than two Ss (1 male 
and 1 female) having the same page order. 


Content Analysis 


To arrive at content categories for the analysis, a 
judge (Judge 1) went through the booklets to obtain 
a global impression of what dimensions were used. 
This procedure yielded a sizable list which was then 
grouped on the basis of similarity into 13 major cate- 
gories.* These categories and their component dimen- 
sions are presented in Table 1. 

The categories may be viewed as falling into four 
major divisions: 

I. Objective Information. O’s appearance (Cate- 
gory 1), his background (Category 2), and his 
job, income, etc. (Category 3). 

II. Social Interaction. O’s behavior toward S 
(Category 4) and his behavior toward other 
people (Category 5). S’s reaction to and accept- 
ance of O (Category 7) andother people’sreaction 
to and acceptance of O (Category 6). 

III. Behavioral Consistencies. O’s temperament 
(Category 8), self-concept and emotional ad- 
justment (Category 9), and his morals and 
values (Category 10). 

IV. Performance and Activities. O’s abilities (Cate- 
gory 11), motivation (Category 12), and his 
activities and interests (Category 13). 

Using the 13 content categories, Judge 1 scored all 
of the booklets, without reference to the designations 
of the Os at the top of each page. Scoring consisted of 
recording which content categories were used, and the 
frequency of use for each page of each booklet, 

Three procedures were used to check scoring relia- 
bility. Two other judges (Judges 2 and 3) were asked 
to perform the analysis independently on 10 of the 
booklets. Judge 1 reanalyzed 10 booklets 6 months after 
his first scoring. Eleven Ss returned 2 weeks after the 
initial study to fill out the booklets again. At the time 
of this retest the Ss were told to make no special effort 
to describe the same Os they had described previously, 
nor to make an effort to think of new people. Both the 
test and the retest protocols were analyzed by Judge 1. 

For purposes of data analysis, a summary data sheet 
was constructed for each booklet with the 13 content 
analysis categories across the top and the 12 Os and 
the self down the side. This gave a matrix of 169 cells, 
the rows of which were the descriptions of Os and the 
columns of which were the content categories. 

The judge, when reading a given description, placed 
a mark to indicate the presence of a category, in the 
appropriate column cell for each phrase or complete 


4 Obviously other judges might have arrived at 
different categories, but in light of the results to be 
reported the ones used here do appear to be meaningful, 
‘The investigator using a free response approach prepar- 
atory to a fixed scale study must, of course, try as best 
he can to arrive at categories which are suitable to his 


particular research. 
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TABLE 1 


Content CATEGORIES AND EXAMPLES OF DIMENSIONS CONTAINED IN THEM 


Category 


1. Appearance of 0. 
2. O’s background. 


3. General information. 


4. O’s behavior toward S. 


5. O's behavior toward other 
people. 


6. Other’s reaction to O. 


7. S’s reaction to 0. 


8. O’s enduring personal char- 
acteristics. 


9. O's emotional adjustment and 
self-concept. 


10. O’s values and ideals. 
11. O’s abilities. 


12. O’s aspirations, ambition, and 
motivation. 
13, O’s interests. 


Dimensions 


Release ta build, features, dress, general good or bad looks, sex appeal, 

voice quality. 

Family solidit ts’ occupation, family status and wealth, nationality, 
a childhood « experiences (eg. “He was overshadowed by his sis- 
ter”). 

O’s present marital status, education, occupation, church preference, veteran, 
college, college year, where and how S met O if this is “merely informa- 
tion.” 

Specific mention of some behavior on the part of O toward S (e.g., “He has 
always been nice to me”), i 

Specific behavioral modes involving other people (which may or may not 
include S). Conformity, behavior reflecting some sort of general attitude 
(snooty, overbearing, etc.), degree of openness in interactions, willingness 
to be with people and to interact with them in the usual socially accepta- 
ble manner. This does not include judgments about O’s “nature” (egu 


“He has a friendly (not good) personality”). Examples are: charming 


hostess, poor manners, likes people, sy to get to know, etc. 

Opinions of O held by other people and their acceptance or nonacceptance 
of him. Such things as: people usually like him, he isn’t very popular, etc, 

General or global pronouncements about O. Such statements as: he has a 
good personality, I like him very much, the best friend I ever had, like- 
able, enjoyable to be around, easy to get along with, etc. This category 
also includes any guesses S makes about O’s future or about how things 
are going ane at present (e.g., he’ll never reach his goal, his life seems 
pleasant, etc.) 

Rather consistent personality factors which dictate the general color of his 
interactions with other le without actually being an instance of that 
behavior. Examples are: quiet type, has a sense of humor, extrovert, out- 
going, practical, conservative, gushy, I-don’t-care-attitude, sincere, seri- 


ous, etc. 

Emotional adjustment and O’s behavior toward himself from S’s point of 
view, not necessarily from a psychologist’s. Examples are: He’s confused, 
she feels sorry for herself, inferiority complex, mature, knows when to 
work and when to play, humble, wonderful outlook on life, spoiled, etc. 

Any set values or standards which O oe or fails to impose upon himself 
or generally holds as desirable in people. Examples are: religiosity, morals, 
political values, mercenary, drinking and smol habits, etc. 

O’s intellectual abilities, other abilities (mechanical, etc.), talents, scope of 
knowledge, proficiency, etc. Also achievements which reflect ability. This 
category does not include social abilities, 

Goals in life, direction in activity, motivation in tasks undertaken, etc. 


Interests aside from specific goals (occupation or studies). Not social unless 
a specific interest in working with or being with people (such as camp 
work). Also includes offices held and memberships in other than purely 
scholastic organizations. Examples are: likes sports, out for a good time, 
likes to have fun, belongs to social groups, has many hobbies, etc. 


thought in the description. A plus sign was used if E 
considered the phrase or thought to suggest a positive 
or favorable evaluation of O by S, a zero if it is was neu- 
tral, and a minus sign if it was negative or unfavorable. 
Number of units was then summed across rows and 
down columns. The row and column sums (and, for 
certain analyses, the cell sums), were converted into 
proportions of the overall total, to correct for individual 
differences in “wordiness” or total number of “units.” 
According to our assumptions, the proportion of total 
responses in each column provides an index of the rela- 
tive weights of the categories. The proportion of total 
responses in each row gives the relative response pro- 
ductivity for each O described (or for the self-descrip- 


tion). Comparable proportional transformations were 
undertaken for the evaluative symbols. 

These proportions make it possible to analyze the 
effects of the independent variables on category use. 
For example, to study how degree of acquaintance 
affects category use, one need only compare the average 
column proportion for well-known Os with that for not 
well-known Os for each S over the whole group of Ss. 
Or, to study response productivity as a function of 
degree of acquaintance, one compares average TOW 
proportions to see whether more was said about well- 
known than about not well-known Qs. Such analyses 
require the use of some statistical test such as ¢ or the 
Wilcoxon signed ranks tests. Nonparametric tests were 
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used throughout the analysis to avoid assumptions 
regarding variance and distribution. , 


RESULTS 


The following results provide information 
about each of the expectations stated above: 

The content of free verbal descriptions of people 
can be reliably analyzed. Reliability was assessed 
by interjudge agreement and by rescoring 
after a time interval; the retest results are also 
relevant to the reliability question. 

For the reliability analysis, agreement be- 
tween two scorings of the same S’s protocol 
was studied by cell-for-cell comparison of the 
two summary data sheets for the protocols. 
Fourfold tables were constructed, using the 
following cell entries: both judges agreed on 
the presence of the category in the description, 
Judge A said it was present but Judge B said 
it was not, Judge B said it was present but 
Judge A said it was not, and both judges agreed 
on the absence of the category. Since Vs were 
small (about 10 protocols for each reliability 
check), the fourfold tables were combined 
across Os for each of the 13 categories. The phi 
coefficients were computed on these composite 
fourfold tables to provide a quantitative index 
of agreement. Table 2 presents the results for 
each category and for all categories combined, 
for each reliability check. 

The table indicates that, if inter- and intra- 
judge agreements are considered, scoring relia- 
bility may well be considered satisfactory. 
Scoring of Categories 1, 2, 3, and 6, especially, 
seems reasonably reliable; scoring reliability 
of Categories 4, 7, 8, and 10 appears somewhat 
less satisfactory. The retest procedure in 
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general yielded far lower agreement. In the 
last rows of the table, a simple mean phi for 
each category over all reliability procedures 
(by Fisher z transformation) appears for 
purposes of indicating relative reliability of 
the different categories in general. 

The expectation that a given S’s weighting 
of categories for a particular kind of O and 
over various types of Os should be consistent 
over time was not clearly supported. As Table 
2 shows, although only 1 of the 13 phi’s was 
negative (p = .01) the mean phi coefficient 
over the 11 test-retest protocols was only .16. 
This coefficient is not significantly greater than 
zero, and indicates that there was no apprecia- 
ble degree of consistency from the first protocol 
to the retest. Various considerations may 
clarify this result. First, most Ss, when asked 
at the end of the retest session, reported that 
they had described some new Os in addition to 
some of the Os described during the first ses- 
sion. Examination of the retest protocols indi- 
cated that an average of 54% of the Os had 
not been described before. Second, such a test- 
retest procedure combines the error variance 
contributed by the judge with that contributed 
by the S and might therefore be expected to 
reduce the obtained phi coefficient. Third, the 
relative reliabilities of the categories for test- 
retest were comparable to those in the assess- 
ments of judge reliability. A Spearman rho of 
61 (p = .03) was obtained between the mean 
interjudge reliability for each category and the 
test-retest reliability for each category. The 
relative reliabilities of the categories thus 
appear to be similar in the two checks. 

Tt should be understood that this study was 


TABLE 2 
PHI COEFFICIENTS OBTAINED IN THE RELIABILITY ANALYSES 


Comparison 


Categories 


Test-Retest 

Mean phi (without re- | .91 
test, 

Mean phi (with retest) | .86 


k ele Saisie | ~ 
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exploratory and that the results to be reported 
in the following sections are illustrative rather 
than exhaustive. They serve to show that data 
do support each of the expectations outlined 
in the introduction. (In the nonparametric 
statistical tests reported here the rejection 
criterion was .05, two-tailed, unless otherwise 
stated.) 

Os different in certain respects tend to be 
described with different categories, different 
evaluative tone, and different amounts of informa- 
tion. All three aspects of this expectation were 
supported by a variety of results. 

1. Category Use. Example: The proportion 
of total responses made for well-known Os is 
higher in Category 1 (physical characteristics) 
than for Os who are not well known. 

2. Evaluative Tone. Example: There was a 
significantly greater proportion of negative 
evaluations of O’s motivation (Category 12) 
for lower status Os than for equal status Os, 
and for equal status Os than for higher status 
Os. 


3. Amount of Information. Example: The 
amount of information provided in a descrip- 
tion of an O varied with the degree of acquaint- 
ance. The largest proportion of units was used 
in self-description, a moderate proportion in 
describing well-known Os, and the smallest 
proportion when describing not well-known Os. 
Seventy-seven percent of the Ss exhibited this 
order of response productivity. 

These examples indicate that the selection 
of different kinds of Os to be described does 
seem to produce differences in the descriptions. 

Ss different in certain respects use different 
categories, different evaluative tone, and different 
amounts of information. Once again, a variety 
of findings supported this expectation. 

1. Category Use. Examples: Regardless of 
whether a male or a female O was being de- 
scribed, male Ss tended to use categories that 
emphasized physical appearance, background, 
occupation, values, abilities, motivation, and 
interests (Categories 1, 2, 3, 10, 11, 12, and 13) 
more than did female Ss. On the other hand, 
female Ss tended to use content categories 
emphasizing social skills and adjustment 
(Categories 5 and 9) more than did men. 

S’s age was also related to category use. For 
example, Category 3 (objective information) 
was used more by male Ss whose age was above 
the median than by those male Ss whose age 
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was below the median. (Male ages ranged from 
18-29, with æ mean of 21.94 and a standard 
deviation of 2.94.) 

2. Evaluative Tone. Examples: Females were 
more likely to use evaluative terms than were 
males, in describing physical characteristics 
of the O (Category 1). Increase in age was 
associated with decrease in negative evalua- 
tions in Category 6 (other people’s reaction to 
0). 


3. Amount of Information. Example: Under 
the conditions of the study, female Ss tended 
to be somewhat more productive than male Ss; 
the number of codable units in the entire 
protocol indicated a greater amount of informa- 
tion in descriptions made by females than in 
descriptions made by males. (This was signifi- 
cant only at the .05 level, by one-tailed test.) 

Descriptions vary as a function of the interac- 
tion of S and O characteristics. Such interactions 
were found to affect all three dependent varia- 
bles, category use, evaluative tone, and amount 
of information. 

1. Category Use, Examples: Category use 
followed a sex-sex, a sex-acquaintance, and a 
sex-status interaction. Thus male Ss made 
greater use of Category 11 (abilities) when 
describing male Os than when describing fe- 
male Os; on the other hand they made greater 
use of Category 7 (0’s behavior toward S) 
when describing female Os than when describ- 
ing male Os. Neither of these findings held for 
female Ss. 

Male Ss used Category 12 (motivation) 
more in describing well-known Os than in 
describing Os they did not know well, while 
female Ss used this category equally for both 
kinds of Os. Female Ss, on the other hand, used 
Category 13 (interests) more for well-known 
Os than for not well-known Os; male Ss did 
not show this tendency. 

Male and female Ss showed a relation be- 
tween use of Category 12 (motivation) and 
status of the O. Use of this category was greater 
for both equal and lower status Os than for 
higher status Os. An additional interactive 
pattern is that female Ss used Category 6 
(other people’s reaction to O) less for equal ani 
lower status Os than for higher, while male Ss 
used it less for lower status and equally for 
higher and equal status Os. 

2. Evaluative Tone. Example: A significant 
interaction was found between degree of 
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acquaintance with O and the evaluative tone 
of responses in Category 10 (values), as a func- 
tion of S’s sex. For male Ss an Increase in de- 
gree of acquaintance was associated with an 
increase in positive evaluations; that is, well- 
known Os were more likely to be given positive 
evaluations in regard to values than were Os 
who were not well-known. Descriptions by 
female Ss did not evidence this trend. 

3. Amount of Information. Example: Male 
Ss used a significantly greater proportion of 
units when describing male Os than when 
describing female Os, while female Ss did not 
differ in response productivity as a function of 
O's sex. 

These findings illustrate the interaction 
effects of the kind of Ss doing the describing 
and the kind of Os being described. 


Discussion 


Tn the preceding examples we have presented 
only some of the more striking instances of 
results to illustrate the variety of approaches 
that can be taken with this sort of strategy.’ 

Although it would be desirable to extend the 
application of free response techniques to a 
wider range of Ss, Os, and situations, these 
preliminary results indicate that the dimen- 
sions along which Ss make judgments of Os 
are indeed dependent upon the characteristics 
of the Ss, of the Os, and of the interaction 
between S and O variables. The present study 
suggests that in one situation it is possible to 
categorize Ss’ descriptions of Os adequately 
enough so that the differences between Ss, the 
differences between Os, and the effects of 
various interactions between S and O variables 
can be shown to produce significant differences 
in category use, evaluative tone of the descrip- 
tions, and the amount of information given 
in descriptions. 

These results suggest that when selecting 
scales or dimensions upon which S is to be 
required to make judgments about Os, one 
must keep in mind that these dimensions are 
not going to be applicable to all Os and Ss. S, 
it would seem, considers different dimensions 


5 Although the examples chosen here were of course 
selected from among a variety of analyses, it is unlikely 
that the differences reported were due to chance alone. 
Over one-third of all of the analyses (over 100) at- 
tempted on the data yielded differences significant at 
the .05 level or better, two-tailed. 
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appropriate for different kinds of Os. How one 
is to interpret responses made on “inappro- 
priate” dimensions is quite unclear. It is not 
impossible, for instance, that on these dimen- 
sions responses are made randomly. Or, per- 
haps responses on these inappropriate dimen- 
sions are based on stereotypes while responses 
made on dimensions that S considers appro- 
priate are more specific to the O being judged. 
This might help to explain some of the results 
in the person perception literature (e.g., Cline 
& Richards, 1960) which suggest that accuracy 
in judging the stereotype accounts for a good 
part of overall accuracy in making judgments 
about a given O. In fact, in an accuracy experi- 
ment it may be necessary to find those dimen- 
sions which S considers appropriate to O and 
those dimensions which O considers appro- 
priate to himself. It may be that only dimen- 
sions that are jointly considered appropriate 
yield a high degree of accuracy. 

In general, a failure to obtain the predicted 
experimental results in studies of person per- 
ception or person cognition could be a result 
of incorrect hypotheses or of the inappro- 
priateness of the dimensions selected for S to 
use in making responses concerning O. Conse- 
quently, it seems reasonable to attempt to find 
the dimensions which S considers relevant or 
appropriate to a given O or kind of O and then 
use these dimensions as the scales in the subse- 
quent fixed scale study. 


SUMMARY 

In person perception experiments subjects 
(Ss) have often been required to make judg- 
ments about other people (Os) on dimensions 
that may not be applicable or relevant from 
the S’s point of view. The results of such 
studies are, consequently, difficult to interpret. 
It has been argued that fixed scale research 
should be preceded by free response—content 
analysis studies to aid in decisions as to which 
scales or dimensions are appropriate for the Ss, 
the Os, and the situation involved. 

In an attempt to show that dimension use 
does differ for different Ss and Os, Ss were 
asked to describe persons they knew who fit 
the specifications of classes of Os designated 
by E. These descriptions were then content 
analyzed. 

The findings serve to support the contention 
that the dimensions used in judgments of other 
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people are a function of the Ss doing the judg- 
ing, the Os being judged, the situation, and the 
interaction among these variables. 

It was concluded that a prior free response— 
content analysis approach like the present one 
might well be considered logically and meth- 
odologically prior to the study of how inde- 
pendent variables affect ratings along fixed 
predetermined scales. 
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ADDRESS IN AMERICAN ENGLISH? 


ROGER BROWN anp MARGUERITE FORD 
Massachusetts Institute of Technology 


HEN one person speaks to another, 
the selection of certain linguistic 
forms is governed by the relation 

between the speaker and his addressee. The 
principal option of address in American English 
is the choice between use of the first name 
(hereafter abbreviated to FN) and use of a 
title with the last name (TLN). These lin- 
guistic forms follow a rule that is truly rela- 
tional. Their use is not predictable from proper- 
ties of the addressee alone and not predictable 
from properties of the speaker alone but only 
from properties of the dyad. Kinship terms of 
address (e.g., dad, mom, son) are also rela- 
tional language, but they constitute a restricted 
language of relationship since most dyads that 
might be created in America would not call for 
any sort of kinship term. Proper names, on the 
other hand, constitute a nearly universal 
language of relationship; the semantic dimen- 
sions involved serve to relate to one another 
all of the members of the society. 

Much of our knowledge of the dimensions 
that structure interaction has been obtained 
from paper-and-pencil performances such as 
the status ranking of acquaintances or the 
sociometric choice from among acquaintances. 
There can be no doubt that the dimensions 
revealed by such data are genuinely functional 
in ordinary life. However, performances elicited 
by the social scientist are not the everyday 
performances that reveal dimensions of social 
structure and cause each new generation to 
internalize such dimensions. It is always possi- 
ble that a paper-and-pencil task imposes a 
dimension or encourages an unrepresentative 
consistency and delicacy of differentiation. It 
is desirable to study social structure in every- 
day life, but much of the everyday behavior 
that is governed by social dimensions is diffi- 
cult to record and involves an uncertain num- 
ber of significant contrasts (e.g., influencing 
decisions, smiling at someone, warmth of 
manner). Forms of address are speech and so 

1This study was supported by a research grant 
(M-2649) from the National Institute of Mental Health 


of the National Institutes of Health, United States Public 
Health Service. 


there is a recording technique called writing 
which preserves most of the significant detail. 
English forms of address are reasonably well 
described by a single binary contrast: FN or 
TLN. These forms are ubiquitous and for any 
set of interacting persons it is possible to ob- 
tain a collection of address dyads from which 
the latent structuring dimensions can be 
inferred. This paper provides an example of the 
application of semantic analysis to the study of 
social structure. We begin by describing the 
norms of American address, then point out a 
certain pattern in these norms which is to be 
found in the forms of address of all languages 
known to us. This abstract speech pattern 
suggests a feature of social structure that may 
possibly be universal. 


METHOD 
Materials 

To discover thenorms of address in American English 
we require a large sample of usage. The range of the 
subject population is vast but the uniformity must be 
great. Some sensible compromise is required between 
the stratified national sampledictated by the scope of the 
problem and the unsystematic observation of one’s 
friends dictated by the probable simplicity of its solu- 
tion. Four kinds of data have been used. 

Usage in modern American plays. There are more 
instances of address in plays than in any other form of 
literature. Thirty-eight plays written by American 
authors, performed since 1939, and anthologized in 
three volumes of Best American Plays (Gassner, 1947, 
1952, 1958) were used. Of the plays in these volumes we 
omitted only those set in some remote time or ina 
country other than the United States. Without listing 
all the titles it is possible to indicate the range of Ameri- 
can usage that it represented. By geographic region we 
move from The Philadelphia Story to the Southland of 
The Member of the Wedding through the midwest of 
Picnic to the Far West of The Time of Your Life; by 
social class from The Rose Tattoo and A View from the 
Bridge through Death of o Salesman and Tea and 
‘Sympathy to The Solid Gold Cadillac. For military usage 
there are such plays as Mister Roberts and No Time for 
Sergeants. The speech of Jewish, Italian, and Negro 
minorities is represented and also that of such groups 
as politicians, corporation executives, narcotics addicts, 
college professors, and policemen. 

Of course these materials are not a record of actual 

from the characters named but are the speech 
constructed for such characters by playwrights. Prob- 
ably playwrights accurately reproduce the true norms 
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of address, and it is possible to check one author against 
another. In addition, however, there are three other 
kinds of data to catch possible inaccuracies in this first 
set. 

Actual usage in a Boston business firm. For 2 months 
a man employed in a drafting firm took advantage of 
leisure moments to jot down for us instances of linguistic 
address overheard from his fellow workers. He collected 
address terms for 214 different dyads in which 82 
different people are involved either as speaker or ad- 
dressee. Each person is identified by full name, sex, age, 
and position in a 12-level occupational hierarchy. In 
addition, we benefited from our informant’s good 
knowledge of friendship patterns among the dramatis 
personae. 

Reported usage of business executives, Each year 
there is at MIT a group of Alfred P. Sloan Fellows; 
these are business executives between 30 and 38 years 
of age, who are nominated by their respective employers 
to study for one year at the institute. The 34 Sloan 
Fellows designated for 1958-59 served as informants 
for this study. They had come from many different 
parts of the country, most of them from very large 
corporations but a few from small companies. 

By the time the Sloan Fellows served as informants 
the general pattern of the norms had become sufficiently 
clear to make possible the writing of a questionnaire 
designed to elicit the most important information. 
Each man was asked to write down the full names and 
positions of four persons whom he was accustomed to 
see nearly every day at his place of business, and he 
was to distribute his selections so as to include: one 
person equal to himself in the organization hierarchy 
with whom he was on close or intimate terms, one person 
equal to himself with whom he was on distant or formal 
terms, one person superior to himself in the organization 
hierarchy, one person subordinate to himself in the 
hierarchy. After listing the names the informant was 
asked to write down for each person listed the exact 
words that he (the informant) would customarily speak 
in greeting that person for the first time each day. In 
addition, the Sloan Fellows were asked some other 
questions that will be described when they become 
relevant. 

Recorded usage in Midwest. The Psychological Field 
Station of the University of Kansas directed by Roger 
G. Barker and Herbert F. Wright (1954) kindly allowed 
us to make an extended study of 10 “specimen records,” 
each of which records the events and conversation in a 
full-day of the life of a child. The station has also al- 
lowed us to work with a set of brief “behavior settings 
observations” made on 56 children in the town called 
Midwest and on 56 children in Yoredale, England, 
matched with those in Midwest by sex and age. In 
these materials we have been primarily concerned with 
a grammatical analysisof the kinds of “mands” 
addressed to children (for another study of the language 
of social relationship) but have, in addition, taken the 
opportunity to check our conclusions about American 
address against the Midwest records. 


Procedure 


For each of the 38 plays every instance of address 
was recorded, together with an identification of the 
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speaker and addressee. The method of study is a so 
controlled induction.? Approximately one-third of 
plays were first èxamined in an effort to discover 
that would summarize all of the instances of a 
they contained. The resulting provisional rules 
then tested against a second set of plays and und 
some revision. The revised rules proved adequate 
description of all instances of address in a third 
set of plays. The supplementary data from a busi 
firm, from the Sloan Fellows, and from the M 
records were used as additional checks on the 
induced from the plays and also to test several pai 
ticular hypotheses. 


Major PATTERNS OF ADDRESS 


If we consider only the FN and TLN 
are just three logically possible dyadic 
terns: the reciprocal exchange of FN, 
reciprocal exchange of TLN, and the n 
reciprocal pattern in which one person uses 
and the other TLN. All three patterns oc 
with high frequency and the problem is o 
inducing the semantic factors invariably 
ciated with a given pattern and serving í 
distinguish it from the others. = ~ 

In classifying instances of address into th 


first names (e.g., Robert), familiar abb 
tions (e.g., Bob), and diminutive forms 
Bobbie). It may be said at once that male 
names in American English very seldom occu 
in full form (Robert, James, or Gerald) but a 
almost always either abbreviated (Bob, Jin 
or diminutized (Jerry) or both (Bob 
Jimmy). Female first names are more o; 
left unaltered. Titles for the purpose of th 
classification include, in addition to Mr., Mrs. 
and Miss, such occupational titles as 
Senator, Major, and the like. 


Two Reciprocal Patterns 


The vast majority of all dyads in the play 
exchange FN (Mutual FN). Indeed, where 
actual name is not known there occur som 
times what may be called generic first name: 
these include the Mack, Jack, and Buddy í 
taxi drivers. Mutual TLN is most commonl 
found between newly introduced adults. Th 


? Since address involves two persons and, very ofti 
a choice between two linguistic forms, we believe 
should be possible to treat the complete set of ad dr 
dyads for any group of persons as a matrix of Į 
comparisons from which the dimensions structt 
group could be rigorously derived. We are at p 
trying to develop such a method. 
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distinction between the two patterns is pri- 
marily one of degree of acquaintance with the 
degree required for the Mutual FN being less 
for younger people than for older people and 
less where the members of the dyad are of the 
same sex than where they are of different sex. 

It seems likely that the two reciprocal pat- 


` terns are on a dimension that ranges from 


acquaintance to intimacy. However, in modern 
American English the distance between the two 
points is small with the Mutual FN usually 
representing only a very small increment of 
intimacy over the Mutual TLN; as small 
sometimes as 5 minutes of conversation. Be- 
cause the segment of the line that lies between 
the two patterns is usually so very short, it is 
not easy to make out its exact character. How- 
ever, in English of the pastè and in cognate 
languages today the Mutual FN is farther 
displaced from the Mutual TLN, and from 
these cases and other materials later to be 
presented we can hazard a characterization of 
intimacy. Intimacy is the horizontal line be- 
tween members of a dyad. The principal fac- 
tors predisposing to intimacy seem to be shared 
values (which may derive from kinship, from 
identity of occupation, sex, nationality, etc., 
or from some common fate) and frequent con- 
tact. Among the behavioral manifestations of 
intimacy, a relatively complete and honest 
self-disclosure is important. 


Nonreciprocal Pattern 


In this case one member of the dyad says 
FN and the other TLN. There are two kinds 
of relation that can generate this pattern. The 
first is a difference of age: children say TLN 
to adults and receive FN; among adults an 
elder by approximately 15-or-more years re- 
ceives TLN and gives FN to his junior. The 
second is a difference of occupational status: 
this may be a relation of direct and enduring 
subordination (e.g., master-servant, employer- 
employee, officer-enlisted man); it may be a 
relation of direct but temporary subordina- 
tion, involving someone in a service occupa- 
tion (e.g., waiter, bootblack) and a customer; 
lt may be an enduring difference of occupa- 


è In six American plays (Quinn, 1917) written be- 
tween 1830 and 1911 the reciprocal FN between adults 
clearly implies a much longer and closer acquaintance 
than it does in contemporary usage. 
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tional status that does not involve direct sub- 
ordination (e.g., United States senators have 
higher status than firemen). If the intimacy 
dimension that governs reciprocal address is 
the horizontal of social relationship, then the 
status dimension that underlies the non- 
reciprocal pattern may be called the vertical 
of social relationship.‘ 

Age and occupational status are correlated 
and most instances of nonreciprocal address 
involve congruent differences on the two di- 
mensions. There is, however, proof that a 
difference on either dimension alone is able 
to generate the nonreciprocal pattern. The 
proof is the existence of nonreciprocal dyads 
matched on one dimension but not on the 
other. 

Because there are two criteria for the assign- 
ment of status and, in addition, the correlation 
between the two criteria is not perfect, we 
must ask what happens to address in dyads 
where the elder has the humbler occupation, 
There are in the plays numerous instances in 
which the criteria oppose one another: an 
adolescent girl and her family’s middle-aged 
cook, a young navy ensign and a middle-aged 
enlisted man, a young executive and an elderly 
janitor. In all such cases address is in accord- 
ance with occupational status and so it would 
appear that there is a normative rule of pri- 
ority for the two criteria. It is to be expected 
in a society whose values are more strongly 
linked to achieved personal attributes than to 
ascribed attributes (Parsons, 1951) that occu- 
pation would prevail over age in the deter- 
mination of deference. 

The three sets of data in addition to the 
plays confirm the generalizations made above. 
Sloan Fellows call almost all of their business 
acquaintances by FN and expect this address 
to be reciprocated. In the few cases where one 
of these men would say TLN and expect to 
receive FN the other member of the dyad was 
invariably an organizational superior and was 
also the elder. In the Boston drafting firm, 
also, most address is on the pattern of Mutual 


4 Kinship terms of address in American English 
(Schneider & Homans, 1955) also show a nonreciprocal- 
ity of status. Members of ascending generations are 
commonly addressed with kinship titles (mother, father, 
grandmother, grandfather, uncle, aunt) but respond 
by calling their children, grandchildren, nephews, and 
nieces by FN. 
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FN. The few cases of Mutual TLN involved 
persons who were scarcely acquainted and 
who were well matched by age and position. 
There were 40 different instances of non- 
reciprocal address. In 36 of these the recipient 
of TLN was the organizational superior and 
elder; in 28 cases he was a member of one of 
the four top executive ranks whereas the re- 
cipient of FN was a member of one of the 
eight unionized ranks. In three cases the or- 
ganizational superior was younger than his 
subordinate and it was the superior, not the 
elder, who received TLN. The single remain- 
ing case involved a pair matched by rank but 
not by age and in this case the elder received 
TLN. Midwest children participate in non- 
reciprocal address dyads with parents and 
teachers and in Mutual FN dyads with other 
children. 


VaRIANT Forms oF ADDRESS 


In this section are offered tentative charac- 
terizations of the several common forms other 
than FN and TLN. At a later point in the 
paper we will return to these variants and 
suggest the nature of their relationship with 
the major patterns. 


Title Without Name 


Commonly used titles (T) include sir, 
madam, ma’am, and Miss. In general these 
forms are used like TLN; either reciprocally 
between new acquaintances or nonreciprocally 
by a person of lower status to a person of 
higher status. The address form T is probably 
a degree less intimate and a degree more defer- 
ential than TLN. It may, for instance, be 
used reciprocally where acquaintance is so 
slight that the last name is not known, In 
nonreciprocal military usage between ranks 
the TLN may be used to immediate superiors 
but the T to remote colonels, generals, com- 
manders, and admirals even though the names 
of these superiors are well known. 

Beyond this general characterization as a 
minimally intimate and maximally deferential 
form particular varieties of T have their 
specialized uses. The form ma’am is most 
commonly heard from young men to mature 
women. Schoolchildren in Yoredale, England 
preface almost all address to a teacher by 
saying: “Please, Miss . . . ” In parallel circum- 
stances Midwest children use TLN. 
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Last Name Alone 

An occasional person is regularly a 
by LN. This seems usually to occur wh 
FN is polysyllabic and has no fan 
breviation whereas the LN is either a me 
syllable or readily transformed into a me 
syllable. In these circumstances LN is si 
a substitute for FN and patterns in 
fashion. 

Where the LN is not the usual form 
addressee it represents a degree of i 
greater than TLN but less than FN, 
tary usage enlisted men receive the L] 
officers when they are little acquain 
creased familiarity leads to the FN doy 
though not upward. Elderly and 
tinguished professors sometimes begin 
to junior colleagues whom they know fi 
well: “Dear Jones...” The form is 
ciprocated in this case. 

Reciprocal LN is common between | 
men until they become acquainted. 
during reciprocal LN seems always to 
a mutual antagonism that blocks pr 
to intimacy. In The Caine Mutiny Cow 
tial, for instance, Greenwald and Ki 
change LN though the degree of acquaini 
and their ages would normatively p 
Mutual FN. In the climax of this play Gt 
wald dashes a drink in Keefer’s face. 
Philadelphia Story Kittredge and Hay 
change LN; the former is the husband 
of Tracy Lord and the latter her form 
still-interested husband. 


Multiple Names 


A speaker may use more than one 
the proper name for the same addressee 
times saying TLN, sometimes FN or 
nickname, sometimes creating phone! 
ants of either FN or the nickname. We 
here interested in the business of tem: 
gression through the possible forms 
earlier terms are dropped as new te 
taken up. The case of multiple names 
is the case in which two or more vi 
the proper name are used in free 
with one another. 

The instances of MN in the plays s 
that this form represented a greater 
intimacy than the FN, but degrees of in 
are not easily judged and so we deci 
put the hypothesis to a more direct te: 
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test involved individual interviews with 32 
MIT male undergraduates. Eack subject was 
asked to think of four men of about his own 
age all of whom he had met for the first time 
approximately one year ago. In addition, it 
was to be the case that the subject had had 
about equal opportunity to get to know all of 
these men and yet now was to find himself on 
close, friendly terms with some of them and 
on more distant terms with others. The sub- 
ject was asked to write down the full names of 
the four men and then to record the name 
by which he usually addressed each one. When 
this had been done, the subject was asked, for 
each acquaintance, whether he ever addressed 
the man in any other way and all the names 
currently being used were noted down. 

In an earlier paragraph it was suggested 
that one of the meanings of intimacy is a rela- 
tively complete and honest disclosure of the 
personality. Jourard and Lasakow (1958) have 
devised a Self-disclosure questionnaire which 
requires the subject to indicate whether or 
not he has discussed with a designated other 
person each of 60 topics classifiable under the 
headings: Attitudes and opinions, Tastes and 
interests, Work (or studies), Money, Personal- 
ity, and Body. Our 32 subjects filled out this 
questionnaire for each acquaintance. For each 
subject, then, we ranked the four acquaint- 
ances in order of decreasing Self-disclosure and 
we take this to be an order of decreasing in- 
timacy. It generally corresponds with the 
subjects’ ordering of the four into friends and 
acquaintances. 

Across all subjects we combined the 32 
closest friends, those eliciting highest Self- 
disclosure scores (Self-disclosure 1) and also 
the 32 acquaintances with Self-disclosure rank 
orders 2, 3, and 4. For each rank group we 
determined the number of cases addressed by 
MN (2 or more proper names) and the num- 
ber addressed by FN alone. The results appear 
in Table 1. The cases of MN decline as in- 
timacy declines. Using the sign test and a 
one-tailed hypothesis it was determined that 
the Self-disclosure 1 group contained more 
cases of MN than did the next most intimate 
group (Self-disclosure 2) with p = .020. The 
difference between the most intimate and the 
least intimate (Self-disclosure 4) group shows 
more cases of MN in the former group with 
$ < .004, 
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TABLE 1 


FREQUENCIES OF Two Forms OF ADDRESS TO 
ACQUAINTANCES AT DIFFERENT LEVELS 
OF SELF-DISCLOSURE 


Self-dis- | Self-dis- | Self-dis- | Self-dis- 
closure 1 | closure 2 | closure 3 | closure 4 


Multiple Names 18 10 7 3 
First Name 14 22 25 29 


Note.—Self-disclosure 1 vs. Self-disclosure 2 by sign test, $ = 
.020; Self-disclosure 1 vs. Self-disclosure 4 by sign test, p < .004. 


One informant addressed his closest friend 
whose name is Robert Williams as Williams 
or Robert or Bob or Willie and his next closest 
friend whose name is James Scoggin as Scog- 
gin, James, Jim, or Scoggs. Many informants 
reported that they sometimes playfully ad- 
dressed a good friend by TLN. Others used 
playful, and usually pejorative, phonetic varia- 
tions: Magoo for Magee, Katool for Katell, 
Lice for Leis. 

The tendency to proliferate proper names 
in intimacy is interesting because it accords 
with a familiar semantic-psychological princi- 
ple. For language communities the degree of 
lexical differentiation of a referent field in- 
creases with the importance of that field to 
the community. To cite a fresh example of 
this kind of thing, Conklin (1957) reports that 
the Hanunéo of the Philippine Islands have 
names for 92 varieties of rice which is their 
principal food. In naming ferns and orchids, 
with which they are little concerned, the 
Hanunéo combine numerous botanical species 
under one term whereas the rice they differen- 
tiate so finely is for the botanist a single spe- 
cies. Within a language community Brown 
(1958) has pointed out that a speaker more 
concerned with a given referent field will make 
finer lexical distinctions than a speaker less 
concerned with that field (botanists have more 
names for plants than do psychologists). In 
the referent field composed of other persons 
we have seen that where contact and concern 
are minimal and distance greatest, titles alone 
are likely to be used in address. To call some- 
one Miss or sir is to address the person on a 
categorical level which does not establish the 
addressee’s individual identity. The proper 
name constitutes the individual as a unique 
organism. Beyond the single proper name, 
however, where interest is still greater the 
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individual is fragmented into a variety of 
names. Perhaps this differentiation beyond in- 
dividuality expresses various manifestations 
or ways of regarding someone who is close 
(Brown, 1959). 


A GENERAL SYSTEM OF ADDRESS 


How are these various forms of address re- 
lated to one another? Consider first only the 
three major patterns. The Mutual TLN goes 
with distance or formality and the Mutual FN 
with a slightly greater degree of intimacy. 
In nonreciprocal address the TLN is used to 
the person of higher status and the FN to the 
person of lower status. One form expresses 
both distance and deference; the other form 
expresses both intimacy and condescension. 
Within the limits of two dyadic address forms 
there is a formally or logically possible alterna- 
tive pattern. The form used mutually between 
intimates could be used upward to superiors 
and the form used between distant acquaint- 
ances could be directed downward to subordi- 
nates. Because there is an alternative the pat- 
tern actually found is not a formal necessity 
but rather an empirical fact; a fact, we shall 
see, of great generality. 

Several years ago we began our general 
studies of the language of social relationship 
with a selection point that does not occur in 
modern English but which does occur in the 
other Indo-European languages. In French, 
for example, a speaker must choose between 
two second person singular pronouns; his ad- 
dressee may be addressed as /u or as vous. In 
German the comparable forms are du and Sie; 
in Italian ¢w and Lei. In English of the past, 
from about the 13th century until the 18th, 
there was a cognate option of address; the 
choice between thou and ye. We have studied 
the semantic rules governing these pronouns 
in 20 languages of Europe and India, compar- 
ing one language with another and the usage 
of earlier centuries with later (Brown & 
Gilman, 1960). For our present purposes the 
important point is that these pronouns in all 
the languages studied follow the same abstract 
pattern as the FN and TLN. 

In discussing the pronouns of address let us 
use T as a generic designator for pronouns of 
the type of tu and du and V as a designator 

for pronouns of the type of vows and Sie. 
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Mutual V is the form of address for adult ne 
acquaintances; it begins where TLN bi 
at the temporal point of origin of the d 
Mutual T like Mutual FN is an exp; i 
increased intimacy but it is, for most E 
peans, much farther along the line than is 
Mutual FN of Americans. From medi 
times into the present century non ) 
T and V was the pattern for those unequ: 
status with the superior receiving the V 
the subordinate the T. In recent times 
nonreciprocal use of the pronouns has 
declined because of a conscious egalitari 
The important point is, however, that w 
the nonreciprocal pattern has been used a 
where from southern India to Scandi 
the downward directed form has been 
intimate T and the upward directed the 
tant V. ‘ 
It may be that the abstract linkage in p 
sonal address of intimacy and condescen: 
distance and deference is a linguistic uni 
but we certainly do not know that as 
We do know that the linkage occurs a 
some non-Indo-European languages 
Japanese second person pronouns). In 
in those few languages for which we hi 
found adequate descriptions of the s 
of address, no violations of the abstract 
tern have yet appeared. 
It seems also that the pattern applies’ 
more than names, titles, and pronouns. T 
Sloan Fellows’ greetings to their business 
ciates almost invariably used the FN. 
ever, the greetings themselves were 
varied; including Hi, Morning, Good 
ing, Hello, Howdy, etc. Only Hi and Go 
Morning occurred with sufficient frequél 
to make possible the discovery of a pal 
The Sloans reported on greetings to 
classes of associate: Equal and Intimati 


ordinates while Good Morning is for 
acquaintances and superiors. Using the McN 
mar test for the significance of chan 
related samples with the Yates corre 
continuity, the difference between intime 
and acquaintances is significant with $ 
-0025 and the difference for subordinates à 
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TABLE 2 


Two Forms oF GREETING FoR Four 
CLASSES OF ÅSSOCIATÊ 


Equal and Intimate 


Good Morning Hi 
Equal and Distant 
Hi 0 4 
Good Morning 4 10 
Subordinate 
Good Morning Hi 
Superior 
Hi 1 3 
Good Morning 13 8 


Note.—With McNemar test for the significance of changes in 
related samples (employing Yates correction) x? for Intimate vs. 
Distant = 8.10, p < .0025 and x? for Subordinate vs. Superior = 
4.00, $ < .025, 


superiors with p < .025. In both cases a one- 
tailed test was used since the direction of the 
differences was predicted by the abstract 
pattern of address, The records of actual 
usage in a Boston firm accord with this finding. 
In one revealing instance a workman was 
greeted “Hi” and promptly answered “Hi,” 
but as he turned and recognized the boss, he 
added “Good morning.” 

Why should the abstract pattern described 
govern address between two persons? A curi- 
ous fact about contemporary use of T and V 
provides a clue. While the nonreciprocal pat- 
tern for pronouns has generally been aban- 
doned in Europe, inequality of status con- 
tinues to affect one aspect of usage. Dyads 
begin at the Mutual V and, with time, may 
advance to the intimacy of Mutual T. For 
many the shift from V to T is an important 
rite of passage. The Germans even have a 
little informal ceremony they call the Bruder- 
schaft. One waits for a congenial mood, a 
mellow occasion, perhaps with a glass of 
wine, and says: “Why don’t we say du to one 
another?” The new usage is, of course, to be 
reciprocal. However, there is one necessarily 
nonreciprocal aspect of the occasion—someone 
Must make the suggestion. When there is a 
clear difference of status between the two 
the right to initiate the change unequivocally 
belongs to the superior—to the elder, the 
richer, the more distinguished of the two. The 
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gate to linguistic intimacy is kept by the per- 
son of higher status. 

The norms of English address also make a 
pattern in time. A dyad must, with time, either 
increase its total amount of contact or else 
dissolve. Since the Mutual TLN represents 
less contact than the Mutual FN if Mutual 
TLN is to occur in a given dyad it must occur 
at an earlier time than the Mutual FN. The 
place of the nonreciprocal pattern in time is 
between the other two and it may be under- 
stood as a step from Mutual TLN in the direc- 
tion of Mutual FN; a step which, like the sug- 
gestion of the Bruderschaft is taken first by 
the superior. Many dyads will linger for a 
very long time—possibly the life of the dyad— 
in the nonreciprocal pattern. In this circum- 
stance the pattern gives enduring expression 
to an inequality of status. 

Consider a familiar sort of example. A pros- 
pective graduate student arrives at a univer- 
sity to meet some of the faculty of the psychol- 
ogy department and is interviewed by the 
chairman. Probably the two will initially ex- 
change TLN. In the course of the day or, if 
not, shortly after the student enrolls, the 
chairman will begin to call him by FN. He 
extends the hand of friendship, but the student 
knows that it behooves him not to grasp it 
too quickly. The student will continue with 
the TLN for several years (4 is probably the 
mode) and in this period the nonreciprocality 
of speech will express the inequality of status. 
If the chairman is neither very elderly nor 
very august the student will eventually feel 
able to reciprocate the FN and the dyad will 
have advanced to Mutual FN, The three pat- 
terns may be described as a progression in 
time (see upper portion of Figure 1) if we add 
several important qualifications. 

Not every dyad passes through all three 
steps. There are some that begin at each of 
the three points: adults of equal status with 
Mutual TLN, master and servant with non- 
reciprocality, young people with Mutual FN. 
Tn addition, not every dyad that continues to 
exist will necessarily advance to Mutual FN. 
In North Carolina until 1860 the Negro slave 
said TLN to his master and was told FN 
(Eliason, 1956), and there was no change with 
time. There is a final qualification concerning 
the progression of address in time. Even when 
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Fic. 1. Graphic models of the progression of address 
in time (from left to right). (The upper portion of the 
figure represents the major progression; the lower 
portion represents the full progression.) 


relationships do develop in intimacy they will 
not necessarily pass through the intermediate 
nonreciprocal stage. When the members of a 
dyad are not of clearly unequal status, they 
will advance at the same time to Mutual FN. 

The general statement that can be made is: 
if an address pattern changes in time it will 
change in the direction of Mutual TLN —> 
nonreciprocal TLN and FN —> Mutual FN 
though a step may be skipped. Even this state- 
ment cannot stand without a little more ex- 
planation. There are special circumstances in 
which the direction of movement of address 
can be reversed. If a person of lower status 
Seems to move too fast to the reciprocation of 
the FN a superior may step back from his 
use of FN to TLN. The day after a convivial 
office party a breezy young clerk calls out to 
the president: “Morning, Jack!” and in icy 
tones the president replies: “Good morning, 
Mr. Jones.” The person of lower status must 
never use a more familiar form of address than 
the person of higher status and the backstep 
by the superior puts the subordinate in just 
that position from which he will usually with- 
draw to the propriety of TLN. 

The variant forms of address discussed at 
an earlier point in this paper seem to function 
as additional optional steps in the Progression 
of address.® Title alone (T) probably is the 


* This paper does not discuss all American English 
forms of address but only those that have clear positions 
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formal extreme since we find in our data that 
dyads which begin as Mutual T often change 
with time to nonreciprocal TLN downwards 
and T upwards. The last name alone (LN) 
must be intercalated between TLN and FN 
since LN downwards is found in combination 
with TLN upwards and LN upwards is found 
in combination with FN downwards. The use 
of multiple names (MN), we have seen, repre- 
sents the intimate extreme. Making these 
additions to the major three-step progression 
we arrive at the full progression of Figure 
1 (lower portion) in which each new step 
towards intimacy is initiated by the superior 
who is, therefore, not just the gatekeeper to 
Mutual FN but the pacesetter for all linguistic 
advances in intimacy. 

The qualifications that apply to the major 
progression apply also to the full progression. 
Dyads may begin at any point, need not move — 
at all, and if they move may skip steps. There — 
is an additional qualification that results from 
the multiplication of forms. Any less formal 
term may be used downwards in a nonrecipro- 
cal pattern with any more formal term being 
used upwards. Thus we find not only the TLN 
with the LN but also the T with LN or even 
with FN as when a captain calls an enlisted 
man Jones or John but is called captain. 

Is there any way to test the accuracy of the 
full progression pictured in Figure 1? We have 
found no dyads that pass through the entire 
Progression but only dyads moving in one or 
another limited region. By the time the third 
and final set of plays was ready for analysis, 
the full progression had been constructed. If 
the progression, with all its qualifications, is 
accurately descriptive of American English 
practice, one should find certain kinds of ad- 
dress combinations and not others. Any ad- 
dress form can be used mutually (5 possible 
combinations). When there is a clear differ- _ 
ence of status we may have any nonreciprocal 
pattern that combines a less formal term 
downwards with a more formal term upwards 
(10 possible combinations). With a clear status 
difference we may not have any nonrecipr 
pattern that combines a more formal term 
in the intimacy and status pattern. There is, for in- 
stance, the use of the completename (John Jones or even 
John Montgomery Jones) which is used as an intensifier 
and is particularly favored by mothers “manding 
disobedient children. 
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downwards with a less formal term upwards 
(10 impossible combinations). In the last set 
of plays instances of all the possible combina- 
tions occur and no instances of the impossible 
combinations. The static predictions of the 
model are validated. 

The construction also generates a set of 
dynamic predictions. When address changes 
(if there is not either a reproving or a jocular 
intent) movement must be in the direction 
from left to right in the drawing of Figure 1 
though it need not be to the immediately ad- 
jacent position. Changes of address in the 
plays are surprisingly infrequent and seldom 
involve more than two steps. Only seven differ- 
ent kinds of change occurred but all of these 
are included among those defined as possible 
by the model. One “impossible” change oc- 
curred in the play Born Yesterday. Billie 
Dawn, the junk dealer’s mistress, in speaking 
with the wife of a United States Senator who 
is many years her senior initiates movement 
from Mutual TLN to a more intimate form. 
This violation of the norms is greeted with 
general shock and is a device that helps to 
establish Billie Dawn’s ingenuous vulgarity. 
Insofar as the limited occurrences of change 
in address permit, then, the model is confirmed 
in its dynamic aspect. 


STATUS AND INTIMACY IN 
Sociat RELATIONS 


Two persons of unequal status may be con- 
ceived as two points on a generalized value 
scale of the sort used by Osgood and Tannen- 
baum (1955) for the congruity model which 
they have used to predict attitude change. 
The person of superior status has, of course, 
the greater value on such a scale. Movements 
towards intimacy of address in terms of the 
congruity model are acts of association. Such 
acts, the model predicts, will cause the objects 
of unequal value to move towards one another; 
in the case of address between persons this 
means that the value of the inferior is en- 
hanced by intimate association with a superior 
while the value of the superior is diminished. 
The prediction is good intuitive sense. But 
now the interesting thing is that this model 
seems to call for the abstract pattern of ad- 
dress that does not occur. Since the person of 
lower status has a motive for initiating in- 
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timacy and the superior has none the intimate 
form ought, in nonreciprocal address, to be 
used upwards and the distant form to be used 
downwards. This is not the pattern which we 
have found in all of our materials but, is 
rather, the formal alternative to that pattern 
and we have nowhere found this alternative 
to be the operative norm. 

The abstract design of address is not a di- 
rect expression of the realities of status and 
intimacy but is rather a denial of the realities. 
The pattern might have been designed to 
minimize the pain of invidious status distinc- 
tions. The person of lesser value may be pre- 
sumed to be ever ready for association with 
a person of higher value but the person of 
higher value must be presumed to be less 
ready. If the person of lesser value were to 
initiate associative acts, he would run the risk 
of rebuff; if the person of higher value initiates 
such acts there is no such risk. The superior, 
then, must be the pacesetter in progression to 
intimacy. If there is to be no progression but 
rather an enduring nonreciprocal expression of 
inequality, this expression is not so disagree- 
able as it might be since the superior offers 
intimate address and it is the inferior who 
demurs. The abstract pattern minimizes the 
pain that could be involyed when persons are 
to be related on two dimensions; the horizontal 
of intimacy and the vertical of status. 

The person of higher status is, we believe, 
the pacesetter not in linguistic address alone 
but in all acts that increase intimacy. The 
Sloan Fellows who served as informants for 
us responded to two questions that concern 
nonlinguistic moves towards intimacy. Each 


TABLE 3 


COMPARATIVE WILLINGNESS TO INITIATE ACTS OF 
INTIMACY WITH Four CLASSES OF ASSOCIATE 


More Wil-| More Wil-| nrore Wil- More Wil- 


ith | ling with | More W 
equal and [Equal nnd | ling witt | ling with 
timate | Distant Than |Than with 
[Than with [Than with nate Than Than with 
Distant | Intimate dott mate 
Auaste | Assoctats Superior nate 
Borrow Comb 7 0 A 5 
Hand on 11 3 a i 
Shoulder 


Note.—By sign test Intimate vs. Distant on Borrow Comb, 
= .008; on Hand on Shoulder, p = .029. Subordinate vs, Superior 
on Borrow Comb, p = .033; on Hand on Shoulder, p < .002. 
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question was answered with reference to each 
of the four persons listed by the informant. 
One of these persons, you may remember, was 
a Superior, one a Subordinate, one Equal and 
Intimate, one Equal and Distant. The inform- 
ant indicated ona scale from 1-5 how willing 
he would be to behave in the designated man- 
ner with reference to each person. The Num- 
ber 1 represents maximal willingness (‘Defi- 
nitely would”) and 5 minimal willingness 
(“Definitely would not”). The questions are: 


1. In ordinary circumstances would you be willing 
to ask “X” for the loan of his comb? 

2. Suppose “X” were feeling very unhappy about 
something or other. Would you put your hand on his 
shoulder in a reassuring way? 


The results appear in Table 3 and show a 
familiar pattern. Informants were more will- 
ing to borrow a comb from, and to put a hand 
on the shoulder of, an associate who was 
Equal and Intimate than one Equal and Dis- 
tant and also more willing to initiate these 
acts of intimacy with Subordinates than with 
Superiors. Using a sign test and a one-tailed 
hypothesis the comb question differentiates 
intimate from remote associates with a p = 
008 and Subordinates from Superiors with a 
p = .033; the hand-on-shoulder question 
differentiates intimate from remote associates 
with a p = .029 and Subordinates from Su- 
periors with a p < .002. These nonspeech acts 
follow the pattern of TLN and FN; of T and 
V; and of Hi and Good Morning. Perhaps all 
kinds of associative behavior can be placed 
on a rough scale of intimacy and in the pro- 
gression over this scale of a dyad the superior 
may always be in advance. 

With a good sized inequality of status the 
use of FN by the higher does not, of course, 
justify immediate reciprocation from the 
lower. The lower must wait for the initiation 
of additional acts of intimacy before taking 
a step on his own. It is very likely that the 
normative lag in intimacy increases with the 
degree of status inequality. However, the 
norms are not always perfectly clear; graduate 
students will sometimes be uncertain whether 
the time has come to say FN to a professor, 
employees will wonder whether they know 
their bosses well enough to use the familiar 
form. When someone is in this region of un- 
certainty, we find that he avoids the use of any 
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sort of personal name and makes do with the 
uncommitted gmnibus you. 


SUMMARY 


The semantic rules governing address in 
American English are worked out from a 
varied collection of data that includes usage in 
American plays, actual usage in a Boston 
business firm, reported usage of business 
executives from various cities in the United 
States, and usage recorded in a midwestern 
American town. The most common address 
forms are the first name (FN) and the title 
plus last name (TLN). These function in 
three sorts of dyadic pattern: the Mutual 
TLN, the Mutual FN, and the nonreciprocal 
use of TLN and FN. The semantic distinction 
between the two mutual patterns is on the 
intimacy dimension with Mutual FN being 
the more intimate of the two patterns. In the 
nonreciprocal pattern a distinction is made 
in terms of status with the higher saying FN 
and the lower TLN. The practice of using 
the intimate form to a subordinate and the 
distant form to a superior also governs the 
use of pronouns of address in many languages 
as well as the use of certain conventional 
greetings. It is suggested that this very general 
pattern prevails because in the progression 
towards intimacy of unequals the superior is 
always the pacesetter initiating new moves in 
that direction. The superior is the pacesetter 
because the willingness of the person of lower 
status to enter into association can be taken 
for granted and there is little risk that a 
superior will be rebuffed whereas the risk 
would be great if the inferior were to initiate 
acts of association. Such variant forms of 
address as the title alone, the last name alone, 
and the use of multiple names are fitted into a 
model that purports to describe the temporal 
progression of address from acquaintance to 
friendship. Each new step towards friendship 
is, in this model, initiated by the person of 
higher status. 
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WHAT DO ATTITUDE SCALES MEASURE: 


THE PROBLEM OF SOCIAL DESIRABILITY 


JAMES BENTLEY TAYLOR! 
University of Washington 


UcH recent research on personality 
measurement indicates that test- 
taking behavior is highly influenced 

by group norms of social desirability. A num- 
ber of studies have reported that the more 
socially desirable a statement is, the more 
apt it is to be endorsed. There is also evidence 
to show that people differ in their willingness 
to endorse socially undesirable statements. 
This difference accounts for much of the 
variance of scores on personality tests, even 
when the tests supposedly measure quite 
different aspects of personality (Edwards, 
1957). 

Something similar may take place with atti- 
tude scale items. A person who tends to give a 
socially desirable impression of himself may 
also tend to endorse only those items on an 
attitude scale which he perceives as socially 
desirable. Such a response set might thus in- 
fluence a person’s scores on a number of atti- 
tude scales, producing “spurious” interscale 
correlations. For instance, a person who scores 
high on an “authoritarianism” scale may also 
score high on a scale measuring “anti-Semi- 
tism” because of a general willingness to en- 
dorse socially undesirable attitude items. 

This supposition implies the following four 
hypotheses: 

1. Social desirability norms exist for atti- 
tude-scale items. 

2. People differ in the extent to which they 
show a general response tendency to endorse 
socially undesirable attitude items. 

3. A measure of such a general tendency 
should be highly correlated with scores on a 
variety of attitude scales. 

4, People who tend to endorse a large num- 
ber of socially undesirable personality state- 
ments also tend to endorse a large number of 
socially undesirable attitude statements. 


1 Thanks are due to Walter Gerson, William Larson, 
and George Rothbart for their aid in providing subjects, 
and to Judith Nelson for computational assistance. 


The present paper reports an investigation 
of these four hypotheses. 

But even if these hypotheses are confirmed, 
acquiescence response set, as originally identi- 
fied by Lentz (1938), must be considered as an 
alternate explanation of the findings. A final 
hypothesis is therefore investigated: 

5. Assuming the confirmation of Hypothe- 
ses 1-4, the results cannot be explained in 
terms of acquiescence response set. 

Three studies were conducted, the first 
testing Hypothesis 1, the second Hypotheses 
2 and 3, and the third testing Hypotheses 4 
and 5. These studies are reported separately 
below. 


Srupy I: SOCIAL DESIRABILITY AND 
Irem ENDORSEMENT 


Method 


A pool of 127 attitude statements, from six atti- 
tude scales, was collected for the first study. Included 
were 26 items from the California F Scale (Adorno, 
Frenkel-Brunswik, Levinson, & Sanford, 1950), a 
S-item Anomie scale (Srole, 1956), a 20-item scale of 
Ethnocentrism (Adorno et al., 1950), a scale of Radical- 
ism-Conservatism with 15 items (Eysenck, 1955), and 
50 items drawn from the Parent Attitude Research 
Instrument—hereafter referred to as the PARI 
(Schaefer & Bell, 1958). Ten scales from the PARI 
were included but were not analyzed as separate 
scales, since factor analysis of the PARI has shown 
factor loadings on two dimensions: “Approval of 
maternal control of child,” and “Approval of expression 
of hostility” (Schaefer, 1957). The PARI items were 
scored for these two factor scales. In addition, nine 
“filler” items were included in the pool of attitude 
statements. All 127 items were rated by 53 under- 
graduates at the University of Washington on a nine- 
point scale of social desirability, using instructions 
adapted from those given by Edwards (1957). The 
same 127 items were given to another group of students 
(n = 58), with the request to agree or disagree with 
each item.? 


* An 8-page table giving the instructions for the 
social desirability ratings, and the median 
desirability rating for the 127 attitude scale items, has 
been deposited with the American Documentation 
Institute. Order Document No. 6743 from ADI Auxil- 
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Socrat DESIRABILITY AND ATTITUDES 


The median social desirability rating, the probability 
of item endorsement, and the interquartile range was 
computed for each item, The median Social desirability 
ratings were then correlated with the item endorsement 
values. In addition, each of the six scales in the item 
pool was analyzed separately to find the average 
median social desirability of its items, the mean inter- 
quartile range of the items, and the standard deviation 
of the median social desirability ratings, 


Results 


The average social desirability ratings for 
scale items, the average interquartile ranges 
of the items, and the standard deviations of 
the medians are reported in Table 1. The 
“average rating” shows the extent to which 
the scale items are generally weighted with 
social desirability, the average Q values indi- 
cate the degree of consensus on social desira- 
bility for items in the scale, and the standard 
deviation of medians gives an index of the 
variability between scale items on the social 
desirability continuum. It will be noted that 
all of the scales are loaded with socially unde- 
sirable items, with the exception of the PART 
scale for “Approval of Hostility.” Of the total 
127 scale items, only 13 had a median rating 
on the socially desirable half of the scale (a 
score of 5 or above). When the median social 
desirability ratings were correlated with the 
probability of item endorsement, a correlation 
of .80 was obtained. 

A number of statements in the item pool 
clearly implied that socially desirable results 
would follow from certain acts; agreement with 
such a statement in effect assigned high social 
desirability to it. The incorporation of such 
statements might be expected to result in a 
spuriously high correlation between item en- 
dorsement and social desirability values. 
Therefore, the probability of item endorse- 
ment was correlated with the median social 
desirability values of the items with these 
“social action” statements removed. The 
correlation was .74. 

Of the six scales, five were scored only in 
the socially undesirable direction. That is, if 
a person answered in a socially undesirable 
direction, he was scored as being higher in 
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TABLE 1 
MEDIAN Socrat DESIRABILITY Ratincs, STANDARD 
DEVIATIONS, AND INTERQUARTILE RANGES OF 
Irems FOR SIX ATTITUDE 


Average Mean 
le Median 

i Rating Ratings tem Q 
Authoritarian 3.34 1.42 3.4 
Anomie 2.48 2.32 2.8 
Ethnocentrism 2.28 1.18 2.9 
icalism 2.94 1.41 2.9 
PARI Control 3:23. AZ 353) 
PARI Hostility 4.59 99 34 


whatever the scale measured (Ethnocentrism, 
Anomie, etc.). The sole exception was the 
Eysenck Radicalism Scale, in which 7 of the 
15 items were scored so that a socially desira- 
ble response added to the scale score. It might 
be expected that this “balancing out” of social 
desirability would make a score on the Radi- 
calism scale less influenced by the social de- 
sirability variable. 

From these findings it is apparent that 
definite social desirability norms exist for 
these attitude-scale items; that, with one ex- 
ception, the scales which make up the pool are 
loaded with socially undesirable items; and 
that the high correlation between probability 
of item endorsement and social desirability 
of items which has been demonstrated for 
personality statements also characterizes atti- 
tude-scale items. 


Srupy II: GENERALITY OF SOCIAL 
DESIRABILITY RESPONSE SET 


Method 
In order to test the generality and significance of 
the social desirability set on attitude-scale scores, a 
Social Desirability scale was devised, From the pool of 
items collected in the first study, 59 with a median 
social desirability value of 3.0 or less were selected as 
being least socially desirable. The 59 statements were 
divided at random into two groups, of 29 and 30 items, 
It will be recalled that 58 students had responded to 
the items in the statement pool in terms of agreement 
or disagreement. Each of these students was assigned 
two scores: the number of items endorsed in the first 
group of statements, and the number of items endorsed 
in the second, and a correlation between scores com- 
puted. If the students were indeed differentially in- 
fluenced by a general set to give socially desirable 
then we would expect a significant and high 
correlation. 
If a general set of social desirability influences 
attitude-scale scores, the Attitude Social Desirability 
score should correlate substantially with scores on the 
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other scales. It was not, of course, possible to use the 
entire Attitude Social Desirability scale in each correla- 
tion, since it included items from each of the six attitude 
scales. In comparing scores on Social Desirability with 
scores on Authoritarianism, and again when comparing 
Social Desirability with Ethnocentrism, both the Au- 
thoritarianism and the Ethnocentrism items were 
dropped from the Social Desirability scale. Similarly, 
in correlating each of the two PARI scales with the 
Social Desirability scale, all PARI items were dropped 
from the latter. In the case of correlations with Radical- 
ism and with Anomie, the overlapping items were 
omitted from the Attitude Social Desirability scale. All 
correlations were Pearsonian r’s, with the exception of 
the correlation between the Attitude Social Desira- 
bility and the Anomie scale, which consisted of only 
five items with a very limited response range. In this 
case the subjects were divided into those who endorsed 
at least one item on the Anomie scale, and those who 
endorsed none, and biserial coefficient of correlation 
was computed for these two groups. 


Results 


When the split-half test of reliability was 
made between the two groups of socially 
undesirable items, a correlation of .74 was ob- 
tained which, by the Spearman Brown Proph- 
ecy formula, corresponds to a reliability coeffi- 
cient of .85. 

The correlations between the partial Atti- 
tude Social Desirability scale and the other 
scales are given in Table 2. As can be seen, 
there is a substantial correlation between 
Attitude Social Desirability and the scale 
scores for all except one scale. The single ex- 
ception, Eysenck’s Radicalism Scale, is also 
the only scale in which the items are scored so 
as to balance out social desirability. 

These results indicate that people differ in 
the extent to which they endorse socially un- 
desirable attitude items, that this difference in 
willingness is a general response tendency 
over several types of attitudes or items, and 
that the tendency to endorse socially unde- 
sirable attitude statements is strongly corre- 
lated with scores on a number of attitude 
scales that are diverse in content. 


Srupy III: ATTITUDINAL AND PERSONAL 
Socrat DESIRABILITY 


Method 


In this third study two additional measures of 
social desirability set were developed, besides the 
scale of Attitude Social Desirability just described: 
the “Personal Social Desirability” scale, and the 
“Reversed Social Desirability” scale. As previously 
reported (Taylor, 1959), university students had rated 
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TABLE 2 
CORRELATIONS BETWEEN A PARTIAL ATTITUDE 
DESIRABILITY-SCALE AND SIX ATTITUDE 


Correlation 


65 
578) 
61 


PARI Hostility 
^ Biserial coefficient; other values are product-moment, 


TABLE 3 
INTERCORRELATIONS BETWEEN THREE 
Desrasruity SCALES 


Socar 


Attitude Personal R 
Scale SD SD 


Attitude SD 
Personal SD 
Reversed SD 


^ Split-half estimates of total scale reliability. 


e 58. 
83" 


205 MMPI items on a nine-point scale of social de 
bility. On the basis of these ratings, 59 MMPI 
ments that had been rated as very undesirable 
given a median rating of three or less) were sele 

make up a scale of Personal Social Desirability. 
high score on this scale was held to measure the de 

to which a person was apt to endorse socially un 
able personality statements. In addition, 38 of 
MMPI items that had been given. median ratin 
seven or above on the socially desirable end 
continuum were grouped into another scale, 
Reversed Social Desirability. A high score on this 
measured the degree to which a person was willi 
deny favorable statements about himself. F 

These three scales—Attitude Social Desirab 

Personal Social Desirability, and Reversed 
Desirability—were then given to 50 undergrad 
students at the University of Washington. The 
liability for each scale was estimated, using the sp 
half technique and estimating the total reliability wit 
the Spearman Brown Prophecy formula. The thre 
scales were also intercorrelated. 4 


Results Á 


Table 3 lists the obtained intercorrelatior 
and the estimated reliabilities of the 
scales. As can be seen, a substantial relati 
ship was found between willingness to endi 
socially undesirable attitude statements al 
willingness to endorse socially und 
personality statements. Corrected for at 
tion, the correlation is .70. 

There was a substantial relationship b 
tween willingness to endorse socially 
sirable personality items and willingnes 
deny socially desirable items, contrary | 


SoctaL DESIRABILITY AND ATTITUDES 


what would be expected if the social desira- 
bility scales were simply measuring an ac- 
squiescence response set. This finding is similar 
to that reported by Edwards (1957) in other 
studies of the social desirability set. 

These findings indicate that people who 
tend to endorse a large number of socially un- 
desirable attitude statements also tend to 
endorse a large number of undesirable per- 
sonality statements; and that the findings 
reported in this paper can best be explained in 
terms of a general willingness to describe 
oneself as falling somewhere on a social de- 
sirability continuum, rather than in terms of 
a simple acquiescence response set. 


DISCUSSION 


The results of these studies indicate that, 
for the attitude scales studied, items have 
definite social desirability values, that people 
respond differentially to these values, and 
that a substantial amount of variance in the 
attitude-scale scores can be explained as being 
due to this general response tendency. It is 
also apparent that the influence of the social 
desirability set affects scale items drawn from 
both personality and attitude scales. What do 
these findings mean in terms of scale construc- 
tion and scale interpretation? 

The answer to this question would seem to 
depend in part on the view one has on attitude 
scaling in general. If one defines attitudes 
solely in operational terms as being “what the 
attitude scale measures,” it is apparent that 
the attitude scales studied here all to some 
extent measure a willingness to describe one- 
self in socially undesirable terms. From this 
point of view many attitude scales may be as- 
sumed to measure the general social desirability 
set, as wellas measuring othermorespecific atti- 
tudes. If scales are wanted that measure spe- 
cific attitudinal factors free from conflicting 
response sets, then one would be advised to 
cancel out the influence of social desirability 
(as Eysenck seems to have done with some 
success in his Radicalism scale). In any event, 
a certain caution seems called for in the label- 
ing and interpretation of present] attitude 
scales. Many interscale correlations have prob- 
ably been reported as proving some quite spe- 
cific hypothesis, when in fact such correlations 
have resulted from general social desirability. 
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; It is, of course, possible to conceive of spe- 
cific attitudes as being much more than “what 
the attitude scale measures.” Perhaps the 
more usual view is that attitudes can better 
be thought of as “a readiness to respond in 
certain ways,” a readiness that may show 
itself in verbal behavior, overt action, fantasy, 
tension states, etc. From this point of view 
the social desirability variable can perhaps 
best be conceived as rather overriding attitude, 
a single attitude that interacts with and some- 
times masks other attitudes. For instance, 
“latent anti-Semitism” may be regarded as 
an anti-Semitic attitude that is masked by 
the social desirability variable. If so, the re- 
searcher may be advised to take social desira- 
bility and its effects into account when he 
studies the influence of other specific attitudes 
on behavior. 

The notion of a general. social desirability 
set—influencing attitude scales and personal- 
ity inventories alike—raises a further theoreti- 
cal question: What factors lie behind differ- 
ences in the set? On a priori grounds we might 
wonder whether it is influenced by such varia- 
bles as “conformity,” “psychopathology,” 
“low status,” “poor morale,” or “low self- 
esteem.” The reader can probably add to 
this list ad infinitum. Although the concept 
of social desirability serves to clarify certain 
problems in attitude and personality scales, 
it also adds a further measure of complexity 
to the researcher’s task. 


SUMMARY 


This study investigates the influence of. 
social desirability on attitude items and on 
attitude scale scores. Items were drawn from 
six attitude scales: the California Authori- 
tarian Scale, the California Ethnocentrism 
Scale, the Srole Anomie Scale, the Eysenck 
Radicalism Scale, the PARI Approval of 
Hostility scale, and the PARI Approval of 
Control scale. It was found that definite social 
desirability norms exist for attitude-scale 
items; the median social desirability value of 
an item is highly correlated with the probabil- 
ity that the item will be endorsed; people differ 
in the extent to which they will endorse 
socially undesirable attitude statements, and 
this difference in willingness is a general re- 
sponse tendency which occurs irrespective of 
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the obvious content of the scale; a substantial 
correlation exists between this general response 
tendency and scores on five of the six scales 
investigated; and the tendency to describe 
oneself as undesirable on attitude-scale items 
is substantially correlated with the tendency 
to describe oneself as undesirable on personal- 
ity-scale items. Data are presented to show 
that these results are not explainable by refer- 
ence to a general acquiescence response set. 
It is possible that many interscale correlations 
have been reported as proving a specific hy- 
pothesis, when in fact such correlations have 
resulted from the more general social desira- 
bility variable. These findings sound a cau- 
tionary note in the development and interpre- 
tation of attitude scales, while at the same 
time raising certain theoretical problems. 
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& Solomon, 1959) it was found that the 
way a person reacts to a positive or nega- 
tive evaluation of his performance is influ- 
enced by his own evaluation of the perform- 
ance. The individual (“ego”) tends to react 
more favorably to the other person (“alter”) 
when alter’s evaluation is consistent, rather 
than inconsistent, with ego’s self-evaluation. 
In a subsequent study (Deutsch, 1960) we 
asked observers to predict ego’s reaction to 
alter’s evaluation. We found that observers 
made systematic errors which were termed the 
“pathetic fallacy.” Observers’ evaluations, as 
compared to those of the actual participants, 
were determined not by the consistency be- 
tween alter’s evaluation of ego and ego’s self- 
evaluation, but rather by the degree to which 
alter’s behavior was consistent with perceived 
social norms concerned with the “maintenance 
of face.” That is, the more alter went out of 
his way to help ego maintain face, the more 
favorably was alter evaluated. Similarly, the 
more alter went out of his way to destroy ego’s 
face, the less favorably was alter evaluated. 
The foregoing studies are part of a program 
of research concerned with understanding the 
factors that influence the interpretation of 
praise and criticism. We define “praise” to 
mean any positive evaluation of ego by alter 
and “criticism” to mean any negative evalua- 
tion of ego by alter. We shall not attempt 
here to develop a system for the description 
and classification of evaluations (see Kluck- 
hohn, 1951, for an attempt at such a system) 
but it is well to recognize that evaluations 
differ not only in sign (i.e., positive or nega- 
tive) but also in such dimensions as: intensity, 
scope (e.g., are they applied to all of ego or to 
some limited aspect of ego such as a perform- 
ance, a membership, a belief, a possession?), 
explicitness, type of value appealed to in the 


I a previously published study (Deutsch 


1I wish to acknowledge the assistance of Robert M. 
Krauss in the collection and analysis of the data. I am 
also indebted to M. B. Wilk for various suggestions 
concerning statistical procedures. 


evaluation, mode of expression of the evalua- 
tion, to whom the evaluation is expressed, etc. 

In the research reported in this paper, we 
have employed simple evaluative acts: “You 
did that extremely well,” “You did that well,” 
“You did that poorly,” and “You did that 
extremely poorly.” We have placed the differ- 
ent evaluative acts into various institutional 
settings: family, school, military, and work 
contexts. Within each context we have se- 
lected the three role relations defined by the 
relative power of alter as compared to ego: 
alter is superordinate to ego, alter is subordi- 
nate to ego, and alter and ego are peers. We 
are interested in determining how the attri- 
butes of an evaluative act in conjunction with 
the institutional and role context of the act 
influence its perception or interpretation? 
We focus, in this paper, upon the interpreta- 
tions of an observer rather than upon the in- 
terpretations of the participants in the inter- 
action, We do mot make the assumption that 
the interpretations of the observers are the 
same as those of the participants; our prior 
research (Deutsch, 1960) has indicated to 
the contrary. The interpretations of the actual 
participants are bound to be more complexly 
determined than those of detached observers. 
The participants’ interpretations reflect not 
only the impersonal characteristics of the 
social situation, but also their self-attitudes in 
the situation and the particularistic aspects of 
the existing relationship with the other. 

The basic assumption underlying the present 
research is that the framework for the per- 
ception of any social act, by someone not 
self-involved in the situation, is determined 
by the perceived social norms regulating the 
social situation in which the act occurs. More 
specifically, we assume that there is a tendency 
to assimilate or to contrast the act to the social 
norms regulating the social situation in which 
the act occurs: assimilation occurring when the 


2 Throughout the paper we shall use the words 
“perception,” “cognition,” and “interpretation” as 
synonyms. 
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differences between the act and its social context 
are small; contrast occurring when the differ- 
ences are large. Thus, we assume that the in- 
terpreter attempts to arrive at an interpreta- 
tion of the act and of its consequences that is 
stable and consistent with his views of the 
social norms within the institutional and role 
setting in which the act has occurred. Further, 
we can expect a tendency to perceive the vari- 
ous components of a social act (e.g., its con- 
tent, its tone, its style) homogeneously as 
assimilated or as contrasted to the social con- 
text in which it occurs (Heider, 1958). 

Moreover, we assume that the stable and 
coherent perception of an act of social deviance 
requires a more complex, articulated organiza- 
tion of the social elements being perceived 
(i.e., the act, ego, alter, the role relations, the 
institutional context) than does the stable 
and coherent perception of an act of social 
conformity. In everyday terms, one does not 
have to invoke special principles to understand 
why a person conforms to the social norms 
but one does to understand why a person 
violates them.’ Hence, it is easier to arrive at 
a stable, coherent perception of conforming 
behavior. 

Thus, one can expect a bias toward perceiv- 
ing a social act as conforming or appropriate 
to the social norms of the context in which it 
occurs (i.e., assimilating the act to its context) 
rather than toward perceiving it as a violation 
of the social norms (i.e., contrasting the act to 
its context). Presumably, acts conforming to 
the social norms are more positively evaluated 
than acts violating the social norms by a 
participant in the social situation within which 
the act occurs (i.e., by someone who may be 
expected to accept the social norms), 

We assume that social norms exist within 
American society which define the degree of 
appropriateness or inappropriateness of given 
acts of praise or criticism and we also assume 
that the social norms are somewhat different 


3 The assumption that conformity requires no special 
explanation while deviance does is similar to a basic 
assumption involved in Parson’s (1951) theoretical 
system: “... The continuance of a stabilized motiva- 
tional process in a stabilized relationship to the relevant 
objects is to be treated as not problematical . . . . Hence 
for the social system as well as the personality we will 
not be concerned with the problem of the maintenance 
of given states of the social system except when there are 
known tendencies to alter those states” (p. 204), 
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in different institutional and role contexts, 
Further, for gur experiment, we assume that 
our subjects knew the norms pertaining to 
the different institutional and role contexts 
and that their interpretation of an act of praise 
or criticism was determined by the perception 
of the act in relation to the relevant social 
norms. Our initial assumptions concerning the 
relations between the social norms and the 
three independent variables of our study (i.e. 
“evaluative statement,” “role,” and “institu- 
tion”) were: the norms of the American culture 
are biased toward viewing praise as more ap- 
propriate than criticism; the norms with re- 
gard to authority are such as to define evalua- 
tions from a superordinate as more appropriate 
than evaluations from a subordinate; and the 
norms governing interactions within the 
family are the most permissive and benevolent, 
while the norms within the military are the 
least permissive and benevolent of the norms 
characterizing the four institutional settings 
considered in this study. Our more specific 
assumptions about the interactions among 
our independent variables will be discussed in 
connection with the results. 


METHOD 


The study was conducted with students at several 
local colleges during a regular meeting of their psy- 
chology classes. By a random procedure, the students 
were assigned to the 48 experimental variations that 
resulted from our 4 X 4 X 3 factorial experimental 
design. Sixteen students were assigned to each experi- 
mental variation; each student was exposed to five 
experimental variations. 

The method of the study was to present the students 
(i.e, the observers) with simple evaluative statements 
made by alter to ego in which the role and institutional 
contexts were specified by appropriate labeling of 
alter and ego. The method is one that we arrived at 
after rejecting such methods as films, voice recordings, 
and puppet enactments for depicting the interactions. 
It became obvious that a simulation procedure runs 
into the problem that in any lifelike interaction the 
participants use a variety of techniques to define the 
meaning they are intending to convey by a given act. 
Thus, the meaning of a given act may be determined 
by such factors as the time and place in which it occurs, 
the tone of voice, gestures, manner, and appearance 
with which it is accompanied, etc. In real life, one 
attempts to manipulate these expressive components 0 
an act to produce in a given social context, with its 
given social norms, the effect one intends. Thus, & 
private who wants to encourage a captain through 
praise will express his praise in a tone of voice, Wi 
gestures and a manner, etc. that are different from 
tone of voice, etc. of a captain who wants to encouragé 
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a private. Hence, the experimental simulation of praise 
would have to be rather different for the different 
roles and social contexts if one is to avbid introducing a 
false note or inconsistency into the act itself. The 
radical simplification of our procedure* largely elimi- 
nates the expressive components of the act and, hence, 
allows one to study the effects of social context upon 
acts that are otherwise identical (and which, despite 
their identity, do not have false or inconsistent notes in 
any of the social contexts). 

‘By varying the evaluative statement and the label- 
ing, the following experimental variables were created: 

Evaluation scale. Four evaluative statements were 
used to create four levels of this factor: “You did that 
extremely well,” “You did that well,” “You did that 
poorly,” “You did that extremely poorly.” 

Institutional context. Four institutional contexts 
were employed: family, school, work, and military. 
‘The institutional context was created by the labeling 
of ego and alter. For the family, the labels father, son, 
and brother were employed; for the school, teacher 
and student; for work, employer and employee; for 
the military, captain and private. 

Role context. Three role contexts were created: 
alter as superordinate to ego (i.e., father to son, teacher 
to student, employer to employee, captain to private); 
alter and ego as peers (i.e., brother to brother, student 
to student, employee to employee, private to private); 
and alter as subordinate to ego (i.e., son to father, 
student to teacher, employee to employer, private to 
captain). 

Examples of the statements are: “You did that 
extremely well,’ Mr. Pitman, a student, told his 
teacher, Mr. Anderson. “You did that poorly,” Bob 
Simon said to his father. “You did that extremely 
poorly,” Private Ford told Private Walker. 

Data were obtained by having the observers answer 
four questions: What did ego G.e., the recipient of the 
act) think alter’s (i.e., the person making the evalua- 
tion) purpose was? What did ego say to alter? What 
did ego want to say to alter? and What did ego feel? 
The questions were, of course, phrased not in terms of 
ego and alter but in terms of the names of the people in 
the depicted interaction. The observers responded to 
the questions by filling in 13-point rating scales, which 
ranged in value from —6 through 0 to +6. The scales 
for Questions 1 and 4 were identical: +6 was labeled 
“extremely good,” 0 was “neither good nor bad,” 
—6 was “extremely bad.” The instructions, of course, 
made it clear to the observers that for Question la 
rating of +6, for example, meant that they (ie., the 
observers) held the view that ego thought alter’s 
purpose in making his statement was “extremely 7 
while a rating of +6 in response to Question 4 meant 
they thought ego felt “extremely good” after hearing 
alter’s statement. The scale for Questions 2 and 3 
were identical, +6 was labeled “extremely friendly,” 
Maker L 


4 Tn principle, the methodology of our study is no 
different from the methodology employed in the labora- 
tory investigation of physical perception. ‘There, too, 
radical simplification rather than simulation is the 
method of choice in studying basic processes of per- 
ception. 
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9 was “neither friendly nor unfriendly,’ —6 was 

extremely unfriendly.” The observers understood that 
a rating, for example, of +-6 on Question 2 meant they 
thought ego said something that was “extremely 
friendly” in reply to alter and that a rating of +6 on 
Question 3 indicated they thought ego wanted to say 
something “extremely friendly”; a difference in ratings 
on the two questions represented a difference between 
what they thought ego wanted to say and what they 
thought he actually said. In addition to the rating scale 
data, free responses were obtained to the four questions 
from 337 observers. 


RESULTS 


The main data for this experiment were ob- 
tained from the observers’ ratings in response 
to the question: What did ego think alter’s 
purpose was? Since the ratings for all four 
questions were highly intercorrelated (the 
average r was .71), the other three ratings are 
not discussed as such, They are, however, con- 
sidered in relation to measures derived from 
them that are largely free of “halo effects” and 
“response bias.” These measures, which are 
discussed more fully below, are “the perceived 
immediate emotional impact” of alter’s evalu- 
ation upon ego and the “perceived behavioral 
restraint” of ego in responding to alter. 


Observer’s Evaluation of Alter’s Purpose 


General effects. Tables 1 and 2 provide a 
summary of the data concerning the observer’s 
evaluation of alter’s purpose. It is apparent 
that the same act (e.g, “you did that ex- 
tremely well”) is evaluated in strikingly differ- 
ent ways as a function of the context of the 
act (e.g, compare the evaluation when a 
father is saying it to his son with a private 
saying it to a captain). The effect of context 
can be so strong that intense criticism of a 
son by a father is more favorably evaluated 
than intense praise of a captain by a private. 

If one accepts our assumption that a posi- 
tive evaluation of alter’s purpose indicates that 
the observer has perceived that alter is con- 
forming to the social norms governing his rela- 
tionship with ego and that a negative evalua- 
tion indicates the opposite, it is evident that, 
over all, there is a strong bias in favor of per- 
ceiving alter as conforming to rather than 
violating the norms. We have suggested that 
the bias results from the greater ease of ar- 
riving at a stable, coherent perception of con- 


TABLE 1 
CELL Means: Eco’s EVALUATION oF ALTER’S Purpose (as perceived by observers) 


Institution 
Scale Status 
Family School Work Military 
Extremely well high-low 4,948 4.88 4.38 2.81 
ji low-low 4.31 3.25 4.00 2.50 
low-high 3.88 2.75 1.69 81 
Well high-low 3.38 3.69 3.50 3.81 
low-low 2.50 2.19 3.75 2.38 
low-high 2.62 2.06 1.69 -38 
Poorly i .06 1.12 1.00 31 
peated 1.00 94 1.31 44 
low-high :50 =.56 :56 —3.75 
Extremel: rly high-low 1.88 -.12 00 1.19 
of low-low 31 —.62 —2.25 06 
low-high —1.00 2.31 —.25 —2.12 
X Institution (2.03) (1.44) (1.40) (.73) 
Raaen, 2.30 
low-low K a) 
low-high (.43 
® Each cell entry is a mean based on 16 responses. 
TABLE 2 hare E 
ANALYSIS OF VARIANCE: Eco’s EVALUATION OF ALTER’S than of eats (the bias 1s inten 
(as perceived by observers) more intense praise or criticism) even 
the effects of role and institutional ¢ 
asii 4 aS P are eliminated statistically. We use the | 
Between institution 3 53.95 6.49648 bias here to indicate that we can see no 
peiron IAN : Sa ASERT trinsic reason why praise, per se, sho 
Institution X Scale 9 13.65 1.64* more favorably evaluated than criticist 
Institution X Sta- 6, 22.33” 2.68%* se. The finding, although intuitively o 
SEN Gata > A requires an explanation. Our explanati 
Institution X Scale 18 12199  1.56* sumes that the purpose of any social 
X Status always evaluated in relation to some 
cine 720 8.32 tive context; the context that ren 
pee, Uta the observers, after the elimination 
Reg specific effects of “role” and “institt 


forming as contrasted with deviating be- 
havior.® 

Effects of the evaluative statement. Tables 1 
and 2 also give evidence of a bias that results 
in more favorable interpretations of praise 


ë In another study, with the same social context 
variables, we presented an interaction in which alter’s 
evaluative act clearly violated all of the relevant social 
norms: he was described as ridiculing ego. Ridicule is 
evaluated quite negatively in each of the institutional 
and role contexts. The contrast between the act of 
ridicule and the social norms is so strong as to “wash 
out” the differential effects of role and institutional 
contexts. 
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was the broad normative context of / 
can society. Elsewhere (Deutsch, 196 
have developed the rationale for a 
that the norms within the American § 
are biased in favor of praise. One can im 
other cultures that might be strongly 
in terms of “appropriateness of eval 
or even some “masochistic” culi 
would be biased toward favorable eva 
of criticism. 

Effects of role. We have assumed that 
social norms concerning authority are 
that they would lead the observers to 
alter’s evaluative act more favorabl 
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higher alter’s status is in relation to ego. The 
results clearly support this assumption. For 
each of the four evaluative acts, the higher 
alter’s relative status the more favorably do 
the observers interpret the purpose of the 
act. There is evidence (reflected in the “‘scale 
and status” interaction) that the effect of rela- 
tive status upon the observers’ interpretations 
is enhanced when the evaluation is intense 
rather than mild. That is, subordinates are 
perceived to have even relatively less right, 
as compared with superordinates, to make 
intense rather than mild evaluations. Status 
differences have most effect upon interpreta- 
tions of acts within the military, next most in 
the school, and least in the family, (cf. the 
“institution and status” interaction). The 
latter results are consistent with the assump- 
tion that the relative status of alter to ego is a 
more important determinant of the observers’ 
evaluations of alter’s act, the more rigid the 
authority structure® within which the act 
occurs. 

Effects of institution. We have assumed that 
the observers’ interpretations of alter’s act 
would be more favorable if alter’s act takes 
place within an institutional context of rela- 
tions between alter and ego that is perceived 
to be cooperative or mutually facilitative 
rather than individualistic or competitive. 
From this assumption, one would expect that 
an evaluative act would be most favorably 
interpreted in the family and least favorably 
in the military context. The results confirm 
this expectation. 

Within the school context, evaluative acts 
between superordinate and subordinate tend 
to be interpreted more favorably than in the 
work context unless the evaluation is one of 
intense criticism, in which case the reverse is 
true. On the other hand, evaluations between 
peers are more favorably interpreted in the 
work than in the school context unless the 
evaluation is critical, in which case the re- 
verse is true. These reversals indicate a more 

6 “Rigidity of authority” is difficult to define in 
terms of a single criterion. However, such indicators as 
the following would reflect this concept: the right of a 
Superordinate to expect unquestioning obedience from 
a subordinate; the severity of the sanctions that a 
superordinate can legitimately invoke to punish @ 
disobedient subordinate; the freedom of the subordinate 


to appeal to others to countermand an order from his 
superordinate or to support his violation of an order. 
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intense contrast effect for negative evaluation 
of peers in the work as compared to the other 
settings and a more intense contrast effect for 
severe criticism of a person of different status 
in the school as compared to the other con- 
texts. 

As a possible explanation of these results we 
offer the hypothesis that a contrast effect is 
likely to occur more intensely or with weaker 
differences, the more it is perceived possible 
for relations to occur between alter and ego 
opposite to those that usually obtain. Thus, 
observers might assume that within the family 
context there is little possibility for a relation- 
ship that is not mutually facilitative to occur. 
Here it would take a large violation of the 
social norms (see Footnote 5) to produce a 
contrast effect. On the other hand, between 
teacher and student or between co-workers, 
observers might assume that while a mutually 
facilitative relationship is most common, a 
mutually antagonistic relationship is not un- 
common. Here, a violation of the norms would 
more readily lead to a contrast effect. This hy- 
pothesis explains the findings described in the 
previous paragraph if one assumes that ob- 
servers are more likely to perceive relations 
between different pairs of employers and 
employees as more homogeneous than among 
different pairs of co-workers, and co-student 
relationships as more homogeneous than 
teacher-student relationships. We have no 
evidence that is relevant to this assumption. 


Perceived Immediate Emotional Impact upon 
Ego of Alter’s Evaluation 


From the difference between the observers’ 
ratings of “what ego felt” and “what ego 
thought alter’s purpose was” a measure of the 
perceived immediate emotional impact of 
alter’s evaluation upon ego was obtained. It 
seems reasonable to assume that the total emo- 
tional impact of an act (i.e., “what ego felt”) 
results from its perceived purpose and its im- 
mediate emotional impact. The immediate 
emotional impact of an evaluation reflects, we 
believe, its impact upon the individual’s self- 
esteem (or upon his security system). That is, 
an evaluative act has emotional impact for a 

n over and above the significance attrib- 
utable to its perceived purpose if the evalua- 
tion is perceived to have relevance to the 
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_ TABLE 3 
CELL Means: ImmepraTre EMOTIONAL Impact OF ALTER’s EVALUATION UPON Eco 
(as perceived by observers)* © 
Institution 
Scale Status ~ X Scale 
Family School Work Military 
Extremely well high-low -38 -56 38 -06 
low-low -88 12 — .38 1.25 
low-high 31 -56 -50 1.81 Cag 
Well high-low 44 -38 1.06 1.06 i 
low-low 38 1.19 .88 1.00 ' 
low-high -38 -88 -06 1.69 CR) 
Poor! igh-low 2.44 3.00 2.62 1.88 i 
7 aca 1.88 2.31 81 -06 
low-high 2.25 1.69 1.31 —.81 iy 
1.62 
Extremely poorly high-low 3.69 2.19 2.75 3.38 
low-low 2.00 1.12 .12 -50 4 
low-high 1.31 .25 1.19 1.19 aia 7 
X Institution (1.36) (1.09) (.92) (1.09) i 
Tiet (1.64) 
low . 
low-low {3 À 
low-high -87) 


a Cell entries for this table were obtained in the following manner: 


TABLE 4 
` ANALYSIS OF VARIANCE: IMMEDIATE EMOTIONAL 
IMPACT OF Atrer’s EVALUATION UPON Eco 
(as perceived by observers) 


Source af MS R 
Between institution 3 6.28 -90 
Between scale 3 68.67 = A: Siad 
Between status Z S292 75088" 
Institution X Scale Os" 20,25; 2.0088 
Institution X Status 6 5.41 -78 
Scale X Status 6 625.26. 3.62"** 
Institution X Scale 18 5.34 77 

X Status 
Within cells 720 6.97 
asep < 01. 


values and social supports upon which the 
person bases his self-esteem. 

The data (summarized in Tables 3 and 4) 
indicate that criticism is perceived to have 
more immediate impact than praise. Criticism 
from a superordinate is perceived to have more 
immediate impact than criticism from a peer 
or a subordinate. However, there is a sugges- 
tion in the data that for praise the reverse is 
true in the military. These findings seem con- 
sistent with our view that immediate emo- 
tional impact reflects self-esteem involvement. 
Here, we assume as we have elsewhere 


for positive evaluative statements (“extremely well” and “well”} 
the difference was computed as “‘ego felt” minus “‘alter’s purpose.” For negative evaluative statements (‘poorly”’ and “extremely poorly”) 
the difference was computed as “alter’s purpose’ minus ‘‘ego felt?” 


(Deutsch, 1960) that in a success-oriented 
culture such as ours, threat to self-esteem 
in failure or criticism) is perceived to be mor 
self-involving than is reassurance (as in suc- 
cess or praise), It is also reasonable to assume 
that the threat from criticism is perceived t€ 
be greatest when the critic is a superordinate 
Initially, we had also assumed that the 
personal-impersonal characteristics of an 1 
stitution contribute to the perceived emo 
tional impact of an evaluative act and, hence, 
that the immediate emotional impact wot 
be perceived to be greatest in the family. 
However, this assumption does not providi 
an adequate explanation for praise havin i 
more marked impact in the military than im 
the family context. Alternatively, one may 
hypothesize that an evaluative act has im= 
mediate emotional impact, not only as a func 
tion of its relevance to the individual's $ 
curity system, but also as a function of it 
surprise or its unexpectedness. Thus, praise i 
the military (as compared with the family) 
especially from a subordinate, and criti ‘ 
in the family (as compared with the military), 
are both likely to be considered relatively W 
expected. d 
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The perceived behavioral restraint of ego in 
responding to alter. The data pgesented so far 
have been concerned with the observer’s ex- 
pectation concerning the effect upon ego of 
alter’s evaluation of ego. Our interest here was 
in the observer’s view of ego as the recipient 
of am evaluation. It is also relevant to consider, 
from the observer’s viewpoint, ego as a re- 
sponder to alter’s evaluation. In general, it is 
reasonable to posit that the observers would 
assume a large measure of complementarity in 
the social norms that determine ego’s recep- 
tion or interpretation of an evaluation and 
the norms that determine ego’s action toward 
alter in response to alter’s evaluation. Our re- 
search design, however, does not permit any 
unequivocal test of this assumption. A correla- 
tion of .76 (p < .01) is obtained between the 
observers ratings of alter’s purpose and of 
ego’s overt response to alter; the correlation 
in addition to reflecting the assumed comple- 
mentarity undoubtedly is also influenced by 
“halo effects.” 

While complementarity is to be expected, 
there is also reason for the observers to assume 
that ego’s overt behavior toward alter may 
not necessarily be the same as his covert be- 
havior; that social norms parallel to the ones 
affecting his interpretation of alter’s behavior 
would place restraints upon ego’s overt re- 
sponse. These perceived restraints can be 
measured by comparing overt and covert re- 
sponses. Negative restraint may be defined as 
existing whenever the overt response is per- 
ceived to be more favorable than the covert 
response toward alter; positive restraint as 
existing whenever the covert response is per- 
ceived to be more favorable than the overt 
response. Our initial assumptions focused upon 
the occurrence of negative restraint. We an- 
ticipated that the observers would expect 
more negative restraint after criticism when 
ego was a subordinate rather than a super- 
ordinate of alter and when the institutional 
setting was more rigidly authoritarian and 
less benevolent (e.g., the military as compared 
with the family). We also expected more nega- 
tive restraint after criticism than praise. 

The data (see Tables 5 and 6) show there 
to be more negative restraint in response to 
criticism than to praise; in certain social 
settings ego is expected to show positive re- 
straint in relation to praise. Thus, in response 
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to praise from peers or from members of his 
family, ego is expected to make an overt re- 
sponse that is less positive than his covert 
response. This suggests that in certain situa- 
tions a restraint is perceived against imgratiat- 
ing behavior, that may be sufficiently strong 
to curb the overt expression of gratitude, One 
may speculate that such a curb arises as a 
means of protecting or insuring the sincerity 
of praise when, frequently, in other social con- 
texts it is an attempt at ingratiation. 

The expectation of greatest perceived nega- 
tive restraint after criticism from a superordi- 
nate is supported by the data. However, the 
institutional effects were not as anticipated. 
In the military, where one would expect the 
largest amount of perceived negative restraint 
after criticism by a superordinate, one finds the 
smallest. One possible explanation for this sur- 
prising result is that the negative restraint in 
responding to criticism from a superordinate 
in the military are perceived to be so great 
that they are perceived to inhibit covert as 
well as overt action toward alter. This inter- 
pretation is given some plausibility by other 
data (not presented here) which indicate that 
a measure of perceived internal restraint (the 
difference between the observers’ ratings of 
“what ego felt” and “what ego wanted to 
say”) is highest in the military after criticism 
from a superordinate. 


Analysis of the Free Response Data 


The data collected from the observers when 
they gave free responses to the four questions 
(concerning alter’s purpose, ego’s overt and 
covert responses to alter, and ego’s feelings) 
are presented as supplementary, secondary 
data because of the methodological difficulties 
we experienced with them. About 25% of the 
observers’ responses were uncodable and it 
was impossible to code many of the codable 
responses without knowledge of the experi- 
mental treatments since many of the responses 
specifically identified the institutional and, or, 
role contexts. Thus, we are unable to rule out 
the possibility that the analysis of these data 
is biased by knowledge of the experimental 
treatments. Nevertheless, since they give re- 
sults completely consistent with the rating 
scale data, it seems worthwhile to take ad- 
vantage of the insights they offer. : 

The perception of alter’s purpose. Praise, as 
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TABLE 5 
CELL MEANS: DIFFERENCE BETWEEN WHat Eco Sam Anp WHat He WANTED TO Say TO ALTER 
(as perceived by observers)® * 
oo 


Scale Status 


Institution 
X scale 


Family School Work Military 
Extremely well high-low +25 -06 25 -19 
low-low a t 8 1825 760°" fe Hg 
low-high 25 56 ODay ers 112 ca 
Well high-low —.31 .00 06 —.12 À 
low-low — 81 15449 162!) | = 106 
low-high 44 12 1.06 156 cB 
Poorl; high-low .88 9 94 .38 ; 
4 ei 06 162 ‘94 94 
low-high 150 1.69 19 —.06 ts 
Extremely poorl high-low 462, (hf S Saag: S.i 25 7 
E low-low -1.06 ‘69 ‘69 1.06 
low-high — 44-62 15 119 cist 
X Institution (=.17) (52), (51) (25) i 
pean 43) 
low ` 
low-low {i} 
low-high 30 


^ The cell entries in this table were computed by subtracting the observers’ responses on “ego said to self" from “ego said to alter.” 


TABLE 6 
ANALYSIS OF VARIANCE: DIFFERENCE BETWEEN WHAT 
Eco Sam ann WHat He WANTED TO Say 
TO ALTER (as perceived by observers) 


Source df MS F 
Between institution 3 19.738 3.34 
Between scale 3 20.39 3.4" 
Between status 2 7.20 1.22 
Institution X Scale 9 3.21 0.54 
Institution X Status 6 6.23 1.05 
Scale X Status 6 9.74 1.65 
Institution X Scale X 18 4.83 0.82 

Status 
Within cells 720 5.92 
** p <05. 


contrasted with criticism, is much more fre- 
quently perceived to be motivated by “ought” 
forces. That is, praise is seen as being moti- 
vated by a desire to be “truthful” or “objec- 
tive” more than twice as frequently as criti- 
cism. The evaluations made by superordinates 
are more likely to be perceived as motivated 
by ought forces than are the evaluations by 
peers or by subordinates. Praise by a peer is 
most frequently perceived as motivated by a 
desire to make ego “feel good” or “happy,” 
praise by a superordinate is most frequently 
seen as having the intention of “helping” ego 
(e.g., “letting him know that he is right,” 


“encouraging him,” “teaching him”), while 
praise from a subordinate is most commonly 
interpreted as being motivated by the sub- 
ordinate’s own needs (e.g., “to ingratiate him- 
self with his superior,” “to get the boss to do 
something,” “to be a big shot”). The differ- 
ences in interpretation of the purpose of praise 
for the different status relations is most accen- 
tuated for the military setting and virtually 
disappears in the family setting. In the latter 
setting, praise by a brother or by a son are 
both seen, most often, as a desire to make 
ego “feel good.” In the more impersonal insti- 
tutional settings, praise is more likely to be 
seen as intending to be “helpful” than as m- 
tending to make ego “feel good”; the opposite 
is true for the family setting. ` 
Criticism by superordinates, especially 1 
the family setting, is seen typically as intend- 
ing to be helpful. In all but the work setting, 
criticism by peers is also perceived as intend- 
ing to be helpful (although less frequently than 
in the case of criticism by superordinates) ; in 
the work setting, it is perceived as malicious 
in intent (i.e., making ego “feel bad”). Crit 
cism by subordinates is most often viewed a8 
negative in purpose in the military and school 
contexts but not in the family or work con- 
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texts. The more intense the criticism, the more 
likely that the subordinate’s purpose is de- 
scribed as malicious rather than as an attempt 
to be a “big shot.” 

Ego’s affective reaction to alter. In all of the 
institutional settings, praise is perceived as 
giving rise to positive affect which is primarily 
self-directed (e.g., “feeling of self-esteem,” 
“feeling of achievement”) rather than alter- 
directed (e.g., “feeling of solidarity,” “feeling 
of respect”). Criticism in the school and 
family settings is also seen as giving rise to 
self-directed feelings (e.g., “feeling of insecu- 
rity,” “feeling of failure”); but, in the mili- 
tary and work settings, criticism gives rise 
to alter-directed affect (e.g., “contempt or 
pity,” “anger or indignation”). Praise or criti- 
cism, but especially criticism, from a subordi- 
nate is more likely to be perceived as evoking 
alter-directed affect than evaluations from a 
peer or a superordinate. 

The affective reaction of contempt, pity, 
anger, or indignation to criticism is expected 
most often when alter is lower in status than 
ego and least frequently when alter is of higher 
status. “Resentment” is expected in reaction 
to criticism only when alter is of higher status 
than ego, and only in the military and work 
contexts, The feeling of determination to im- 
prove one’s performance is expected most 
often after criticism by a superordinate and 
least often after criticism by a subordinate. 


Discussion 


The data of this experiment clearly demon- 
strate that the meaning of praise and criticism 
is very much influenced by the social contexts 
in which they occur. The data are consistent 
with everyday observations and with common 
sense intuitions even though they may run 
counter to the theoretical notions behind such 
concepts as “generalized reinforcers” (Krasner, 
1958; Skinner, 1957). On theoretical examina- 
tion (Deutsch & Solomon, 1959), the obvious 
fact that “praise” is, generally, more positively 
responded to than “criticism” no longer re- 
mains obvious; it becomes a fact requiring 
explanation. Our explanation has been that a 
social act is always interpreted in relation to 
some normative context(s). For the disinter- 
ested observer of a social act, we have assumed 
that the relevant normative framework within 
which an act is perceived includes the per- 


ceived, relevant norms from the culture, the 
institutional setting, and the role relationship 
within which the act occurs. For the actual 
participant in a social act, we assume that 
norms which relate to his self-conception and 
to the particularistic relations with the other 
also influence the framework within which an 
act is perceived. 

Praise tends to be more favorably evaluated 
than criticism, in terms of this explanation, 
because the norms of the success-oriented 
American culture are directed toward helping 
people maintain the “face” of success (Kluck- 
hohn, 1949; Williams, 1951) and because acts 
that are perceived as conforming to the ac- 
cepted norms are evaluated more favorably 
than acts that violate them. However, as this 
and a prior experiment (Deutsch & Solomon, 
1959) have demonstrated, when praise is 
counter to and criticism is consistent with 
specific institutional, role, or self-norms the 
reverse evaluations can be obtained. 

Despite the seeming superficiality and arti- 
ficiality of the methodology we have employed, 
we believe that our results indicate that. the 
technique is a very useful method for the in- 
vestigation of social norms. The technique is, 
of course, similar to certain projective methods 
(e.g., the sentence completion method) used to 
study individual personality. Here, however, 
the emphasis is on social situational variables. 
By systematically varying the social context 
of standard social acts and obtaining both 
quantitative and qualitative responses, one 
should be able to characterize in comparable 
terms the perceived effects of age, sex, occupa- 
tional and other characteristics upon the 
meaning of a given social interaction. 


SuMMARY AND CONCLUSIONS 


In this paper, we have investigated the 
effects of the institutional and role context 
upon the interpretation by a disinterested ob- 
server of written presentations of simple acts 
of praise and criticism made by alter to ego. 
We placed the different evaluative acts (e.g., 
“You did that extremely well,” “You did that 
extremely poorly”) in different institutional 
contexts (family, school, work, and military), 
and within each context we selected three 
role relations defined by the relative power of 
alter as compared with ego: alter is superordi- 
nate to ego, alter is subordinate to ego, and 
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alter and ego are peers. From the observers, 
we collected ratings or free responses to four 
questions: What did ego (i.e., the recipient of 
the act) think alter’s (i.e., the person making 
the evaluation) purpose was? What did ego 
say to alter? What did ego want to say to 
alter? and What did ego feel? 

The results may be summarized as follows 
(keeping in mind that the data refer to an 
observer’s predictions concerning ego’s reac- 
tions to alter’s evaluations of ego): 

1. There is a bias toward interpreting evalu- 
ative acts favorably, i.e., toward viewing them 
as conforming to rather than violating the 
social norms. 

2. There is a bias toward interpreting praise 
more favorably than criticism. Criticism is 
perceived as having greater immediate emo- 
tional impact than praise and as being more 
likely than praise to result in “holding one’s 
tongue.” 

3. Evaluations made by a superordinate to 
a subordinate are interpreted most favorably, 
evaluations by a peer next most favorably, 
evaluations by a subordinate least favorably. 
The immediate emotional impact of evalua- 
tions by superordinates is expected to be 
greater than those made by subordinates. 
The “holding of tongue” in response to intense 
criticism is expected to be greatest by subordi- 
nates and least by superordinates. 

4. Evaluative acts are interpreted most 
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favorably within the family, somewhat less 
favorably in school and work, and least favor- 
ably within the military context. The effects 
of status differences upon the interpretation of 
an evaluative act are seen to be most marked 
within the military and least marked within 
the family. The immediate emotional impact 
of criticism is expected to be greatest in the 
family, the immediate emotional impact of 
praise is expected to be greatest in the mili- 


tary. 
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QUALITY AND ACCEPTANCE OF PROBLEM SOLUTIONS BY 
MEMBERS OF HOMOGENEOUS AND 
HETEROGENEOUS GROUPS! 


L. RICHARD HOFFMAN ann NORMAN R. F. MAIER 
University of Michigan . 


N an earlier article Hoffman (1959) re- 
ported that higher quality solutions to 
problems were produced by groups whose 

members had dissimilar personalities (hetero- 
geneous groups) than by more homogeneous 
groups. On the other hand, the reported satis- 
faction of the members with the solutions 
(acceptance) was quite similar in the two 
types of groups. Since only two problems were 
used, however, the generality of the superior 
problem solving ability shown by the hetero- 
geneous groups may be questioned. 

Might homogeneous groups do better on 
other problems? Problems that emphasize 
differences among group members, and thus 
strain the ability of heterogeneous groups to 
work together effectively, may provide condi- 
tions advantageous for homogeneous groups. 
Even if the quality of solutions to such prob- 
lems by heterogeneous groups should not be 
impaired, the members’ satisfaction with the 
solutions may be reduced. 

In addition to the two problems used in the 
original study, the present study employed 
three new ones that involved conflicts in 
values, personal conflicts, and the distribution 
of a bonus. With the same experimental design 
the original study was thus extended to deter- 
mine whether heterogeneous groups would 
produce higher quality solutions to such varied 
problems without sacrificing acceptance. 


METHOD 


Four-person homogeneous and heterogeneous groups 
were constructed in the eight laboratory sections during 
two semesters of an undergraduate psychology course 


1 The major part of the research reported herein 
was supported by Project Michigan, a rtment of 
the Army-sponsored project of the University of 
Michigan in the field of combat surveillance. The 
contract (DA-36-039 SC 52654) is administered by the 
Signal Corps. The final report was written in conjunc- 
tion with USPHS Grant No. M-2704 from the National 
Institute of Mental Health, United States Public 
Health Service. A shortened version of this paper 
was presented at the Midwestern Psychological As- 
sociation meetings, April 1960. 


in human relations, The Guilford-Zimmerman Tempera- 
ment Survey (Guilford & Zimmerman, 1949) was used 
as the measure of personality, and profile correlations 
(Kendall’s tau) were computed between the 10-score 
profiles of all pairs of students in each section. Originally 
16 homogeneous groups (high positive tau’s) and 25 
heterogeneous groups (either near-zero tau’s or high 
negative and positive tau’s) were formed in the third 
week of the semester and interacted weekly thereafter 
in case discussions, problem solving, and role playing. 

Periodically, test problems were introduced into the 
laboratory sessions, Solutions to four of these problems 
could be scored for their objective quality; one was a 
problem purely of fairness, having no external standard 
of quality. Of the problems that could be scored for 
quality, solutions to three (Mined Road, Student 
Assistance Fund, and Painter-Inspector Argument) 
were graded quantitatively according to systems of 
content analysis that assigned increasing numbers of 
points to solutions of higher quality. On the fourth 
quality problen—Change of Work Procedure—new- 
modified and, especially, integrative solutions were 
considered to be of high quality. 

The Mined Road Problem, used in the earlier study, 
(Hoffman, 1959) requires the group to develop a method 
for permitting five men to cross a heavily mined road, 
Tt was administered in the first meeting of each group. 
The major problem for the group here is to recognize 
and solve all facets of the problem in the safest possible 


way. 
The Student Assistance Fund problem, a role play- 
ing case developed for the present study, was adminis- 
tered in the fourth meeting. Each subject was assigned 
the role of a different student who was deemed qualified 
for financial assistance from arin fund. Each 
“student” was told that he on one or more 
attributes. Each also was told that he needed $1,500 
for the next academic year, but the total fund was only 
to be $3,000 this year. The group had to decide on how 
to allocate the money this year and to recommend a 
method for allocating money in the future. Conflict was 
induced by the provision that the total amount of 
money was less than that needed by all group members 
com ‘and not enough for any one member if they 
divided the money evenly, The chance of reaching 
agreement was reduced further by giving the subjects 
opposing standards for allocating money among them- 
selves. Only the method developed for allocating funds 
among students in the future was scored for quality. 
The Painter-Inspector Argument problem (adapted 
from Maier, Solem, & Maier, 1957), another role 
playing case in which a leader was assigned randomly 
within each group, was administered in the sixth 
laboratory session. A painter, an inspector and rough 
primer, a shop steward, and a foreman meet to settle 
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an argument between the painter and the inspector. 
The argument reflects a basic conflict in the entire 
shop, but few groups were able to get beyond the initial 
argument and identify the total shop problem. 

The Change of Work Procedure problem (Maier, 
1952), administered in somewhat modified form in the 
eighth session, is also a role playing case, with a fore- 
man appointed to lead the discussion, and was repeated 
from the earlier study. The discussion centers around a 
request by the foreman that each of the three workers 
work in his best position rather than rotating among 
three simple jobs. The problem is usually approached as 
a choice between the increased pay of the fixed-position 
method (New solution) and the relief from boredom 
offered by rotation (Old solution). 

The Point Distribution problem was administered 
in the tenth session, immediately following the return 
to the class members of the third of four course examina- 
tions. Each group was given 19 points to distribute 
among its members as a bonus for participating in the 
experiment. The points were to be used to raise the 
final grade of the student. Each subject knew his own 
position in the class relative to the tentative cutting 
points for grades indicated by the instructor. The 
groups were given 40 minutes to arrive at a decision, 
and were told that no fractional or negative points were 
to be given and that failure to agree would give each 
subject four points. Since no objective standard of 
quality could be applied to the solutions, this problem 
was one of fairness or acceptance. 

Following each problem solving session, the subjects 
completed two Likert-type five-point questions con- 
cerning their satisfaction with the solution reached and 
with the amount of influence they had over the solu- 
tion. Values from 1 to 5 were assigned to each response, 
with 1 indicating “very dissatisfied” and 5, “very 
satisfied.” 

RESULTS 
Quality of Solutions 

The overall findings of the present study 
support those of the earlier one. On the vari- 
ous problems, the solutions by heterogeneous 
groups either were scored as significantly 
superior in quality to those of the homogene- 
ous groups, or did not differ in quality. Never 
were the homogeneous groups better than 
the heterogeneous groups. 

Table 1 presents a comparison of the results 
on the three quantitatively-scored problems 
that had a quality component—Mined Road, 
Student Assistance Fund, and Painter and 
Inspector—for the homogeneous and hetero- 
geneous groups. For the Mined Road problem 
the results combining the data of the present 
and earlier studies are also presented. The pro- 
portion of high quality solutions to both the 
Mined Road and Student Assistance Fund 
problems by heterogeneous groups was sub- 
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TABLE 1 i 


Quarrry or SOLUTIONS By HOMOGENEOUS AND 
ENEOUS GROUPS 


Present Study | Combined Studies 
Homo- |Hetero-| Homo- | Hetero- 
geneous (geneous | geneous | geneous 
Mined Road Problem 
Percentage above me- | 30% S7% | 21.7%* | 52.6% 
dian* 
M 55.9 65.3 45.9% 64.3° 
SD 3.6 29.9 32.1 29,3 
Number of groups 10 21 23 38 
Student Assistance Fund 
Percentage above me- | 36%* | 80%* 
dian* 
uM 44.2 50.9 
SD 17.5 12.8 l 
Number of groups “ 20 x 
Painters and Inspectors 
Percentage above me- 
dian* 54% | 56% 
uM 16.5 15.1 
SD 10.5 13.9 
Number of groups 13 16 


S The median score was determined from the results of all 
four-person groups, including residual groups not assigned to 
either the homogeneous or heterogeneous types. 

* Percentage or mean difference significant at the .05 level. 


stantially greater than by homogeneous 
groups. (The means are not very different as a 
result of a few very high scoring solutions by 
homogeneous groups.) The Painter and Inspec- 
tor problem was the only one in which no 
differences were found between the two types 
of groups. The generally low scores on this 
problem (an average of about 15 out of a 
possible 45 points with less than 20% of the 
groups achieving scores of 30 or more) point | 
to the restrictive character of this problem. 
Even heterogeneity of personality was insuti- 
cient to force the groups to consider other than 
the immediate problem of settling the per- 
sonal dispute. 

The results in Table 2 for the Change of 
Work Procedure problem again show the 
superiority of the heterogeneous over the 
homogeneous groups. Whereas 65% of the 
heterogeneous groups in the present study 
produced new-modified or integrative solu- 
tions—the higher quality solutions—only 21% 
of the homogeneous groups’ solutions were 
these types. This difference is statistically 
significant at the .05 level. Furthermore, the 
percentage of integrative solutions—the high- 
est quality solution—produced by the hetero- 


TABLE 2 


SOLUTIONS TO THE CHANGE OF WORK PROCEDURE 
PROBLEM BY HOMOGENEOUS AND HEfEROGENEOUS 


Groups 
Type of Solution Homogeneous Heterogeneous 
Integrative 0 (7.7%) * 5 (35.3%) * 
New-Modifed 3° ici: 
New 7 4 
Old 4 2 


® The numbers in parentheses are the percentages of Integra- 
tive solutions obtained from each type of group in the combined 
studies. This percentage difference is statistically significant at the 
.05 level. 

* Chi square test comparing Integrative and New-Modified 
with Old and New solutions is significant at the .05 level. 


geneous groups in both the present and earlier 
’ studies was significantly greater than the per- 
centage produced by homogeneous groups 
| (indicated in parentheses in Table 2). 

Thus on three of the four problems that 
could be scored for quality, substantially 
"more of the heterogeneous than of the homo- 
geneous groups produced better solutions. On 
the fourth problem there was no difference be- 
tween the group types, most solutions being of 
poor quality. 

The Point Distribution problem was de- 
signed to place strain on the heterogeneous 
_ groups by exaggerating the differences among 
the group members. The problem demands 
"agreement on the criteria to be used in assign- 
ing points to the several group members. 
_ Greater conflict was anticipated in the hetero- 
» geneous than in the homogeneous groups on 
the grounds that in the former groups the 
differing personalities would use different 
standards to determine the appropriate 
method of awarding points. Thus the hetero- 
geneous groups were expected either to allo- 
cate the points nearly equally, reflecting an 
inability to resolve the conflict, or to distribute 
"the points widely, reflecting some agreed-upon 

systematic method of awarding the points. 
The data tend to support this prediction. 
Since all groups arrived at some conclusion, 
the standard deviation of points awarded in a 
group was computed and used as the measure 
of outcome. Small scores represent nearly equal 
distributions of points and large scores repre- 
Sent diverse distributions. Although the mean 
standard deviation scores for the two types of 
groups were approximately equal (3.06 for the 
homogenous groups and 4.08 for the hetero- 
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geneous groups), the standard deviation of 
these scores for the heterogeneous groups was 
significantly greater than for the homogeneous 
groups (2.95 for the heterogeneous and 1.72 
for the homogeneous groups). In the hetero- 
geneous groups either points were awarded 
nearly as equally as possible or they were al- 
most all given to one or two members. The 
distribution of points by homogeneous groups 
tended to be more moderate, being negatively 
correlated, in general, to the class standing of 
the members on the three previous examina- 
tions. Thus the problem appears to have 
created greater conflict in the heterogeneous 
than in the homogeneous groups, but certain 
heterogeneous groups were able to resolve the 
conflict. 

Interaction of sex and personality composition. 
Using the combined data of the present and 
the original studies, the interaction of sex mix- 
ture and personality composition was studied 
in its effects on solutions to the Mined Road 
and the Change of Work Procedure problems. 
Mixed-sex as distinct from all-male groups in- 
volve another type of heterogeneity. The re- 
sults indicate that the interaction of sex and 
personality mixture was not the same for the 
two problems and suggest important differ- 
ences in the requirements that the problems 
placed on the groups. 

The results for the Mined Road problem 
are presented in Table 3 for groups in three 
categories of sex composition: All Male, One 
Female, and Two or Three Females. Although 
the means and percentages above the median 
for the heterogeneous groups were larger than 
for the homogeneous groups in all three cate- 
gories of sex composition, the only significant 
difference occurred between the All Male 
groups. The differences between the homoge- 
neous and heterogeneous groups successively 
diminished as the proportion of women in the 
group increased. (Unfortunately, only one 
group of all females was obtained in the 2 years’ 
data.) 

The results for the Change of Work Proce- 
dure problem, presented in Table 4, are divided 
only into All Male groups versus groups with 
One or More Females. For this problem the 
major difference between the homogeneous 
and heterogeneous groups occurred in the 
mixed-sex groups. Proportionately more new- 
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TABLE 3 
HOMOGENEOUS-HETEROGENEOUS DIFFERENCES IN SOLUTION 
IN Groups OF DIFFERENT SEX 
(Combined data: 1959 and present studies) 
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UALITY TO THE MINED Roap PROBLEM 
‘OMPGSITION 


All Male One Female pie cerpatee 
Homo- Hetero- Homo- Hetero- Homo- Hetero- Homo- 
geneous geneous geneous geneous geneous* geneous geneous 
Percentage above median? 12.585" 00572 0.0 33.3 50.0 72.7 21.7% 
M 34.8% 75.1% 37.7 $4.1 64.1 72.2. 45.9% 
SD 35.8 31.3 11.3 24.4 35.3 31.9 32.1 
Number of groups 8 9 7 18 8 11 23 


a noa from one homogeneous group in the 1959 study that consisted of four women are included in the two or three women cal 
b The median score was determined from the results of all four- -person groups, including residual groups not assigned to either 


homogeneous or heterogeneous types. 


* Mean or percentage difference between homogeneous and heterogeneous groups is significant at the .05 level by ¢ test or chi 


test, respectively. 


TABLE 4 
HomocENEous-HETEROGENEOUS DIFFERENCES IN 
Quarry oF SOLUTIONS TO CHANGE OF WORK 
PROCEDURE PROBLEM IN GROUPS OF DIFFERENT 
SEX COMPOSITION 
(Combined data: 1959 and present studies) 


All Male — One or More Females 


‘Type of group Old or Model or Old or Peete oe 
Nias latettve son EE gitar 
Solutions “Sojutions Solutions Solutions 
Homogeneous 7 5 11* 3* 
Heterogeneous 5 3 9* 17* 


* This relationship is significant at the .05 level by chi square 
test. 


modified or integrative solutions—high quality 
solutions—were produced by Mixed-Sex het- 
erogeneous groups than by Mixed-Sex homo- 
geneous groups. The proportions in both types 
of All-Male groups were approximately equal 
and were like those of the Mixed-Sex homoge- 
neous groups. In the present study, moreover, 
all five of the integrative solutions were pro- 
duced by Mixed-Sex heterogeneous groups (see 
Table 2). 

The results suggest that heterogeneity of 
personality may be sufficient to produce high 
quality solutions where, as with the Mined 
Road, the problem presents no restraints on 
possible alternative solutions. The group must 
generate its own potential solutions to this 
type of problem and then decide on the best 
one. Where, however, as with the Change of 
Work Procedure, alternative choices are pre- 
sented in the problem, the addition of sex mix- 
ture to the personality mixture may be re- 
quired to produce high quality solutions. To 


arrive at a solution of high quality to this t 
of problem, the group must recognize thai 
given alternatives are inadequate and 
vent” a better solution. Mixing sexes 
personalities appears to have freed these grou 
from the restraints of the solutions givel 

the problem. ’ 


Acceptance of Solutions 


Although on three of the four quality pi 
lems a higher proportion of heterogen 
groups produced good solutions than did 
homogeneous groups, the reported men 
satisfactions with their solutions showed n 
consistent differences between the two ty 
of groups (see Table 5). Both the mean 
faction ratings and the percentages of group 
in which all members were satisfied ¥ 
closely similar on all the problems except 
Mined Road problem, on which sa 
was somewhat (but not significantly) 
for the homogeneous groups. Thus the 
what higher quality of the heterogent 
groups’ solutions was not obtained at 
sacrifice of acceptance by the group memb 

As might be anticipated from this 
the correlations between quality of sol 
and mean member satisfaction were genei 
rather low. These are shown for the four q 
titatively scored problems in Table 6. Altho 
all of the correlations are positive, only two 0 
the eight achieve statistical significance. 

A more important variable than qualii 
solutions in determining member satisfa 
with the group’s solution is suggested by 
high correlations between the subjects’ 
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TABLE 5 


SATISFACTION WITH SOLUTIONS IN HOMOGENEOUS AND 
HETEROGENEOUS GROUPS 


Homo- Hetero- 
geneous* geneous 

Mined Road Problem 

M 4,47 4,22 

SD 0.37 0.43 

Eene unanimously satis- 80% 51% 

Number of groups 10 21 
Student Assistance Fund 

M 4.28 4,21 

SD 0.67 0.67 

sae LS" unanimously satis- 71% 10% 

er 

Number of groups 14 20 
Painters and Inspectors 

M 4.00 4.05 

SD 0.70 0.57 

ie unanimously satis- 54% 50% 

Number of groups 13 16 
Change of Work Procedure 

M 4.09 4.05 

SD 0.81 0.44 

Percent unanimously satisfied’ 50% 35% 

Number of groups 14 18 
Point Distribution 

M 4.19 4.39 

SD 0.69 0,41 

BS: unanimously satis- 50% 59% 

Number of Groups 16 17 


A None of the differences between group types is significant at 
the .05 level, 

b To determine unanimity of satisfaction a person was con- 
sidered satisfied if he checked either “very satisfied" or “quite 
satisfied.” 


faction with the amount of influence they had 
over the decision and their satisfaction with 
the solution. Table 7 shows that, with two ex- 
ceptions, these correlations, both at the in- 
dividual and the group levels, are all signifi- 
cantly positive. The correlations between the 
subjects’ satisfaction with the solution and 
their actual amount of influence over the de- 
cision (as rated by the other group members) 
were significantly different from zero in only 
3 of the 10 instances and were, in every case, 
smaller than the corresponding correlation be- 
tween satisfaction with solution and satisfac- 
tion with influence. These correlations suggest, 
then, that the subjects are satisfied with the 
decision their group reaches if they feel that 
their perceived amount of influence over the 
decision has coincided with some self-deter- 
mined level of expectation concerning the 
amount of influence they should have. 
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TABLE 6 


CORRELATIONS BETWEEN QUALITY AND MEAN 
SATISFACTION WITH SOLUTIONS 


Homogeneous | Heterogeneous 
a 
Poppee Number Number 
r of r of 
Groups Groups 


Mined Road Problem | .01 10 | .33 21 
Student Assistance : 


Fund 
Painters and Inspectors | .35 13 | .63°* | 16 
Point Distribution +29 16 | .26 17 


* A Pearson coefficient could not be computed on the Change of 
Work Procedure since the solutions were not scored quantitatively. 
The fourfold contingency tables formed in each group type by 
combining Old and New solutions against New-Modified and 
Integrative and taking the number of groups above against below 
the median mean satisfaction showed only slight positive relation- 


* Correlation significantly different from zero at .0S level. 
** Correlation significantly different from zero at .01 level, 


TABLE 7 
CORRELATIONS BETWEEN SATISFACTION WITH SOLUTION 
AND SATISFACTION WITH INFLUENCE 
OvER DECISION 


For Group Means 
Problem To... 
Homo- |Hetero- 
geneous gencous 
Mined Road Problem „47 | .49° 
Pund ote) Gate 
Student Assistance ‘ : 
% aa), Go), 
i t ’ ‘i 
Painters and Inspectors aay | tis) | do) 
Change of Work Proce- .| 46" | .06 
Snt Distribution Gel (otel Gate 
E (16) | 07) 
a SR A 
a Numbers in parentheses are the number of individuals or 
on which each correlation is based. 


* Correlation significantly different from zero at ,0S level, 
ee Correlation significantly different from zero at .01 level, 


Discussion 


The results of this study confirm, for a wider 
sampling of problems, the findings of the 
earlier one that heterogeneous groups are 
relatively superior to homogeneous groups In 
problem solving ability. Even on problems de- 
signed to produce emotional conflict, the het- 
erogeneous groups produced solutions that 
were better than or at least as good as those 
of the homogeneous groups. 

The similar personalities of members of the 
homogeneous groups may have provided them 
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with similar perspectives on each problem. 
Common to all of the homogeneous groups 
was the fact that all the members of each 
group had very similar self-perceptions. To the 
extent that similarity in self-perceptions is 
accompanied by similarity in the perceptions 
of problems and social situations, the number 
of different “directions” (Maier, 1930) avail- 
able to each homogeneous group was much 
less than the number available to each hetero- 
geneous group. With fewer directions available 
the likelihood was reduced that homogeneous 
groups would develop high quality solutions to 
these complex problems. In the heterogeneous 
groups, moreover, the conflict resulting from 
opposing viewpoints may have caused more 
complete solutions to emerge or new ones to be 
invented. Thus we might conclude that, al- 
though one can think of unreasonably extreme 
exceptions (e.g., people who do not speak the 
same language, adults and children), given 
relatively equally able people, the greater the 
differences in perceptions among the group 
members, the higher the quality of their prob- 
lem solving. 

Although other interpretations of these re- 
sults are possible, the interactive effects of 
heterogeneity with respect to sex and per- 
sonality on the solutions of the Mined Road 
and Change of Work Procedure problems seem 
to provide further supporting evidence for 
this view. Mixing sexes in the groups enhanced 
the quality of solutions to both problems, al- 
though in somewhat different ways for each 
problem. 

The differing results of sex and personality 
mixture for the Mined Road and Change of 
Work Procedure problems suggest that differ- 
ences in perspective are most important where 
the task is restrictive in its nature. But even 
where the problem is relatively unstructured, 
where the solutions must be developed by the 
group, the greater the differences the higher 
is the quality of group problem solving. 

The data indicate that the higher quality of 
solutions associated with group heterogeneity 
were not obtained at the sacrifice of their ac- 
ceptance by the group members. It would 
appear, moreover, that the very process by 
which the quality of group problem solving is 
improved, namely, the complete expression of 
such differences, is likely to enhance the ac- 
ceptance of solutions. Having shown that a 
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member’s satisfaction with his group’s 0 ut 
to a problem is a function of his sati 
with his amount of influence over the deci 
we can conclude tentatively that, wh 
ferences in perspective are present 0 
group members, the group’s solution will 
highest quality and will have the most at 
ance where these differences are expres 
finally utilized in making the final di 
This does not mean that each group 1 
must have an equal amount of influence 
the final decision. Rather, all divergent 
points must be given what the member 
ceives as a fair hearing by the group 
employed in making the final decision 
extent that they are seen as relevant, 
every group member has to participate 
in the discussion, as has sometimes i 
gested (e.g., Coch & French, 1958), Each 
member must participate, however, if 
dissatisfied with the decision as it ap, 
be forming and he has some alterna 
proach or modification to offer. 

By separating the factors impo! 
quality and acceptance of solutions i 
way, one can understand how group men 
may be satisfied with very poor group solut 


first case the group may have only a few di 
tions available among its members, but th 
have been expressed completely to the gr 
satisfaction. In the second case a variet 
directions may have been available but 
those which led to the correct solution ¥ 
heard, Dissatisfaction with the grow 
cision stems from the members’ feeling th 
their alternative viewpoints were not evalu 
fairly by the group. 


SUMMARY 


A wider variety of problems was us 
test the generality of Hoffman’s (1959) € 
finding that groups composed of people 
erogeneous with respect to personality 
superior in solving problems to groups 
posed of people with homogeneous pers¢ 
ties. Sixteen homogeneous and 25 hetero 
ous four-person groups were initially 
in the laboratory sections of an underg) 
course in the psychology of human relai 
Homogeneous groups consisted of peop! 
high positive profile correlations among 
scores on the Guilford-Zimmerman Tem 
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ment Survey, heterogeneous groups of people 
with near-zero or negative correlations. The 
group members interacted weekly in case dis- 
cussions, problem solving, and role playing. 

Five different problems were used to test 
the relative abilities of the two types of groups, 
four with some component of quality and one 
only involving acceptance. The subjects were 
also questioned about their satisfaction with 
the solution and with their influence over 
the solution for each problem. 

The following results were obtained: 

1. Heterogeneous groups produced a higher 
proportion of high quality solutions than did 
homogeneous groups to three of the four prob- 
lems with quality components. On the fourth 
problem, there was little difference and both 
types of groups produced poor solutions for 
the most part. 

2. Mixed-Sex groups tended to produce 
higher quality solutions than did All-Male 
groups. 

3. Satisfaction with the problem solutions 
| was about the same in the homogeneous and 
heterogeneous types of groups. 

4. Satisfaction with the solutions was shown 
to be more strongly correlated with the mem- 
bers’ satisfaction with the amount of influence 
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they had over the solution than with the ob- 
jective quality of the solutions or with the 
members’ actual amount of influence, 

It is suggested that solutions with high 
quality and high acceptance can be obtained 
from groups in which the members have sub- 
stantially different perspectives on the prob- 
lem, and in which these differences are ex- 
pressed and used by the group in arriving at 
the final decision. 


REFERENCES 


Coca, L., & Frencn, J. R. P. Overcoming resistance to 
change. In Eleanor E. Maccoby, T. M. Newcomb, 
& E. L. Hartley (Eds.), Readings in social psychol- 
ogy. (3rd ed.) New York: Holt, 1958. Pp, 233-250. 

GUILFORD, J. P., & Zmmerman, W. S. The Guilford- 
Zimmerman Temperament Survey. Beverly Hills: 
Sheridan Supply, 1949. 

Horraan, L. R. Homogeneity of member personality 
and its effect on group problem solving. J. abnorm. 
soc. Psychol., 1959, 68, 27-32, 

Mater, N. R. F. Reasoning in humans: I. On direction. 
J. comp. Psychol., 1930, 10, 115-144, 

Mawr, N. R. F. Principles of human relations. New 
York: Wiley, 1952, 

Mazer, N. R. F, Sorex, A. R, & Mater, A, A, 
Supervisory and executive development, New York: 
Wiley, 1957. 


(Received March 28, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol. 62, No. 2, 408-412 


GROUP COHESIVENESS, COMMUNICATION 
LEVEL, AND CONFORMITY? 


ALBERT J. LOTT 
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AND 


ness within the framework of S-R 

learning theory, predictions regarding 
both its antecedents and consequents have 
been made on the basis of general behavior 
principles (Lott, in press). Cohesiveness is de- 
fined as that group property which is inferred 
from the number and strength of mutual positive 
attitudes among the members of a group. An 
attitude is defined, following Doob (1947), as 
an implicit anticipatory response having both 
cue and drive properties. 

Basic to the present approach is the prop- 
osition that a primary condition for the de- 
velopment of positive attitudes toward persons 
is the receipt of reward in their presence. This 
proposition was tested in a previous study 
(Lott & Lott, 1960) in which confirming evi- 
dence was obtained. The present investigation 
is concerned, not with this determinant of co- 
hesiveness, but with two of its consequences. 
It is predicted that a group’s level of com- 
munication, and the tendency of its members 
to conform to the dominant group opinion on 
an issue, will be positively related to the degree 
of cohesiveness present within the group. 

The predicted relationship between cohesive- 
ness and communication level is derived from 
the assumed drive property of attitudes (Doob, 
1947). Following from such an assumption is 
the expectation that groups composed of mem- 
bers with strong positive attitudes toward one 
another (i.e., high cohesive groups) should be 
characterized by a generally high drive level 
reflected in the amount of communication 
among the members. The second hypothesis 
follows from the expectation that the greater 
the cohesiveness of a group, the more can in- 
dividual members function as secondary rein- 
forcers for the behavior of one another and, 


B placing the concept of group cohesive- 


1 Funds for research assistance on this study were 
provided by the Undergraduate Participation Program 
of the National Science Foundation. The authors would 
like to acknowledge the aid, during various stages of 
the investigation, of Arnold Jackman, a senior at the 
University of Kentucky. 


BERNICE EISMAN LOTT 
Kentucky State College 


thus, produce conformity.* This proposition is 
derived from the following considerations: the 
greater the cohesiveness of the group, the more 
have the members (person-stimuli) been pres- 
ent during past conditions of reinforcement; 
it is under such conditions that secondary 
reinforcers develop (Hull, 1951, Ch. 8); in 
addition, since it is assumed that the secondary 
reinforcing property of a stimulus depends 
upon the ability of the stimulus to evoke an 
anticipation of reward (Hull, 1952, Ch. 5), or 
a positive attitude, the more cohesive the 
group the more should its members be able to 
function as secondary reinforcers for one an- 
other and, thus, the more should approval of a 
behavior (opinion) by members serve to main- 
tain it and disapproval to extinguish it. 

In the present investigation natural, exist- 
ing groups of friends were put into a semicon- 
trolled laboratory situation. A positive cor- 
relation was predicted between measures of 
cohesiveness and both communication level, 
and conformity. 


METHOD 
Subjects 

Subjects were recruited from 15 on-going student 
organizations at the University of Kentucky and 
Kentucky State College? from each of which a small 
group of 6-10 friends was obtained. Descriptive in- 
formation concerning each of the groups is presented 
in Table 1. 

For purposes of validating a Group Cohesiveness 
Index, to be described below, seven groups of strangers 
each made up of 6-10 individuals brought together for 
the first time, were also used. These subjects were 
recruited from separate laboratory sections of intro- 
ductory psychology at the University of Kentucky. 


Procedure 


Each group met with the investigators in a room 
arranged so that members sat in a circle, with one 


2 Conformity behavior is defined as a change # 
behavior (opinion) following information about the i 
havior of others, where this change is in the direction al 
the information received. f 

3 At the time of the investigation, the student body 
of Kentucky State College was entirely Negro. Al 
subjects recruited from the University of Kentucky Were 
white. 
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TABLE 1 
DESCRIPTION OF Groups 
Male: 
Group Type Ese] N | School 
Ratio 
A Religious 4:5 | 9| College 
B Debating 5:1 | 6| College 
G Sorority 0:10| 10 | College 
D Academic 6:0 | 6| College 
E Athletic 0:7 | 7 | College 
F Religious 3:7 | 10 | College 
G Fraternity 9:0 | 9 | College 
H Veteran’s 10:0 | 10 | College 
1 Social 0:6 | 6| Colle; 
J Nurses (1)* | 0:6| 6| University 
K Nurses (2)* | 0:6 | 6| University 
L Nurses B a |0:6| 6| University 
M Nurses (4)" | 0:6 | 6| University 
N Social 1:5 | 6 | University 
oO Sorority 0:7 | 7 | University 
Total 15 38: 72)110 


® The four groups of nurses were separate friendship groups 
from within the larger freshman nursing class. 


investigator (Æ) slightly behind the circle and the 
other (Ea) at a table apart from the group. Each 
subject was given an identifying letter to pin on his 
(her) shirt. The letters were distributed in alphabetical 
order, clockwise around the circle, starting from the 
subject seated to the left of Zi. 

The purpose of the investigation was explained to 
the subjects as being that of obtaining the opinions of 
students on a subject of current interest to educators. 
Before the discussion topic was introduced, however, 
the group members were told: “We would like to know 
how each one of you feels about every other member of 
your group present here now,” and the following in- 
strument was administered: a booklet containing 
rating forms on which each individual could rate every 
other on a nine-point scale, ranging from “dislike 
very much” (1) to “like very much” (9). The booklet 
contained a separate page for each member, other than 
the rater, whose identifying letter simply appeared on 
the booklet’s cover page. The last page of the booklet 
called for the rater’s “attitude toward the group as a 
whole.” The person rated on each page was identified 
by letter and, since no record was made of the names 
associated with letters, subjects were assured of com- 
plete anonymity, The subjects were also informed of 
the fact that the sequence of letters within the booklets 
had been randomized so that each individual would be 
rating different members of the group at different times. 

After the booklets were collected, Ei presented the 
topic for discussion as follows: 

It is being said, by many professional educators as 

well as other general observers, that today’s average 

college student is interested only in himself and takes 
little or no interest in the problems facing his com- 
munity, his country or his world. It is being said that 
the college student is too contented, is lazy, and self- 
centered. We would like to have your opinions on 
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this question. Do you agree; do you disagree; why? 

Because we want only your opinions, ‘we will not 

take part in the discussion. We’ll stop you at the 

end of half an hour. 

While Z, took content notes, Æ; recorded communi- 
cation frequency by tallying separate remarks made 
within 5-second periods on a standard form suggested 
by Gibb, Platts, and Miller (1951). A separate tally was 
recorded for each comment made within a 5-second 
interval, regardless of its total duration. Thus, if one 
subject spoke continuously for 5 seconds, only one tally 
was made; if several subjects spoke during the same 
5-second interval, one tally was made for each speaker; 
and if one subject spoke for 15 seconds, three tallies 
were made, one for each 5-second interval utilized by 
him. Each tally identified the speaker by letter. 

At the end of 30 minutes, the discussion was termi- 
nated and an opinion scale was administered to each 
subject preceded by the following instructions: 
“ , .. Sometimes views expressed openly in discussion 
are not quite the same as when expressed privately. 
We would therefore like, now, to get an expression of 
opinion from each one of you individually.... These 
opinions are anonymous. Please do not put either your 
name or your letter on the paper, just your opinion. 
Let me repeat the statement: ....” The opinion scale 
consisted of nine points from “strongly disagree” (1) 
to “strongly agree” (9). The scales were collected in 
such a way as to preserve the identifying letter order 
of the seated subjects, thus, making it possible to later 
match the responses to this scale with responses made 
to one administered subsequently. 

Following collection of the scales, subjects were 
requested to maintain a brief period of complete silence 
and thoughtfulness. The atmosphere of this period was 
augmented by a recorded musical background provided 
by the first 5 minutes of Prokofiew’s Seventh Symphony. 
During this period Æ, tabulated the opinion scale 
responses which were reported to the group at the end 
of 5 minutes, The reported results were always directly 
op posite to the actual findings. Thus, a group which had 
actually indicated a tendency towards disagreement 
with the statement was told that the results showed 
agreement, and vice versa. 

After the “results” were presented to the group, the 
musical interlude was explained as follows: “There is 
evidence to suggest that sometimes just sitting quietly 
permits a person to think of things which he did not 
think of earlier, in the middle of a discussion, This 
might have happened to some of you. Just in case it 
did, we would like to give you another chance to express 
your opinion on the issue which the group discussed. 
Your opinions are, again, completely anonymous . . 4) 
A second opinion scale was then administered and 
collected in the same manner as described above. The 
subjects were asked to make any comments they wished 
on the back of their opinion blanks. 


Measures 


From the data gathered by means of the instruments 
described above, measures of cohesiveness, communi- 
cation level, and conformity were obtained for each 


group. 
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Cohesiveness. From the booklets which were ad- 
ministered at the beginning of the experimental session, 
the attitude of each group member toward every other 
member was ascertained. In obtaining an index of 
group cohesiveness from these individual responses, 
each member was paired with every other member and 
their ratings of each other were averaged. To simplify 
this procedure, the original nine-point scale was con- 
verted to one ranging from —4 (Point 1) through 0 
(Point 5) to +4 (Point 9). Thus, if Person A had 
circled Point 8 on the scale for Person B and Person B 
had circled Point 5 for Person A, their “mutual score” 
or pair average was +-1.5, The Mean of all the “mutual 
scores” or pair averages was then obtained as the 
Group Cohesiveness Index (GCI). 

This measure of cohesiveness follows directly from 
the conceptual definition presented earlier. It takes into 
account the strength and direction of attitudes of 
each member toward every other and, thus, reflects the 
mutual attitudes among the group members. 

Communication level. A description of the tally system 
used in recording frequency of communication within 
each group was given above. From the tallies, a group 
measure of communication level was obtained by 
calculating the Median number of verbalizations made 
by the individual group members. A verbalization is 
defined as a single remark, regardless of its duration. A 
Median was utilized as the group score because it is 
not influenced by extreme variations in individual 
group member scores which probably reflect differential 
“yerbalness” of individual participants. 

Conformity. The two sets of opinion scales on the 
discussion topic, which were obtained from each group, 
were compared, Since the second scale followed the 
presentation of contrived information regarding the 
general direction of results on the first scale, a post- 
information opinion could be compared with a pre- 
information opinion for every group member. The 
percentage of group members who changed their opin- 
ion in the direction of the contrived information was 
taken as the measure of conformity within each group. 


RESULTS 


Validity of the GCI 


The validity of the GCI was tested by com- 
paring the scores obtained on the 15 natural 
friendship groups with those obtained on 7 
groups of strangers. White’s rank test (Ed- 
wards, 1954) was used to test the significance 
of the difference between groups; the obtained 
T' value of 31 is significant at the .01 level 
(two-tailed). The group scores on the GCI for 
the strangers ranged from .8-1.7, in contrast 
to those for the natural groups which ranged 
from 1.5-3.6. The ability of the GCI to clearly 
differentiate between groups of strangers and 
groups of friends, where differences in cohesive- 
ness are to be expected, was, thus, established. 

An additional validation test of the GCI 
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TABLE 2 


SCORES OBTAINED BY GROUPS ON MEAS 
COHESIVENESS, COMMUNICATION, AND 
CONFORMITY 


Group GCI 

o 3.6 44.0 
J 3.5 88. 

I 3.2 16.5 
K 3.1 42.0 
G 3.1 6.0 
C 3.1 5.5 
N 2.9 32.0 
M 2.8 18.5 
B 2.7 15.0 
D 2.3 15.0 
F 2.2 14.5 
L 2.1 52.0 
H 2.1 “ns 
A 1.8 6.0 
E 1:5 9.0 


was made possible by examining res] 
the last question which had been asked 

subjects in the attitude-toward-group-n 
bers booklet: Will you now check your a 
towards the group as a whole? (on a n 
scale). Mean group scores to this q 
which ranged from 7.3-9.0, were coi 
with CGI scores, resulting in a tau 
significant at less than the .001 level. 


Cohesiveness and Communication Level — 


The scores obtained by each natural 
ship group on these two variables are pre 
in Table 2. The relationship between co 
ness and communication level was 
means of Kendall’s rank correlation co 
tau (Siegel, 1956). The obtained correl 
A3 is significant at the .02 level (two- 


Cohesiveness and Conformity 


The conformity score for each group 
given in Table 2. A tau analysis of the 
ship between conformity and group co 
ness yielded a coefficient of .54, sig 
less than the .01 level of confiden 
tailed). 


Discussion 


The results of this investigation 
that significant positive relationships 
tween group cohesiveness and both ci 


GROUP COHESIVENESS, COMMUNICATION LEVEL, AND CONFORMITY 


cation level and conformity behavior. There- 
fore, given knowledge of the degree of a group’s 
cohesiveness, as the concept has been here de- 
fined and measured, reliable predictions of two 
significant types of group behavior can be 
made. 

The authors are not aware of other reports 
of a direct relationship between the quantita- 
tive level of communication within a group 
and its degree of cohesiveness, although a 
qualitative relationship between high and low 
cohesiveness and communication patterns has 
been shown (Back, 1951). Both Newcomb 
(1953) and Festinger (1950), however, have 
suggested that frequency of communication 
between persons should vary directly with the 
degree to which they are attracted to one 
another (Newcomb) or to the group (Fest- 
inger). The present approach considers com- 
munication as only one manifestation of a 
group’s general drive level, to which the drive 
components of positive attitudes among group 
members contribute. Other reflections of drive 
level, such as task persistence, productivity, 
etc., are also expected to show positive re- 
lationships with cohesiveness. 

Empirical support for a positive relationship 
between group cohesiveness and conformity 
among group members can be found in the 
literature (e.g, Berkowitz, 1954; Festinger, 
Schachter, & Back, 1950; Schachter, 1951), 
but negative results have also been reported 
(Bovard, 1953; Downing, 1958). Although the 
matter cannot be gone into fully here, the 
present approach would not predict a positive 
relationship between cohesiveness and con- 
formity in all instances, but only in those 
situations where the stimulus features are 
similar, to some degree, to those which were 
present when group members acquired their 
secondary reinforcing characteristics. The 
principle of stimulus generalization, in other 
words, should be employed to predict the 
effectiveness with which group members can 
act as secondary reinforcers for one another in 
diverse situations. 

Evidence supporting the validity of the GCI 
used in this investigation has already been 
presented. In view of the high correlation 
reported between the GCI and responses to 
the question regarding attitude toward group 
as a whole, the question arises as to whether 
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the latter might not do just as well as a meas- 
ure of cohesiveness, as the former. Just such a 
simple measure has been suggested by Gross 
and Martin (1952). Points that might be 
considered in answering this question are the 
following: examination of the rating that 
individuals made of their groups-as-a-whole 
shows a marked tendency for use of the upper 
portions of the scale whereas a similar restric- 
tive tendency is not found in ratings made of 
individual group members; the Gross-Mar- 
tin Question predicted more poorly to the 
dependent variables of the present investiga- 
tion;* and the GCI is directly derived from 
our conceptual definition, 


SUMMARY 


The predictions that both communication 
level and conformity behavior within groups 
vary positively with the degree of group co- 
hesiveness were clearly confirmed. Measures 
of the three variables under investigation were 
obtained, in a laboratory setting, on 15 natural 
friendship groups, each of which engaged in a 
30-minute discussion of a presented topic. Data 
pertaining to the validity of a cohesiveness 
index, derived from the conceptual definition 
of cohesiveness, were also obtained. 

Since the hypotheses tested and supported 
in this investigation are part of a formulation 
of group cohesiveness within the framework of 
learning theory, the obtained results lend sup- 
port to attempts to predict group behavior on 
the basis of more general behavior principles. 
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SOCIAL DEPRIVATION AND RIGIDITY IN THE 
PERFORMANCE OF FEEBLEMINDED CHILDREN! 


EDWARD ZIGLER 
Yale University 


HE view that feebleminded individuals 

are more rigid than normal individuals 

of the same MA has been advanced by 
Lewin (1936) and Kounin (1939, 1941a, 1941b, 
1948). Recent findings (Plenderleith, 1956; 
Stevenson & Zigler, 1957), however, have 
failed to support the Lewin-Kounin formula- 
tion. In evaluating the disagreement of these 
findings with those of Lewin and Kounin, 
Stevenson and Zigler (1957) noted that in the 
later studies the subject was required to learn 
a response, while in the earlier studies, the 
subject was required to perform a response in 
compliance with instructions. 

The hypothesis was advanced that the 
differences in performance of normal and fee- 
bleminded subjects reported by Lewin and 
Kounin were related to motivational differ- 
ences between the groups, rather than to 
differences in cognitive rigidity. This hypothe- 
sis was based on the assumption that institu- 
tionalized feebleminded children tend to have 
been relatively deprived of adult contact and 
approval and hence have a higher motivation 
to secure such contact and approval than do 
normal children, This assumption appears 
congruent with the view advanced by other 
investigators that both institutionalized fee- 
bleminded and institutionalized normal in- 
dividuals exhibit an increased desire to inter- 
act with adult figures (Clark, 1933; Sarason, 
1953; Skeels, Updegraff, Wellman, & Wil- 
liams, 1938). 

1 This report is adapted from a dissertation sub- 
mitted to the graduate faculty of the University of 
Texas in partial fulfillment of the requirements for the 
degree of doctor of philosophy. Harold Stevenson 
initiated with the author a series of experiments which 
led directly to this study. To Louis Moran, who 
directed the study, and to Harold Stevenson, the writer 
is greatly indebted for their continuous constructive 
criticism. 

The study was made possible through the coopera- 
tion of Malcolm J. Farrell, Superintendent, and the 
staff of the Walter E. Fernald State School at Waverly, 
Massachusetts. Special thanks go to Catherine E. 
Chipman, principal psychologist, and to John H. 
Davidson, staff logist, for their conscientious 
work in rating the subjects on the social deprivation 
scale used in the study. 


Zigler, Hodgden, and Stevenson (1958) em- 
ployed the above-stated assumption in deriv- 
ing the hypothesis that interaction with an 
adult and adult approval provide greater rein- 
forcement for the responses of institutionalized 
feebleminded subjects than they do for those 
of normal subjects. Predictions concerning the 
performance of feebleminded and normal sub- 
jects on instruction-initiated tasks made on 
the basis of this hypothesis were supported. 
The findings of Gewirtz and Baer (1958a, 
1958b) provide further evidence for the rela- 
tionship between social deprivation and the 
effectiveness of adult approval as a reinforcer. 

The studies of Zigler et al. (1958) and Ge- 
wirtz and Baer (1958a, 1958b) lend support 
to the view that the reported differences in 
the incidence of “rigid” behavior between 
normal and feebleminded subjects of the same 
MA are a function of the greater social depri- 
vation experienced by feebleminded subjects 
rather than of the inherent rigidity of the 
feebleminded. A direct test of this position 
is suggested. If the rigid behaviors found in 
feebleminded subjects are a result of the 
social deprivation the subjects have experi- 
enced, then within a feebleminded population 
a relationship should exist between the degree 
of deprivation experienced and the amount of 
rigidity manifested. The following hypothesis 


is proposed: 


The greater the amount of pre-institutional social 
deprivation experienced by the feebleminded child, 
the greater will be his motivation to interact with an 
adult, making such interaction and any adult approval 
it more reinforcing for 


His responses (has for the responses of a fecbleminded 
child who has experienced a lesser amount of social 
deprivation. 


The methodological difficulties entailed in an 
adequate test of this hypothesis have been 
discussed at length elsewhere (Zigler, 1958). 
The problem of adequately defining “social 
deprivation” merits special attention. As 
others have noted (Gewirtz & Baer, 1958a), 
the concept of social deprivation has been ap- 
plied loosely to certain events in early child- 
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hood which in turn are characterized as be- 
ing antecedent to certain social behaviors 
(Bowlby, 1951; Goldfarb, 1945; Ribble, 1944; 
Spitz, 1945). Gewirtz and Baer (1958a, 1959b) 
avoided such looseness by equating “social 
deprivation” to a brief period of social isola- 
tion. They thus reduced deprivation to a 
short-term operation often employed in gen- 
eral behavior theory in which deprivation 
means “‘the unavailability to an organism of a 
given reinforcer for a period sufficient to effect 
an increase in behaviors for it.” As Gewirtz 
(1957) has pointed out in another paper, how- 
ever, such a short-term contemporaneous 
model is not the appropriate one when dealing 
with the concept of deprivation as employed 
by such investigators as Spitz, Ribble, Gold- 
farb, and Levy. To these latter investigators 
deprivation refers to long-time deficits and 
hence an appropriate definition of deprivation 
must reflect conditions operative through 
wide segments of time. 

In dealing with the social deprivation that 
feebleminded children have experienced, a 
measure that would reflect such long-term 
deficits is required. Since the population of 
events which constitutes social deprivation has 
never been adequately delimited, the proce- 
dure employed in the present study was that 
of allowing raters to evaluate the entire pre- 
institutionalization history of the individual 
for degree of social deprivation, while at the 
same time having them note the specific fac- 
tors in the history which determined the rat- 
ing. It was hoped that this procedure would 
result in not only an assessment of social dep- 
rivation for each subject, but also in the ex- 
traction of a universe of specific events, the 
occurrence or nonoccurrence of which could 
be employed as a definition of social depriva- 
tion in future investigations. 

The present study used a two-part game 
very similar to one employed by Zigler et al. 
(1958). On Part I, the subject was instructed 
to make one type of response and was allowed 
to continue making this response until satiated 
or until he had made the maximum number of 
responses allowed by the game. On Part II, 
the subject was instructed to make a different 
type of response to the same stimuli and 
again was allowed to perform until satiated or 
until he had made the maximum number of 
responses allowed by the game. Two condi- 
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tions of reinforcement were employed. In ¢ 
condition the subject performed Part I of i 
game under Nonsupport (attention only) 
then performed Part II under Support (sı 
portive comments). This was designated t 
Nonsupport to Support condition (NS-S). 
a second condition the subject performed 
I of the game under Support and Part 
der Nonsupport. This was designated the St 
port to Nonsupport condition (S—NS). 

From the hypothesis advanced in this s 
and by employing the above game and com 
tions of reinforcement, certain predictions ¢ 
be made. These predictions and their ration 
are presented below: 


1. The more socially deprived subjects perfo 
longer on the game than the less socially depriv 
subjects under both NS-S and S-NS conditions. T 
prediction was derived directly from the hypoth 
that the greater the amount of social deprivati 
experienced, the greater is the motivation to inti 
with adults. Since both NS (attention) and S (sup 
comments) are reinforcing, they both should 
greater reinforcing effect on the more socially depriv 
subjects than on the less socially deprived subjects. | 

2. The more socially deprived subjects make a low 
proportion of errors than the less socially deprit 
subjects. If, as is hypothesized, more socially depri 
subjects have a higher motivation to secure 
interaction and approval, they should show a gri 
compliance with the experimenter’s instructions. 
greater compliance should lead to fewer errors. 

3, The more socially deprived subjects more fi 
quently make the maximum number of responses 0 
the game. This prediction was derived from two sources 
since it was hypothesized that the social reinfore 
being offered would be more reinforcing for the respon 
of these subjects who have experienced the great 
amount of social deprivation, the more likely it shou 
be that these social reinforcers would continue to 
reinforcing throughout the maximum number ¢ 
responses allowed; and, if, as was assumed, higher mo 
vation to secure adult interaction and approval esuli 
in greater compliance with the experimenter’s insi 
tions, the more socially deprived subjects might feel 
such compliance involved completing the game. 

4. The increase in time spent in Part II over ts 
spent in Part I of the game is greater for the mol 
socially deprived than for the less socially depri 
subjects under both NS-S and S-NS conditions. 4 
prediction for the NS-S condition is supported i 
findings of an earlier study (Zigler et al., 1958), whl 
indicated that the S condition is more reinforcing © 
the NS condition. It would follow that this increase 
reinforcement would have a greater effect on © 
more socially deprived subjects due to their great 
motivation to attain this reinforcement. The predi 
for the S-NS condition was based on the assum] 
that a child switched from an S to an NS condition wo! 
evidence an increase in output under the NS conditi 
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in the hope that such an increase would lead to a 
reinstatement of S. Since the more socially deprived 
subjects were considered to have fhe higher motiva- 
tion for the reinstatement of S, it was expected that 
they would show a greater increase in time spent on 
Part II than would the less socially deprived subjects 
under the S-NS condition 


Tn addition to testing these predictions, the 
present study was directed toward an examina- 
tion of the differences in performance related 
to the two conditions of reinforcement. While 
it was assumed that there would be an increase 
in the number of responses in Part II over 
that in Part I under both the NS-S and S-NS 
conditions, other differences in performance 
that emerged under the two conditions also 
were of interest. An examination of such differ- 
ences was undertaken in an effort to furnish a 
conceptual framework within which to inter- 
pret the finding of Zigler et al. (1958) of an 
increase from Part I to Part II when both 
parts were given under an identical S condi- 
tion. 


METHOD 


Subjects 


The subjects were 60 feebleminded children residing 
at the Walter E. Fernald State School in Waverly, 
Massachusetts. The conditions for inclusion in the 
sample were that the child have a diagnosis of mental 
deficiency, familial type; be free of any gross motor or 
sensory disturbances; have a CA between 8.0 and 14.0 
years, have an MA between 4.0 and 8.0 years; and 
have been a resident of the school for at least one month., 
The school’s admission ledger was employed to select 
the sample. Beginning with the most recent admissions, 
every individual who met the above requirements was 
included in the sample. Names were chosen in order of 
admission with the sampling procedure continuing 
until a group of 30 boys and 30 girls had been formed. 
The birth date, MA, and IQ for each child were taken 
directly from the school’s records. Each child’s IQ 
was used to adjust his MA for the time interval that 
had intervened between his MA rating and his partici- 
pation in the present study. In all cases the 1937 revi- 
sion of the Stanford-Binet had been used to obtain 
the original MA ratings. 


Experimental Game (M arble-in-the-Hole) 


The game was repetitive and monotonous and was 
constructed to reduce relatively quickly the subject’s 
desire merely to “play the game,” thus allowing the 
assumed difference in motivation for interaction with 
the experimenter to become the dominant variable in 
determining performance. The game was extremely 
simple, and successful performance depended primarily 
upon compliance with the experimenter’s instructions 
rather than upon learning. The rules of the gameallowed 
the subject to stop playing whenever he wised. Al- 


415 


though the game had no defined goal, there was the 
implicit but distant goal of inserting all 400 marbles 
and thus emptying a container. This game was very 
similar to the Peg-in-the-Hole game used by Zigler 
and his colleagues (1958). 

In Marble-in-the-Hole, the subject’s task was to 
insert a marble of one color into one hole and a marble 
of another color into a second hole of an enclosed box, 
The apparatus included: a green box 9 X 6 X 6 inches 
with two small square holes along the middle of the lid 
of the box and separated by 2 inches; 400 marbles, 200 
blue and 200 orange; a red open container 3 X 4 X 4 
inches to hold the marbles. 

Part I. The container filled with 400 marbles and 
the box were placed in front of the subject. The experi- 
menter said: 

This is a game that we call “Marble-in-the-Hole.” 

T'll tell you how to play it. You see these marbles. 

Some of them are blue and some of them are orange. 

They go in these holes. The blue ones go in the 

front hole and the orange ones go in the back hole, 

(The experimenter points to the appropriate holes.) 

Now show me a blue marble. Put it in the hole it goes 

in. Now show me an orange marble. Put it in the hole 

it goes in. You can put as many marbles in the holes 
as you want to. You tell me when you want to stop 
and we’ll play a new game. 
The subject then played the game until he indicated 
that he wished to stop either by telling the experimenter 
that he wanted to stop; or not inserting a marble for 
30 seconds. 

Part II. After the subject indicated that he wished 
to stop, the experimenter emptied the marbles from 
the box back into the container and then said; 

Now I'll tell you how to play this game. This time 

we put the orange marbles in the front hole and the 

blue marbles in the back hole. Put an orange marble 
where it goes. Now put a blue marble where it goes. 

You can put as many marbles in the hole as you 

want to. You tell me when you want to stop, 

The subject was allowed to continue until he told the 
experimenter that he wished to stop or he did not insert 
a marble for 30 seconds. 

The color of the marbles and the particular holes 
into which they were to be placed were randomized 
among the subjects so that all possible combinations 
and switches were used. The maximum number of 
marbles that could be inserted on each part of the game 
was 400, 


Procedure 


Performance variable. The subjects were tested 
individually. The experimenter brought the subject into 
a room which contained a small table and two chairs 
and told him: 

We're going to play some games today. They are fun 

games and I think that you will like them. You will 

be able to play each game as long as you want to. 
The subject was seated at the table and the experi- 
menter arranged the materials and gave the instructions 
for Part I of the game. 

Each subject played one part of the game under an 
NS condition and the other part under an S condition. 
In the NS condition the instructions were given and 
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the experimenter did not interact further with the 
subject during the games. No verbal support was 
given to the subject and care was taken to avoid nod- 
ding, smiling, or giving other types of nonverbal support. 
In the S condition the instructions were given and the 
experimenter continued after this to interact with the 
subject. This interaction included such nonverbal 
support as smiles and nods and verbal support in the 
form of comments about the subject’s performance. 
The following comments were used: “You really know 
how to play this game,” “That’s fine,” “Very good,” 
“You really know how to play Marble-in-the-Hole, 
don’t you?” These statements were made in a random 
order, one after every tenth marble was inserted. 

Half the subjects played Part I of the game under 
the NS condition and played Part II under the S condi- 
tion (NS-S group). The other half of the subjects 

Part I of the game under the S condition and 
played Part II under the NS condition (S-NS group). 
The subjects were systematically assigned to the two 
groups to control for MA, CA, length of institutionaliza- 
tion, and sex in each group. The composition of the 
two groups may be seen in Table 1. 

The experimenter recorded the number of re- 
sponses, the total time spent, and the number of errors 
for each part of each game. At the conclusion of the 
experiment, every child was told he had done very well. 

‘Social deprivation variable. Two experienced psy- 
chologists (the school’s principal psychologist, with 29 
years of experience at the school, and a staff psy- 
chologist, with 3 years experience at the school) were 
asked to read the social history of each subject, avail- 
able in the school’s files, and then independently to rate 
each subject on a social deprivation scale. Each rating 
was made on a 60-cm. continuous line evenly subdivided 
for reference into six areas, “very protected,” “pro- 
tected,” “slightly protected,” “slightly deprived,” 
“deprived,” and “very deprived.” The social depriva- 
tion score for each subject was the average of the ratings 
of the two judges. The possible range of scores was from 
1 (extreme of “very protected”) to 60 (extreme of “very 
deprived”). 

‘The judges were not instructed as to what consituted 
“deprivation” or “protection,” beyond being told that 
it was concerned with the amount and quality of the 
interaction that the child had with adult figures. The 
judges were asked to list for each subject the specific 
factors in the subject’s social history which influenced 
their rating for that subject. This last procedure was 
included in the hope that such a listing would uncover 
specific factors which eventually could be employed in 


TABLE 1 


Composition OF THE NS-S anp S-NS 
Groups oF SUBJECTS 


ean 
Number | Mean CA 
Group |w | cA. | MA Mean) Months lgan Ad 
(in yr.) Institu- i ya 
yr.) tionalized | Mission 
Nonsupport- 30 | 10.9 | 6.3 | 59.0 23.4 8.9 
Support 
Support-Non- | 30 10.7 | 6.1 | 57.3 22.9 8.8 
support 
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purifying and making more operational the social 
deprivation variable. 


The judges were cautioned against using any data : 


other than that contained in the pre-institutionaliza- 
tion history of the child in making their ratings. They 
had no knowledge of the subject’s performance on the 

imental task, and the experimenter had no 
knowledge of the subject’s social deprivation ratings 
until all subjects had completed the experiment. 


RESULTS 


The game used in the study proved to be of 
interest to the subjects. Although there was 
great variability in the length of time the 
subjects played the game, there were no 
noticeable differences among the subjects in 
their initial attitudes toward it. No subject 
mentioned that Part II of the game was simi- 
lar to Part I, and all responded to Part II as 
if it were a new game. 

Social Deprivation Variable 

The correlation between social deprivation 
ratings of the two judges was .77. The judges 
reported that it was difficult to rate certain 
subjects because of the meager amount of in- 
formation contained in their social histories. 
Both judges commented upon the high pro- 
portion of subjects who had experienced con- 
siderable amounts of social deprivation. The 
combined ratings of the judges placed 18 sub- 
jects in the “very deprived” range of the social 
deprivation scale, 19 subjects in the “de- 
prived” range, 10 subjects in the “slightly de- 
prived” range, 1 subject in the “protected” 
range and 0 in the “very protected” range. 
Judges were instructed also to list for each 
subject the specific factors in the subject’s 
social history which influenced their rating for 
that subject. The most frequent factors listed, 
and the frequency with which one or both 
judges noted them are presented in Table 2. 
The mean of the social deprivation scores of 
the NS-S group did not differ significantly 
from that of the S-NS group. 

Since the purpose of the study was to com- 
pare the performance of the more socially de- 
prived subjects with that of the less socially 
deprived subjects, subjects were divided at 
the median of the social deprivation scores 
(44.5). Two groups were formed in this man- 
ner, a high socially deprived group ( 
group) and a low socially deprived group 
group). The range of social deprivation scores 
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TABLE 2 
FREQUENCY WITH WHICH ONE OR BOTH JUDGES 
NOTED A SPECIFIC FACTOR AS INFLUENCING 
THEIR RATINGS 


Factors in Social History N 


Child comes from an orphanage or has lived 20 
in several homes 

Child’s parents are divorced or ted 16 

Child was abused (physical punishment, sex 14 
play) or neglected (inadequately fed or 
clothed) to the extent that legal action was 
taken to remove him from his home 

Child comes from original home in which he 13 
experienced considerable abuse or neglect, 
but no legal action was taken to remove 
child from the home 

Mother or father institutionalized (mental 12 
hospital, colony for the feebleminded, or 


jail) 

Child is illegitimate 

Mother verbalized her negative feelings to- 
ward the child 


wo 


was from 18 through 44 for the LD group and 
from 45 through 56 for the HD group. 


Analysis of Predictions 


The first prediction was that the more 
socially deprived subjects would spend a 
greater amount of time on the game than 
would the less socially deprived subjects. The 
difference in the average time spent on the 
game by the HD group (1,039.8 seconds) and 
LD group (655.5 second) was 384.2 seconds, 
which is a significant difference (¢ = 2.39, 
p < .01). (In all analyses of predictions one- 
tailed tests of significance were employed.) 

The findings relative to the second predic- 
tion, that the more socially deprived subjects 
would make a lower proportion of errors than 
the less socially deprived subjects, were not 
found to be significant. Error scores were com- 
puted by dividing the total number of errors 
each subject made by the total number of 
marbles he inserted. Because of the markedly 
skewed distribution of error scores, chi square 
was employed to test this prediction. The 
median of the error scores (.015) was used as 
the dividing point in setting up a 2 X 2 con- 
tingency table. In the LD group, 12 subjects 
had an error score at or below the median, 
while 19 subjects had an error score above the 
median. In the HD group, 18 subjects had an 
error score at or below the median, while 11 
subjects had an error score above the median. 
Although this difference was in the predicted 
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direction, it did not reach the .05 level of sig- 
nificance (X = 2.40; p < .10). 

The third prediction was that the more 
socially deprived subjects would more fre- 
quently make the maximum number of re- 
sponses on the game. In the LD group, 8 of 
the 31 subjects made the maximum number of 
responses the game allowed (inserting 400 
marbles) on either Part I or Part II of the 
game, while 16 of the 29 subjects in the HD 
group did so. This difference was significant 
(X? = 4.23; p < .025) and confirmed the 
third prediction. 

The fourth prediction, that the increase in 
time spent on Part II over that spent on Part 
I of the game would be greater for the more 
socially deprived subjects than for the less 
socially deprived subjects, also received con- 
firmation. The subjects who made the mazi- 
mum number of responses on both Part I and 
Part II (five in the LD group and five in the 
HD group) were not included in the test of 
this prediction since their performance pre- 
cluded any meaningful increases or decreases 
from Part I to Part II. The difference in the 
average time increase from Part I to Part II 
by the HD group (240.3 seconds) and the LD 
group (113.7 seconds) was 126.6 seconds, which 
was significant (¢ = 1.87; p < .05). The per- 
formance on the game by the HD and LD 
groups is shown in Figure 1, in which the aver- 
age amount of time spent on Part I and Part II 
is presented. 

In a further analysis of the fourth predic- 
tion, differences between groups in the number 
of subjects who did not complete Part I, but 
who did complete Part II, were noted. In the 


@ HIGH DEPRIVATION 


© LOW DEPRIVATION 


os | 


PART 2 


MEAN TIME IN SECONDS 


PART I 


Fic. 1, Mean time in seconds spent on Part 1 and 
Part 2 of the game by the HD and LD groups. 


418 


LD group, 23 subjects did not perform the 
maximum number of responses on either part, 
while 3 subjects made the maximum number 
of responses on Part II after making less than 
the maximum number of responses on Part I. 
In the HD group, 13 subjects did not perform 
the maximum number of responses on either 
part, while 11 subjects made the maximum 
number of responses on Part II after making 
less than the maximum number of responses 
on Part I. This difference in the number of 
subjects in the HD and LD groups who did 
not complete Part I but who did complete 
Part II is significant beyond the .01 level 

2 = 5.68). 

Thus three of the four predictions were con- 
firmed. The more socially deprived subjects 
were found to spend a greater amount of time 
on the game, to more frequently make the 
maximum number of responses allowed by 
the game, and to evidence a greater increase 
in time spent on Part II over that spent on 
Part I of the game. The prediction that the 
more socially deprived subjects would make a 
lower proportion of errors was not confirmed, 
though the findings were in the predicted di- 
rection. 


Other Findings 


As noted earlier, the study also was directed 
toward an examination of the relationship of 
the performance measures to the two condi- 
tions of reinforcement employed. Performance 
under the two conditions of reinforcement is 
shown in Figure 2, in which the average 
amount of time spent on Parts I and II of the 
game is presented. As can be seen in Figure 2, 


MEAN TIME IN SECONDS 


PART 2 


Fic. 2. Mean time in seconds spent on Part 1 and 
Part 2 of the game by the S-NS and NS-S groups. 
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subjects spent more time on Part IT than on 
Part I under both conditions of reinforcement, 
The mean difference between time spent on 
Part I and Part II is significant for the NS-S 
group (161.1 seconds) beyond the .01 level 
(t = 4.35), while the mean difference for the 
S-NS group (114.2 seconds) is significant be- 
yond the .05 level (t = 2.25). The difference 
(46.9 seconds) between these differences is not 
significant (¢ < 1.00), indicating that the 
magnitude of increase from Part I to Part II 
did not differ under the NS-S and the S-NS 
conditions. 

The difference in the average time spent on 
Part I between the NS-S (300.3 seconds) and ~ 
the S-NS (404.2 seconds) groups was not 
significant (¢ = 1.29; p > .20). The difference 
in the average time spent on Part II between 
the NS-S (461.0 seconds) and the S-NS 
(518.1 seconds) groups also was not signifi- 
cant (¢ < 1.00). The difference in the average 
total time (Part I and Part II) spent on the 
game between the NS-S (761.3 seconds) and 
the S-NS (922.3 seconds) groups was not 
significant (t < 1.00). Therefore, it is con- 
cluded that there was no evidence to indicate 
the groups tested with a sequence of NS-S 
and S-NS differed significantly on time spent 
on Part I, time spent on Part II, increase in 
time spent on Part II compared to Part I, nor 
on the total time spent on the game. 


Relation of Social Deprivation to Other Variables 


The characteristics of the HD and LD 
groups were examined further to ascertain 
whether the two groups differed on variables 
other than the performance measures em- 
ployed in the predictions. The composition of 
the LD and HD groups is presented in Table 
3. An analysis was made of the differences be- 
tween the two groups in regard to CA, MA, 
IQ, length of institutionalization, and age 
upon institutionalization. The mean IQ of the 


TABLE 3 
COMPOSITION OF THE HicH AND Low 
SOCIALLY GROUPS 


Mean ea 
Mean | Mean | Mean | Number | “(in yr. 
10 Monte Upon Ad- 


Group | N | CA D 
(in yr.) | (in yr.) Institu- | mission 
tionalizes 
Beal | ee 
High | 29 | 10.3| 6.3 | 61.0 | 22.9 | 84 | 
Low |31 |10.9| 6.1 | 56.0 | 23.4 9.2 
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subjects in the HD group was significantly 
higher than that of the LD group (¢ = 2.10; 
p < .05). The #’s associated with the differ- 
ences in the other factors were all less than 
1.00, with the exception of that associated 
with age upon institutionalization, which was 
1.55 (p > .10). In light of the higher mean IQ 
of the HD group, a correlation was run be- 
tween the subject’s total time score and his 
IQ. This correlation was not significant (r = 
—.09; p > .20), indicating that while a rela- 
tionship exists between the time scores and 
the measure of social deprivation, there is no 
evidence of a relationship between time scores 
and IQ. 


Discussion 


The hypothesis advanced in this study was 
that interaction with an adult and adult ap- 
proval provide greater reinforcement for the 
responses of those feebleminded subjects who 
have experienced a greater amount of social 
deprivation than for those who have experi- 
enced a lesser amount of social deprivation. 
The results lend support to the hypothesis in 
that three of the four predictions derived from 
the hypothesis were confirmed. It may be con- 
cluded that these findings offer further sup- 
port for the view advanced in earlier studies 
(Stevenson & Zigler, 1957; Zigler et al., 1958) 
that the rigid behavior observed in feeble- 
minded subjects is probably a product of 
higher motivation to maintain interaction with 
an adult and to secure approval from him 
through compliance and persistence. The re- 
sults of the present study also offer evidence 
that the feebleminded subjects’ higher motiva- 
tion to interact with an adult is related to the 
greater social deprivation such subjects have 
experienced. Furthermore, individual differ- 
ences among the feebleminded in the frequency 
with which they exhibit persistive and/or com- 
pliant behavior also can be related to differ- 
ences in the amount of social deprivation 
experienced. Since the persistence and com- 
pliance exhibited by feebleminded subjects 
have been found to be related to social depri- 
vation, the prediction can be made that such 
persistence and compliance would also be 
shown by subjects of normal intelligence who 
have also experienced social deprivation. 

The experimental method may have ob- 
scured differences between the high and low 
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socially deprived groups in their tendency to 
make errors. The subjects were not informed 
of their errors and received the reinforcement 
inherent in the completion of even incorrect 
responses and continued to receive the experi- 
menter’s attention or support following errors. 
Another factor that may have obscured differ- 
ences between the two groups in their tend- 
ency to make errors was the significantly 
longer performance of the HD subjects, mak- 
ing the performance of these subjects more 
susceptible to fatigue and transfer effects. 


Discussion of Other Findings 


Another finding of interest is the similarity 
in the performance of subjects under both of 
the conditions of reinforcement employed. 
Both the subjects who played Part I of the 
game under the NS condition and Part II 
under the S condition and the subjects who 
played Part I under the S condition and 
Part II under the NS condition showed a 
significant increase in time spent on Part II 
over that spent on Part I of the game. This 
gives strength to the assumption that there 
would be an increase in time spent on Part II 
over Part I under the NS-S condition due to 
the greater reinforcing effect of support. It 
also strengthens the assumption that there 
would be an increase in time spent on Part IT 
over Part I under the S-NS condition due to 
the belief of the subjects that continued per- 
formance on Part II may result in a reinstate- 
ment of support. However, the two conditions 
of reinforcement produced no significant differ- 
ences in the increase of time spent on Part II 
of the game over that spent on Part I, on the 
time spent on Part I alone, on the time spent 
on Part II alone, or on the total time spent on 
the game. 

The assumptions underlying the expected 
increase from Part I to Part II of the game 
in the present study do not appear capable of 
explaining the finding by Zigler et al. (1958) 
that feebleminded subjects show an increase 
in Part II over Part I of the game when both 
parts are played under identical conditions of 
support. It appears that another process re- 
lated to the increase from Part I to Part II 
of the game may be operating. The assumption 
can be made that institutionalized feeble- 
minded subjects begin the game with a greater 
amount of ambivalence toward it and toward 
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the experimenter than do normal subjects. 
Sarason and Gladwin (1958) have discussed 
the possibility that one of the major effects of 
institutionalization is an “avoidance and fear 
of new problem-solving situations” (p. 243). 
This suggests that institutionalized feeble- 
minded subjects would be characterized by 
certain negative feelings toward the experi- 
mental situation. 

Assuming that institutionalized feeble- 
minded subjects have such negative feelings, 
one may advance the hypothesis that such 
subjects begin playing Part I of the game with 
a relatively high positive tendency to re- 
spond due to a greater motivation to interact 
with and to gain the approval of an adult 
which, in turn, stems from the greater social 
deprivation they have experienced. At the 
same time, they begin playing Part I of the 
game with a negative tendency to respond due 
to a wariness or suspiciousness of adults which 
stems from the negative experiences, perhaps 
of a painful nature, which they have had at 
the hands of adults. If Part I is given under 
an S condition, it is likely that the negative 
tendencies would decrease. The deprived child 
learns during Part I that the experimenter is 
not like other strange adults he encounters in 
the institution who often initiate painful 
experiences (physical examinations, shots, etc.) 
with supportive comments. Then when the 
deprived child is switched to Part II of the 
game, whether Part II is given under S or 
NS, he meets it with a positive tendency which 
has been reduced much less than his negative 
tendency. The result would be that he per- 
forms longer on Part II than on Part I. 

This hypothesis may be employed to ex- 
plain the findings that normal children do not 
evidence an increase from Part I to Part II 
of the game (Zigler et al., 1958), since the as- 
sumption can be made that the negative tend- 
ency of normal children is much less than that 
of institutionalized feebleminded subjects. The 
finding that feebleminded subjects do not 
show an increase from Part I to Part II of the 
game when both parts of the game are given 
under an NS condition (Zigler et al., 1958) may 
be due to the inadequacy of the NS condition 
in decreasing the negative tendency. Studies 
investigating this hypothesized negative reac- 
tion tendency are now in progress. 

It may be concluded that this study offers 
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evidence that differences among subjects” 
the frequency with which they manifest 

tain behaviors which have often been ch 
terized as rigid (perseveration, stereo 
etc.) are related to differences in the a 
of social deprivation the subjects have ex 
enced, It supports the position advancı 
earlier studies (Stevenson & Zigler, 
Zigler et al., 1958) that constructs 
explain the performance of normal indi 
are adequate in explaining the performa 
feebleminded ones. This study also offers 
ther evidence of the danger in describi 
group as though its members uniformly 
sess a particular characteristic, e.g., rig 
without considering such factors as histo 
differences among subjects, variability in m 
tivation, the nature of the reinforcement pr 
vided, or the tasks employed. 


SUMMARY 


Sixty feebleminded children with MAs 
4 to 8 years played a simple, monoto 
game until satiated. The game consisted 
two highly similar parts. Following satiati 
on Part I of the game, the subjects were 
sented Part II and were again allowed to pla 
until satiated. Two conditions of reinf 
ment were introduced. Half of the subjects r 
ceived both verbal and nonverbal support 
the experimenter during Part II. The ot 
half of the subjects received only the expe 
menter’s attention during Part I and 
experimenter’s verbal and nonverbal suj 
during Part II. Two judges read the soc 
histories of the subjects and rated each subje 
on the degree of pre-institutional social dep 
rivation. 

The hypothesis was advanced that int 
tion with an adult and adult approval pi 
greater reinforcement for the responses of fe 
bleminded subjects who have experien 
greater amount of social deprivation than 
those who have experienced a lesser ami 
of social deprivation. Three of the four pred 
tions derived from this hypothesis were 
firmed. It was found that the more so 
deprived subjects spent a greater amounl 
time on the game, more frequently made! 
maximum number of responses allowed | 
the game, and, evidenced a greater increa 
time spent on Part II of the game over 
spent on Part I. The findings related to” 
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fourth prediction, that the more socially de- 
prived subjects would make a lower propor- 
tion of errors, were in the direction predicted, 
but failed to reach the .05 level of significance. 

These results offer further support for the 
view that the “rigid” behavior observed in 
feebleminded subjects may be attributed to a 
higher motivation to maintain interaction 
with an adult and to secure approval from 
him through compliance and persistence. 

The assumption that there would be an in- 
crease in time spent on Part II of the game 
over Part I under both conditions of reinforce- 
ment received support. However, the two 
conditions of reinforcement produced no sig- 
nificant differences in the increase of time 
spent on Part II over that spent on Part I, 
the time spent on Part I alone, the time 
spent on Part II alone, nor the total time 
spent on the game. The further assumption 
that institutionalized feebleminded subjects 
come to the experimental situation with both 
a positive and a negative tendency to respond 
was advanced to explain the increase in time 
spent on Part II compared to Part I. 
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CASE REPORT 


MOTIVATIONAL STRUCTURE OF AN HOSPITALIZED MENTAL DEFECTIVE 
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REUD, McDougall, Murray, and others, in 

attempting to isolate basic dimensions of 
motivation, did not have the benefit of the factor 
analytic methods which have enabled Cattell and 
his co-workers to substantiate the concepts of 
drive, ego structure, and sentiment. On the as- 
sumption that factor analysis of many scores ob- 
tained on objective attitude measuring devices 
would reveal a limited number of motivational 
components which could account for the multitude 
of phenomena observed in dynamic behavior, 
Cattell pioneered three researches: an R-technique 
study to isolate and identify the basic motivational 
determinants (Cattell, 1950), another R-technique 
study to confirm the findings of the first and to 
develop alternative objective measuring methods 
(Cattell & Miller, 1952), and a P-technique study 
to confirm further and to clarify the ideas elicited 
by the other two studies as well as to investigate 
the applicability of these ideas to an individual 
(Cattell & Cross, 1952). The present case study 
continues this line of research. 

The objective here was to discover the motiva- 
tional dimensions of an institutionalized mental 
defective by objective P-technique factor analysis 
and to ascertain whether the findings were con- 
gruent with clinical understanding of the case and 
ven earlier factor analytic results on nondefec- 

ives, 


METHOD 


The subject, a female patient at Pacific State 
Hospital in California, was a heavy-set 15-year-old 


subject’s father molested her sexually, her 

killed him with the subject a witness to the killing. The 
subject was described by the hospital staff as quiet, 
somewhat withdrawn, and undemonstrative, but usually 


1 Presently at System Development Corporation, 
Paramus, New Jersey. 


attitudes, were designed to measure attitudes in an 
objective manner by means of ratings, questions about 
activities, word association, and paired comparisons, 
Each previously known dynamic factor in the literature 
was represented by at least two marker variables, ise., 
variables which were strongly loaded in the previously 
known factor were carried over in some form and used 
in this investigation. 

To distinguish each type of evaluative variable from — 
all other variables in the factor matrix, a letter prefix 
was assigned as follows: RT, ratings by school teacher; 
RW, ratings by ward technicians; RS, self-ratings; 
QA, questions regarding activities; WA, word associa- 
tion; and PC, paired comparisons. 

The subject was rated by herself and two others on 
14 traits. The best possible score was 10; the worst 
possible was 0, Of the 42 trait ratings, only 9 proved 
variant enough to be retained for analysis. 

Most of the original 22 questions regarding the sub- 
ject’s daily activities required only a “yes” or “no” 
answer. A few, such as the question, “How many 
cigarettes have you smoked?” required a quantitative 
answer. Of the original group of 22, 9 questions were 
retained for analysis, the remainder being eliminated 
because they displayed little or no variance. 

‘Thirty-four word association items were presented 
to the subject in the format: 


The second and third words of the series were assumed 
to represent one of several hypothesized drives and 
sentiments. From these two words, the subject was to 
choose the one which she first associated with the first 
word of the series, It was assumed that the choice of onë 
word over another would be determined by her at- , 
titudes. The subject’s choices were then pooled to 
obtain a score for each of the hypothesized drives and 
sentiments. Since the sequence of items was j 
from day to day, there was no chance for the subject to 
mechanize her responses. 

Paired comparisons consisted of having the subject — 
indicate which of a pair of statements, each printed on 
a separate slip of paper, most strongly represented her 
present desires. Sixteen statements were constructed 
so as to define each attitude by inclusion of its partsi 
the stimulus situation, an organism, a need, a course of j 
action, and an object concerned in that course of action: 
One of the statements was, “I want a good looking 
boy to make love to me.” Since each statement was paired ñ 
against every other statement, the subject made 120 
choices each day for 100 days. Again the sequence 
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acceptable fashion in one’s activities. It is thought 
of as a sentiment rather than the drive of self- 
assertion (Cattell, 1957) because its pattern more 
closely fits that of the previously defined senti- 
ment (Cattell, 1950; Cattell & Cross, 1952; 
Cattell & Miller, 1952). This sentiment is some- 
what analogous to what Horney (1950) means in 
speaking of the “tyranny of the should.” 


Factor [V—Gregarious Erg 


Loading 
WA 42 Self-sentiment (word association) 52 
WA 36 Gregarious drive (word association) 49 
WA 41 Narcissistic-play drive (word associa- 49 
tion) 
PC 12 I want to take an active part in recre- .37 
ation 
PC 13 I want to watch games like baseball -36 
PC 11 I want to go to a good movie every pa 
week 


“Man is a social animal...” is here again 
restated in factor analytic terms. A drive to be 
part of the crowd is manifested by the desire for 
participation in essentially social activities such as 
baseball and movies. The gregarious erg is not as 
clear here as in previous research. It seems here 
to be mixed in with goodly helpings of self-regard 
and self-love or desire for self-enhancement. 
Variables WA 36, PC 12, and PC 13 are markers 
from previous research, This factor demonstrates 
superb simple structure and has no significantly 
high correlations with any other factors. Perhaps 
further research will be able to rotate the con- 
fusing Variables WA 42 and WA 41 out of the 
picture. 


Factor V—Appeal-Dependence Erg 


Loading 
WA 39 Lack of self-assertion (word associa- -62 
tion) 
WA 38 Lack of escape-fear (word associa-  .54 
tion) 
RS9 Poor state of health 39 
PC 14 I want to feel close to God +35 
PC 15 I want to have the advice of my par- 34 
ents 
WA 37 Appeal-dependence drive (word asso- 32 
ciation) 


The strong appearance here of Variables PC 14, 
PC 15, and WA 37 leads to the conclusion that 
this factor represents the dependency drive. It is 
the “Mama’s boy,” the homesick child at camp, 
and the confused neurotic’s plea for help. A lack 
of self-assertion and escape-fear is entirely con- 
sistent with this interpretation, as is a poor state 
of health. Picture the person who has been “down 
on his luck” for a relatively long period of time. 


Case REPORT 


Factor VI—Menstruation-Depression 
Loading 
PC 10 I don’ want a good-looking boy to .43 
make love to me 


PC 23 I want time to enjoy my own com- AL 
pany 

RS9 Poor state of health .29 

QA 34 Are you menstruating? (Yes) .25 


A more solid, impressive display of the efficacy 
of factor analysis could hardly be found outside 
the textbooks of Thurstone and Burt. It naturally 
follows that a girl who is menstruating considers 
herself in a poor state of health, wants to be alone, 
and especially wants no amorous entanglements 
at this delicate time. This type of factor has not 
appeared before in factor analytic research. 


Factor VII—Self-Assertion 


Loading 

PC 18 I want to earn more money 52 

PC 15 Idon’t want to have the advice of my 48 
parents 

QA 27 Did you get a letter from home yes- 40 
terday? 

WA 40 Parental-protective drive (word asso- 38 
ciation) 

QA 28 Did you write a letter home yester- 28 
day? 

PC 23 I don’t want time to enjoy my own .27 
company 

RT2 Reduced quantity of work .26 

RS7 Loneliness .25 

PC 19 I want to succeed in my life’s work <25 


PC 22 I don’t want to spend more time on =. 24 
smoking or drinking 


RS 6 Happiness 24 
WA 39 Self-assertion drive (word associa- 07 
tion) 


This factor has the poorest simple structure in 
the rotated reference vector matrix. The variance 
of the factor is spread thin over many seemingly 
unrelated variables. The presence of Variables 
PC 18, PC 15, PC 19, and PC 22 lead strongly to 
the interpretation that this represents a motiva- 
tional component of self-assertion even though the 
factor is generally muddy, Perhaps further re- 
search will provide the answer to the unsatis- 
factory pattern found here. 

Factor VIII—Sub-Factor: “Good Time Yesterday” 

Loading 

QA 29 Did you spend any money yesterday?  - 54 

QA 31 Did you read any books on the ward -49 
yesterday? (Yes) 

QA 32 Did you eat between meals yester- -46 
day? (Yes) 

QA 33 Did you play any games on the ward 46 
yesterday? (Yes) 

PC 22 I want to spend more time on smok- -26 
ing and drinking 

PC 15 I don’t want to have the advice of my 22 
parents 


Case REPORT 


This factor correlates highly with Factor IX, 
Narcissism (.51), and has no real substance in it- 
self. Therefore, it is interpreted as & subfactor of 
Factor IX. This interpretation means that in an- 
other experiment, with possibly a larger number of 
measurements being taken, this “factor” would 
resolve itself into Factor IX. 


Factor IX—Narcissism Erg 
Loading 
QA 31 Did you read any books on the ward -68 
yesterday? (Yes) 
QA 33 Did you play any games on the ward .58 
yesterday? (Yes) 
PC 22 I want to spend more time on smok- .38 
ing or drinking 
QA 27 Did you get a letter from home yes- .32 
terday? (No) 
RS8 Low amount of worry .32 
PC 23 I want time to enjoy my own com- -28 


pany 

QA 26 Did you dream about your boyfriend .28 
last night? (Yes) 

WA 41 Narcissistic-play drive (word asso- 14 
ciation) 


As previously mentioned, most of the varia- 
bles of Subfactor VII are included here as a sub- 
set. This component is interpreted as an erg of 
narcissism, a self-love or desire to make oneself feel 
better, etc. In Freudian terms, it represents one of 
the paths along which the self-worshipping libidinal 
energy may flow. It compels its possessor to have 
Interest investments in sex, drinking, and other 
“fun” activities. In the realm of clinical diagnosis, 
the clinician would suspect conflict if this factor 
and Factor III, the self-regard sentiment, were 
simultaneously running at relatively high levels in 
the same person. 


Factor X 
Loading 

RS7 Low amount of loneliness -45 
PC 22 I want to spend more time on smok- -36 

ing or drinking 
RS 6 Unhappiness 34 
RS8 I don’t want to succeed in my life’s 32 

work 
PC 14 Low amount of worry -31 
PC 19 I don’t want to feel close to God -30 
PC 20 I want to help people in trouble -30 


There is seemingly no plausible interpretation 
which can be made of Factor X. It is reproduced 
here as much as an indication of the writers’ be- 
wilderment as for the reader’s amusement. An 
awkward, tentative hypothesis is that this is a 
Component somehow connected with philanthropy, 
a sentiment possessed by one with much money 
(no worry) and many friends (no loneliness) but 
with an urge to help others. Variables RS 8 and 
PC 19 may indicate guilt feelings. 
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Factor XI—Repressed Sex Component 
Loading 
WA 35 Sex-mating (word association) 39 


WA 40 Parental-protective drive (word as- .35 
sociation) (negative) 
PC 23 I want time to enjoy my own com- 34 


pany 

QA 26 Did you dream about your boyfriend cot 
last night? (Yes) 

QA 27 Did you get a letter from home yes- .31 
terday? (No) 


This is the last factor to be retained from the 
factor matrix. As such, the variance which it 
accounts for is necessarily lower than that ac- 
counted for by the other 10 factors, It is here 
interpreted as a component of repressed sex energy. 
The clash of feelings for one’s parents and feelings 
for oneself comes into sharp focus in this pattern. 
The rise of this factor in intensity from day to day 
could be a signal of dammed-up sex energy look- 
ing for release, The sense of this factor, even though 
it correlates .37 with the Sex Erg, is not one of a 
constant flow of sexual vitality. Rather, it sug- 
gests the accumulation of sex vitality which has 
not yet gained expression. 


SUMMARY 


Objective tests whose motif was structured after 
that of devices employed by previous researchers 
were administered to a maladjusted mental 
defective who was a patient in a mental institution. 
The scores on the tests were correlated and factor 
analyzed, On interpretations of the factors after 
appropriate rotation to oblique positions, the 
factors so discovered appeared to be essentially 
congruent with previous research. Further, the 
factor pattern that emerged was capable of being 
interpreted as the motivational structure of the 
subject. 

The possibility that not all ergs (drives) and 
metanergs (sentiments) were found does not 
detract from the basic fact that with suitable 
instruments one can measure and reveal the 
fundamental dynamic structures of a malad- 
justed mental defective. The implications of this 
for clinical practice are promising. Further con- 
firmatory research is needed to furnish a valid, 
reliable instrument which can be successfully and 
economically used as both a diagnostic and a 
predictive tool. The hope that such a tool can be 
produced depends on the degree to which clini- 
cians and psychometrists can work together. 
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CRITIQUE AND NOTES 


GSR CONDITIONIN G, ANXIETY, AND EXTRAVERSION 


WESLEY C. BECKER anp HALSEY H. MATTESON 
University of Illinois 


‘sING basic notions derived at least in part 

from different aspects of Hullian theory, 
Eysenck and Spence have developed two different 
but not necessarily mutually exclusive hypotheses 
concerning the relationship of personality char- 
acteristics to conditioning. Eysenck (1955) has 
proposed that individuals in whom reactive in- 
hibition develops rapidly and dissipates slowly 
tend to develop extraverted behavior patterns. 
Since reactive inhibition interferes with the repe- 
tition of a conditioned response, Eysenck has pre- 
dicted that extraverts will condition less readily 
that introverts. On the other hand, Spence (1958) 
has hypothesized that the higher emotional re- 
activity of the anxious person acts like other 
drives in increasing reaction potential during 
conditioning; therefore, a positive relationship can 
be predicted between the degree of manifest 
anxiety and level of conditioning. 

Studies from Eysenck’s laboratory have sup- 
ported Eysenck, whereas studies from Spence’s 
laboratory have supported Spence. Franks (1956) 
found that dysthymics (introverted neurotics) con- 
ditioned at a higher level than hysterics (extra- 
verted neurotics), using the GSR and eyeblink 
conditioning simultaneously. In a second study, 
using normal subjects, Franks (1957) found a 
significant relationship between the Extraversion 
(E) scale of the Maudsley Personality Inventory 
and the level of eyeblink conditioning. No signifi- 
cant relationship was found between the Neuroti- 
cism (N) scale of the Maudsley Personality Inven- 
tory and conditioning. Since N is highly correlated 
(approximately .90) with the Taylor Manifest 
Anxiety scale (MA) used by Spence, Franks 
argued that his data gave no support to Spence’s 
theory. 

At the Iowa laboratory, Farber, Spence, and 
Bechtoldt (1957) found eyeblink conditioning 
Positively related to the MA but not to Guilford’s 
Rhathymia (R) scale. R is highly correlated with 
E and has also been used by Eysenck as a measure 
of extraversion. Several additional studies showing 
Significant relationships between eyeblink condi- 
tioning and manifest anxiety were summarized by 
Spence (1958). 

The impetus for the present study came from a 
recent failure by Becker (1959) to find support for 
either Eysenck’s or Spence’s hypothesis when 
GSR conditioning was used on unselected male and 


female college students. In this study, Becker 
found excellent evidence that the measures were 
reliable and reflected a conditioning process, but 
no relationships were found to questionnaire 
measures of introversion or anxiety. While both 
Eysenck? and Spence (1958) have been unhappy 
with attempts to test their hypotheses using GSR 
conditioning, each of them can point to one study 
with the GSR which suggests support for their 
theories. Franks’ (1956) simultaneous use of GSR 
and eyeblink conditioning has already been 
mentioned as supporting Eysenck’s hypothesis. 
In his recent review, Spence (1958) reported that 
subjects selected on the basis of greater GSR and 
heart-rate changes to noxious stimuli showed more 
eyeblink conditioning. These results suggested a 
re-examination of the relationship of anxiety and 
extraversion to GSR conditioning, using extreme 
anxiety and extraversion groups (to compensate 
for unreliability) and all male subjects (to rule 
out complications from suspected sex differences), 


METHOD 


Guilford’s R scale and Cattell’s Anxiety (A) scale 
were given to 273 male college students (nonvolunteer) 
enrolled in introductory psychology. Eysenck previ- 
ously used R as a criterion of extraversion, and A is 
known to be highly correlated? with other measures of 
anxiety. Four groups of 10 subjects with extreme 
scores on both A and R were selected for conditioning. 
The cut-off scores for these four extreme groups were 
determined from a scatter plot so that the 10 subjects 
in each group would deviate from the total means for 
A and R by approximately the same amount. 

A Hunter Model 300 GSR amplifier with visual 
reading was used to measure basal resistance and re- 
sistance changes as a result of stimulation. A wire 
wound potentiometer was used to control shock level 
and a Pentron tape recorder for presentation of stimuli. 

After acquainting the subject with the operation of 
the apparatus, finger electrodes were attached to the 
index and third fingers of the left hand, and shock 
electrodes were attached to the tender side of the right 
arm just above the elbow. All contact areas were cleaned 
with alcohol and rubbed with electrode jelly. Care was 
taken to avoid circulation blockage in attaching 


1 Personal communication. 

2 With 80 college males, Becker (1959) has found 
correlations of .79 with Eysenck’s N, and .79 and .83 
with Guilford’s C and D, respectively, All three of 
these measures correlate about .85 with the Taylor 
MA (Farber et al., 1957). 
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electrodes. The GSR was calibrated and the maximum 
shock level the subject would tolerate was determined. 
Then, to allow time for adaptation to the situation, 
the experimenter played a card identification game 
with the subject. All the above took 10 minutes. After 
being given a catalogue of courses to look over, the 
subject was instructed as follows: “I am going to play 
a tape with random words on it. You do not need to 
pay attention to the words, so you can just read along.” 
A list of 104 words spaced by 5-second intervals was 
then played twice. The word “repeat” occurred 20 
times on this list and was followed immediately by a 
1-second shock on 12 of the trials. Eight of the repeat 
trials were considered conditioning test trials. The 
word crafty preceded repeat 12 times, thus permitting 
a measure of temporal generalization, and the word 
receipt occurred 7 times randomly in the list, thus 
permitting a measure of generalization on the basis of 
sound similarity. Finally the word constant appeared 11 
times in the list, but never in a space just preceding 
the shocked word. This served asa control on the effect 
of frequency of word presentation. 

Shock was administered by a hand switch. When 
the GSR to the shock given by any subject decreased to 
a noticeable degree over trials, the shock level was 
increased. This was a crude attempt to maintain equiva- 
lence of the UCS for all subjects and to compensate for 
such factors as adaptation to shock, loosening of the 
shock electrodes, and the like. Basal resistance was 
adjusted before each word when necessary to center 
the meter. Recordings of basal were made at the 
beginning, in the middle, and at the end of both series 
of trials. The maximum decrease in resistance to each 
stimulus word was recorded. 

The problem of determining appropriate measures 
of conditioning is a critical one. Two scores were used. 
The first is simply the mean conditioned response 
amplitude. The second is a criterion-type score based on 
the number of trials on which the conditioned response 
was one-half or greater than the preceding uncondi- 
tioned response. Conditioned response amplitude is 
clearly a measure of reaction potential (sx) and thus 
is appropriate in testing Spence’s theory. Hull (1943) 
hypothesized that GSR amplitude is a direct linear func- 
tion of sEr. The criterion measure, on the other hand, 
eliminates sources of variation in the conditioning score 
which could not be logically a function of reactive 
inhibition. The greatest such source might be called 
general autonomic reactivity, which should be related 
to Hull’s D but not Zz. In a previous study Becker 
(1959) found thet CR amplitude correlated .67 and 78 
with UR amplitude in two different testing sessions. 
These correlations and similar findings by others of a 
general autonomic responsiveness factor indicate a need 
to control for such variance in the conditioning measure 
when testing Eysenck’s hypothesis. Becker (1959) 
found essentially zero correlations between the criterion 
conditioning score and UR amplitude, indicating that 
this source of variation is successfully controlled by the 
criterion score. The criterion measure also controls to 
some degree for adaptation effects, basal resistance 
artifact, and differential subject shock levels to the 
degree that these factors affect the CR and UR equally. 
Logically, the criterion conditioning measure makes 


sense in that conditioning is considered stronger the 
closer CR approximates UR. 

Responses to ‘generalization stimuli provide still 
another kind of measure of conditioning. However, there 
are many theoretical ambiguities concerning the ef- 
fects of both reactive inhibition and drive level on 
generalization so that clear-cut predictions could not be 
made. The generalization data will be examined solely 
for their empirical interest. 


RESULTS 


In the larger sample of 273 subjects, A and R 
were found to correlate —.13. For the 40 subjects 
in the conditioning sample, the correlation between 
A and R was —.06. Split-half reliabilities for A 
and R, respectively, were found to be .76 and .83 
on 273 students. Thus it is unlikely that differences 
in magnitude of relationship of the conditioning 
measures to A and R can be attributed to unre- 
liability in the personality measures. 

The odd-even reliability of the criterion condi- 
tioning score was .68 and that for the CR ampli- 
tude score .93, Using similar procedures, Becker 
(1959) found for the criterion conditioning score a 
split-half reliability of .80 and a between-sessions 
reliability of .71. For the CR amplitude score, 
Becker (1959) found a split-half reliability of .84 
and a between-sessions reliability of .63. 

A critical problem facing the experimenter using 
the GSR in a conditioning study is to demonstrate 
that conditioning has actually taken place. Since 
GSR measures tend to show adaptation effects over 
time, the usual learning curve cannot be used as 
evidence. However, it is possible to compare the 
subjects’ responses to the conditioned stimulus and 
the generalization stimuli with their responses to 
neutral stimuli. Table 1 presents a comparison for 
both the amplitude scores and a criterion-type 
score based on a ratio of magnitudes rather than 
frequency. This latter measure correlates over .90 
with the criterion measure used in the main 
analysis, but it illustrates more clearly the condi- 
tioning effect. From Table 1 it is apparent that 
significant learning effects are reflected in the 
conditioning and generalization measures. 

The summary of mean amplitude scores by 
personality groups is presented in Table 2. Since 
the mean basal resistance measure correlated 35 
with the conditioned response amplitude measure 
and there were group differences in basal resistance; 
it was considered necessary to correct for the 
differences in basal resistance by the use of covar= 
ance analysis. As would be expected, the high 
anxiety (HA) subjects tended to have lower basal 
resistances. This lower basal level in the HA groups 
would tend to obscure the predicted positive 
relationship between A and GSR amplitude. Table 
3 presents the group means after covariance ad- 
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TABLE 1 
EVIDENCE FOR CONDITIONING 
2 
Ratio of 
Indicated 
Amplitude | Stimulus to 
Scores Uncondi- 
Stimulus tioned 
Stimulus 
Mu o M o 
Unconditioned Stimulus 14.8 | 7.3 
Conditioned Stimulus 6.7*| 4.3 | .45*| .19 
Generalization Stimuli 3.8*| 2.3 | .26*| 15 
Control Word “Constant” 2.4 | 1.8] .16 | .16 
All Other Words 2.0 | 1.5 | .14 | .09 


* Significantly different from control words beyond the .001 
level (#’s ranged from 7.5 to 11.3). 


TABLE 2 
MEAN AMPLITUDE Scores UNADJUSTED 
Low R High R Row M 
HA 8.1 8.1 8.10 
LA 5.3 5.1 5.35 
Column M 6.7 6.7 
TABLE 3 


Mean CR AMPLITUDE SCORES, COVARIANCE 
ADJUSTED FOR BASAL RESISTANCE 


DIFFERENCES 
Low R High R Row M 
HA 8.8 8.3 8.55 
LA 5:3 4.4 4.85 


Column M 7.05 


os 
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TABLE 4 
MEAN CRITERION CONDITIONING SCORES 


Low R High R Row M 
HA 8.7 8.8 8.8 
LA 7.0 6.3 6.7 
Column 7.9 7.6 


justment for basal differences. The analysis of 
Covariance gave an F ratio of 8.44, significant at 
the .01 level, for the difference in GSR amplitude 
between the HA and LA groups. No other effect 
approached significance. These findings support 
Spence’s theory. 

Group means for the analysis of the criterion 
Conditioning scores are presented in Table 4. 
The only effect to approach statistical significance 
was the difference between HA and LA groups 


(F = 2.82, p < .10). The findings give no support 
for Eysenck’s theory. The generalization scores 
showed no significant differences between groups. 


DISCUSSION 


The results fail to support Eysenck’s theory 
while giving support to Spence’s notions. Before 
turning to the implications of these results, brief 
mention should be made of a problem of control 
typical of most shock studies. Lacking a reliable 
measure of shock current through the subject, an 
attempt was made to equalize the uncontrolled 
factors of skin resistance, adequacy of contact, 
and the like, by requiring each subject to receive 
the maximum shock level he could tolerate. While 
systematic study demands that better methods of 
control of shock level be developed, it is possible 
to assess the possible effects of lack of such control 
on the present findings by examining the mean 
shock levels by groups and the correlations between 
shock level and conditioning scores. Examination 
of group means reveals no significant differences 
between high and low groups. Nevertheless, the 
mean shock level for the high R groups is still 
considerably higher than that for the low R 
groups (246 volts versus 218 with o of 47 and 66, 
respectively). If shock level were positively cor- 
related with the criterion conditioning score used 
to test Eysenck’s hypothesis, then this difference 
in shock level could have acted to suppress a real 
relationship between GSR conditioning and R. 
However, the correlation between shock level and 
the criterion conditioning score was found to be 
—.11. The correlation between shock level and the 
amplitude conditioning score was —.29. The nega- 
tive relationships, though insignificant, may be 
related to the fact that shock level was increased if 
the GSR to shock decreased appreciably. The 
above findings support the conclusion that the 
method of determining shock level did not produce 
systematic biasing effects which would necessitate 
modification of the main findings. 

In considering the implications of this study for 
Eysenck’s inhibition theory, the reader should 
recognize the possibility that the GSR response 
mechanism functions according to completely 
different principles than a skeletal motor response, 
The eyeblink response serves as a protective avoid- 
ance response, while the GSR to shock is nonin- 
strumental and probably adaptively superfluous. 
The present findings may be interpreted as only 
raising a question about the applicability of Ey- 
senck’s theory to nonskeletal responses. However, 
this report is an addition to that growing list of 
failure by American psychologists to confirm 
Eysenck’s reactive inhibition theory of extraver- 
sion (Becker, 1959; Farber et al., 1957; Miller, 
1957; Ray, 1959; Rechtschaffen, 1958). 
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SUMMARY 


Hypotheses by Eysenck and Spence relating 
GSR conditioning to extraversion and anxiety, 
respectively, were examined. Four groups of 10 
subjects scoring on the extremes of anxiety and 
extraversion measures were selected from a group 
of 273 male introductory psychology students for 
conditioning. Using a conditioned response ampli- 
tude measure, which appeared to be the more 
appropriate measure for testing Spence’s hypothe- 
sis, a significant positive relationship was found 
between anxiety and conditioning. No significant 
relationship was found between extraversion and 
conditioning when either the conditioned response 
amplitude measure was used or (more appro- 
priately) when a criterion-type conditioning meas- 
ure was used. The results give clear support to 
Spence’s theory while failing to support that of 
Eysenck, 
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Ree research on the relationship between 
neurotic anxiety symptoms and persuasi- 
bility has yielded contradictory results. Some 
studies (Cervin, 1956, 1957b; Janis, 1954) indicate 
that low persuasibility is associated with high 
anxiety, others (Janis & Hovland, 1959) fail to 
confirm this finding. 

Other research (Hovland, Janis, & Kelley, 
1953, 1957) indicates that public commitment to 
an opinion either tends to freeze the subjects in 
their opinions and make them less susceptible to 
persuasive communication, or at least makes the 
subjects less susceptible to persuasive communica- 
tion than private commitment (Bennett, 1955). 
It is possible that commitment has something to 
do with instability of the relation between per- 
sonality factors and persuasibility. 

The present study is concerned with the rela- 
tionship of emotional responsiveness (Cervin, 
1957a) to persuasibility under conditions of public 
commitment of subjects to their opinions before 
the experiment, It is identical to the previous 
study (Cervin, 1957b) except that in the latter 
subjects did not commit themselves publicly to 
their opinions, whereas in the present one they 
did. Any difference in the results should therefore 
be attributable to the changed condition of 
commitment. 


PROCEDURE AND RESULTS 


Twenty-eight pairs of subjects were selected 
from among the students of the General Arts 
Course at the University of Toronto. Within each 
experimental pair subjects were equated on a 
number of variables, but differed on scores of 
emotional responsiveness (Cervin, 1957a). Twenty- 
two control pairs were equated also on emotional 
responsiveness. As in the previous study, all 
subjects filled out an opinionnaire on a number of 
topics (such as, “Undergraduates should not 
marry,” “Sexual deviates should be institution- 
alized for life,” etc.) of some interest to students. 
Religious and political topics were eliminated as 
too emotionally involving. In the experimental 
situation a topic was selected for each pair on 
Which subjects within the pair disagreed. Their 


opinions were announced by the experimenter 
and confirmed by the subjects. Each pair was 
then instructed to discuss the topic and arrive 
ata common statement of opinion (agree, disagree, 
compromise). After the discussion, subjects 
again recorded their opinions on the topic dis- 
cussed. If only one subject changed his opinion, 
this was called one-sided change or persuasion; 
if both changed, it was called two-sided change or 
compromise; if neither changed, i.e., subjects 
agreed to disagree, the case was classified as 
equivalent to compromise. As before, the analysis 
was made in terms of the number of pairs in 
each of these categories, both for experimental 
and control pairs. 

Persuasion. In order to test for relative fre- 
quency of one-sided persuasion as compared 
to compromises, the frequency of one-sided changes 
of opinion was compared with that of two-sided 
changes in experimental and in control pairs. 
The ratios of the number of one-sided to two-sided 
changes of opinion are 13/15 for experimental, 
and 10/12 for control pairs. These ratios are not 
significantly different. 

The conclusion therefore is that when opinions 
are held publicly, the correlation between the 
difference in the level of emotionality within pairs 
and the frequency of one-sided changes of opinion 
is zero, i.e., one-sided persuasion does not occur 
more frequently in pairs characterized by differ- 
ential emotionality than in those of equal emo- 
tionality. This result is contrary to the result for 
privately held opinions obtained before, but 
agrees with that of Janis and Hovland (1959). 

Persuasibility. Persuasibility was defined as 
responsiveness (in terms of change of opinion) to 
constant communication stimuli, and therefore 
was measured by the number of pairs with one- 
sided changes of opinion (persuasion) where 
communication stimuli were constant. In order 
to test for persuasibility of high as compared with 
low emotional subjects, the frequency of changes 
of opinion by high emotional subjects in experi- 
mental pairs that showed one-sided opinion 
switches was compared with that of low emotional 
subjects. This ratio is 10/3, i.e., high emotional 
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subjects changed their opinions in 10 pairs and 
low emotional subjects only in 3 pairs. The 
expectation for this ratio, based on the results of 
the previous experiment, is 3/10. The probability 
of obtaining 10 or more successes out of 13 in a 
binomial distribution with p = 3/13 = .23 and 
q &.77, ie., for (.23 + .77)%, is p < .001. The 
conclusion therefore is that when opinions are 
held publicly, the high emotional subjects tend to 
change their opinions more frequently than low 
emotional subjects, i.e., under public commitment 
persuasibility of high emotional subjects is higher 
than that of low emotional subjects. This result is 
again contrary to the result for privately held 
opinions (Cervin, 1957b). 

It may be that the model of persuasion proposed 
earlier (Cervin, 1957b) in its present form does 
not apply under conditions of publicly held 
opinions, or else, under these conditions, a drive 
reduction occurs in high emotional subjects 
which brings it down under the drive level of 
low emotional subjects. 
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ELLY (1955), in his personal construct theory, 

stresses the importance of each individual’s 
personal dimensions with which he construes 
events in his world. These dimensions, or personal 
constructs, are viewed as bipolar dichotomous 
abstractions along which the person ascribes 
meaning to his world. All of a person’s thinking, 
problem solving, and predictive behavior are 
directly related to the nature of his construct 
system, 

This emphasis on personally meaningful con- 
structs, or dimensions of thinking, may have 
important implications for psychological measure- 
ment. Kelly, of course, developed a technique of 
personality assessment, the Role Construct 
Repertory Test, that directly follows from his 
theory of behavior. In introducing his theory and 
technique to the clinical setting he says the 
following with regard to tests and testing: 

Whose yardstick does this test represent? 
When a psychologist attempts to understand a 
client, he is limited, of course, to those yardsticks 
which he has accumulated in his own repertory. 
Perhaps it is more accurate to say that he 
measures the client within his own set of 
coordinate axes. The so-called “objective” 
tests are designed exclusively along the lines 
of psychologists’ axes. The client is plotted 
with respect to these given axes and his own set 
of axes is ignored entirely (Kelly, 1955, p. 206). 
The semantic differential as developed by 

Osgood, Suci, and Tannenbaum (1957) involves 
assumptions quite similar to those of role construct 
technique. Bipolar construing dimensions—per- 
Sonal constructs in Kelly’s system and scales in 
the semantic differential—are put to quite similar 
use, But the techniques differ in one potentially 
very important area, Kelly emphasizes the 
Personal nature of the dimensions and therefore 
allows each subject to use his own constructs in 
the examination. With the semantic differential, 
however, all subjects construe the objects under 
question with the same set of scales—a set deter- 
mined by the examiner. 


1 This investigation was supported in part by a re- 
Search grant, MY 2244 (C1), from the National Ad- 
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The present study was addressed to the appli- 
cability of a personal construct approach within 
the conventional framework of the semantic 
differential. The problem was to demonstrate 
that using personally meaningful dimensions can 
enhance the semantic differential technique. It was 
predicted that the introduction of subject oriented 
personal constructs within the conventional 
semantic differential should yield concepts more 
saturated with meaning, without significant 
change in the total semantic structure. 


METHOD 


Subjects for this experiment included 16 students in 
an undergraduate psychology class. Each subject was 
given a list of seven bipolar adjective scales (Osgood 
et al., 1957) with demonstrated high factor loadings 
on the semantic differential “evaluative” factor. The 
subjects were asked to think about these seven dimen- 
sions and select the three that they considered most 
personally meaningful in thinking about people. Each 
subject was then presented with seven additional scales 
loaded heavily on the “potency” factor. Subjects were 
again asked to select the three most personally mean- 
ingful. The same procedure was followed with seven 
scales with loadings on the “activity” factor. Thus, 
from 21 scales each subject selected 9 that he felt he 
could meaningfully use in construing people. Further- 
more, these 9 scales were evenly distributed among 
the three major semantic differential factors. The 
nature of these factors can be determined by inspection 
of Table 1 where all scales are listed by factor. 

After selecting the scales, each subject followed the 
procedure of the semantic differential and applied all 
21 of the scales to each of the following seven concepts: 
big business leader, junior executive, foreman, un- 
skilled laborer, politician, labor racketeer, and com- 
munist. The ratings were completed in the conventional 
manner as described by Osgood. 

The data were analyzed via Osgood’s generalized 
distance formula, yielding a score (D) which is a 
measure of the degree to which concepts are similar or 
different in meaning, In terms of the formula, a D score 
matrix can be computed with measures of the distance 
in meaning of each concept from every other concept. 
A hypothetical concept located at the neutral point of 
every scale can serve as a norm of meaninglessness, or 
point of origin, and D scores can be computed indi- 
cating the distance of each concept from such a mean- 
ingless concept. Concepts furthest from this point of 
origin are most saturated with meaning. In the D score 
matrix there is a direct relationship between the 
magnitude of the score and the degree of distance in 
meaning between any two concepts. 
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RESULTS 


Three D score matrices were computed with 
scale scores grouped by factors. One matrix was 
based on the factor scores for all 21 scales, the 
conventional approach to the semantic differen- 
tial. A second matrix was based on factor scores 
computed from the nine personal scales selected 
by the various subjects. The third matrix was 
computed from factor scores derived from the scale 
ratings on the three scales least often selected by 


TABLE 1 


FREQUENCY OF SELECTION OF SCALES AS AMONG 
Most MEANINGFUL 


F. 
Scales Selection (N. fa 16) 
Evaluative Factor 
Good-Bad 7 
Pleasant-Unpleasant 10 
Bitter—Sweet 4 
Nice-Awful 6 
Fragrant—Foul 0 
Honest—Dishonest 11 
Fair-Unfair 10 
Potency Factor 
Large-Small 11 
Hard-Soft 7 4 
Strong-Weak 11 
Deep-Shallow 8 
Heavy-Light 5 
Thick-Thin 2 
Rugged-Delicate 7 
Activity Factor 
Calm-Agitated 11 
Sharp-Dull 14 
Ferocious-Peaceful 0 
Hot-Cold 5 
Angular-Rounded 2 
Active-Passive 10 
Fast-Slow 6 
TABLE 2 


Distance Measures (D) COMPARING DISTANCES OF 
CONCEPTS FROM THE POINTS OF ORIGIN IN THREE 
D Score MATRICES 


Twenty- Personal t 
enty- Infrequent 


Concepts one 

Matrix Matrix Matrix 
Big Business Leader 1.91 2.47 95 
Junior Executive 1.67 2.37 72 
Foreman AA Va PYA 1 -63 
Unskilled Laborer 1.18, 1.27... 1.07 
Politician £.27.).1.29 98 
Labor Racketeer 2.06 2.58 1.63 
Communist 1.39 1.92 1.00 


CRITIQUE AND NOTES 


the 16 subjects for each factor. A list of the 21 
scales, together with the frequency tabulations, is 
presented in Table 1. 

Observation of the three D score matrices 
revealed marked differences in score magnitudes. 
Scores in the personal scale matrix were substan- 
tially larger. The D scores representing the 
distance of each concept from the point of origin, 
or the point of meaninglessness, in the semantic 
field are presented in Table 2. These scores reflect 
the degree of saturation. 

The Wilcoxon matched-pairs signed-rank test 
was applied to the scores reported in Table 2, 
The distances from the point of origin in the 
personally meaningful matrix were found to be 
significantly greater than the distances based on 
all 21 scales (p < .01). These later distances were 
in turn significantly greater than those based on 
the infrequently selected scales (p <.01). Signifi- 
cance levels were based on a one-tailed test. 

A question is raised as to whether the demon- 
strated increase in the meaningfulness of concepts 
resulting from the use of a limited number of 
personally meaningful dimensions occurs at the 
expense of “distortion,” or changes in the semantic 
field. Significant changes do not appear to have 
occurred, since models constructed from the three 
matrices have very similar shapes, although 
differing greatly in size. 


SUMMARY 


In accord with Kelly’s (1955) emphasis on the 
importance of personal meaningfulness to the 
subject in psychological measurement, the possi- 
bility of employing personal constructs within 
the semantic differential technique was investi- 
gated. A direct relationship was predicted between 
the personal meaningfulness of scales and the 
degree to which the concepts rated by the scales 
are saturated with meaning. The prediction was 
supported. Furthermore, increased saturation 
does not appear to occur at the expense of “dis- 
tortion” in the semantic field. 
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| Boe investigation of both humor preferences 
and humor interpretations has a long 
psychological history, but our ability to predict 
_ either of these behaviors is not very impressive 

as yet. The present study is concerned with the 
possible relationships between these two aspects 
of response to humor and a measure of 
incongruency. 

The degree to which a humorous stimulus is 
amusing to an individual has been found to be in 
part a function of the content of the humor relative 
to other measured aspects of the respondent’s 
behavior. For example, humor with hostile content 
is most appreciated by individuals who express 
aggressive sentiments on paper-and-pencil tests 
(Murray, 1934), who are aggressive in a hospital 
ward setting (Byrne, 1956), and who produce 
TAT stories with aggressive content (Grziwok & 
Scodel, 1956). It has been proposed frequently 
that conflict in a drive area may lead to either a 
positive or a negative response to related humor 
(Ghosh, 1939; Levine, 1956; Redlich, Levine, & 
Sohler, 1951). This suggests a curvilinear relation- 
ship between humor preference and conflict. 

a A second humor variable concerns the ability of 
_ individuals to interpret the point being made by 
' the humorist, regardless of their response to the 
funniness of the humor. Clinical evidence has 
been reported (Redlich et al., 1951) that failure 
to understand or distortion of the meaning of 
Cartoons is closely related to personality dynamics. 
It has been found (Byrne, 1956) that neuropsy- 
chiatric patients who express hostility in their 
everyday behavior are better able to recognize 
the hostile content of cartoons than patients who 
fail to express hostility. Senf, Huston, and Cohen 
(1956) found that schizophrenics had an impaired 
ability to interpret cartoons, though their study 
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did not include the relating of particular conflicts 
to specific misinterpretations. 

It seems reasonable to suppose that both the 
extent to which a humor stimulus is perceived as 
funny and the interpretation given to its content 
are influenced by the degree of conflict in the drive 
area represented by the stimulus. 

McReynolds (1956) has presented a theoretical 
approach to the problem of conflict. This con- 
ceptualization assumes that percepts representing 
one’s experiences tend to be organized or assimi- 
lated into one’s apperceptive mass, and that 
assimilation may be hindered if there are marked 
incongruencies among the conceptual schemata 
according to which the percepts would be assimi- 
lated. By incongruency is meant a state of contra- 
diction, disharmony, or inconsistency, Percepts 
which occur in areas of marked incongruency 
would therefore tend to remain unassimilated, 
and the conceptualization proposes that the 
greater the quantity of unassimilated material, 
the greater the feeling of anxiety. It holds further 
that individuals tend to avoid the perception of 
material which would be difficult to assimilate. 
From this it can be predicted that subjects having 
marked incongruency in a given drive area would 
tend to avoid percepts in this area, 

The present study investigated the relationships 
between (a) incongruency and humor preferences 
in the areas of sex and hostility, and (6) incon- 
gruency and humor interpretations in the same 
areas. With regard to the former, a curvilinear 
relationship was predicted; for the latter, it was 
hypothesized that the greater the incongruency in 
a given drive area, the greater is the tendency to 
interpret cartoons in such a manner as to avoid 
percepts in that area. 


Mernop 


Subjects. The subjects consisted of 33 male neuro- 
psychiatric patients at the Palo Alto Veterans Ad- 
ministration Hospital.* They were mostly closed ward 


3 Subjects were selected from patients who had 
taken the MMPI in such a manner that no member of 
the sample had an F score of over 70. This procedure 
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patients, and represented a considerable variety of 
diagnostic classifications. The median age was 36, with 
a range from 21 to 65. 

Incongruency measures. The general technique has 
been described elsewhere (McReynolds, 1958). In the 
present instance, the subject responded twice to each 
of 60 statements. Twenty of the items were sexual in 
content (e.g., making love, daydreaming about sex), 
20 items had hostile content (e.g., playing a joke on a 
bully, getting even with someone who has hurt me), 
and 20 were neutral or buffer items. First, the subject 
was asked to indicate his liking for each of the 60 
activities, presented in a dittoed list, by checking one 
of the following: Mainly Like, Mainly Dislike, Both 
Like and Dislike, and Neither Like Nor Dislike. Later 
in the testing session, the subject received the same 
items in a different order, typed individually on 
cards. This time the subject’s task was to sort the 
activities according to his personal moral judgment into 
the categories Mainly Good, Mainly Bad, Both Good 
and Bad, and Neither Good Nor Bad. For each item, 
the degree of incongruency was indicated by the 
divergence between the subject’s two ratings, For 
example, an item responded to as Mainly Like and 
Mainly Bad was scored 4 points incongruency; an 
item responded to as Mainly Like and Mainly Good 
received 0 points, etc. The scoring procedure was 
rationally determined in advance. For each subject, a 
sex incongruency score and an hostility incongruency 
Score were obtained by summing the separate incon- 
gruencies for the appropriate 20 items. 

Humor measures. The humor test is more fully 
described elsewhere (Byrne, 1957). It consists of 64 
dittoed cartoons of the kind which appear in popular 
magazines, so selected by five judges that there are 
16 cartoons in each of the following areas: hostility, 
sex, ridicule, and nonsense. In the present study, the 
cartoons were mounted individually on 6 X 8 inch 
cards, The subject was asked to sort them into four 
equal piles of 16 “Most Funny,” 16 “Next,” 16 “Next,” 
and 16 “Least Funny.” For each type of cartoon, a 
score of 3 was assigned for those placed in the most 
funny category, 2 for those in the next category, 1 for 
the next, and 0 for those rated least funny. This proce- 
dure yielded hostile, sexual, ridicule, and nonsense 
humor preference scores. The first two of these were of 
major interest because they corresponded directly to 
the incongruency measures; but, as will be seen pres- 
ently, the ridicule humor preference measure was also 
pertinent. 

Later in the testing session, the subject was given 
typed definitions of what was meant by sexual cartoons, 
hostile cartoons, cartoons which make fun of people 
(ridicule), and nonsense cartoons. He was asked to 


was followed because there is some reason to believe 
that persons with high F scores tend to answer per- 
sonality items (such as the incongruency test) in an 
invalid manner. It is possible that this restriction of 
the sample limits the generalizability of our findings; 
L. Ullmann, e.g., has found, in unpublished research, 
that persons with low F tend to be Internalizers rather 
than Externalizers. 

4 The items and instructions used in the incongruency 
tests, and the definitions presented to the subject for 
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TABLE 1 
PRODUCT-MOMENT CORRELATIONS BETWEEN 
INCONGRUENCY SCORES AND HUMOR 
INTERPRETATION SCORES 


(N = 33) 
Hostility Sex 
Humor Identification Incon- Incon- 
gruency gruency 
Correct Interpretations 
1) No. hostile cartoons called —.AL —.15 
hostile 
3 No. sex cartoons called sex 710 — 32 
3) No. ridicule cartoons called 41 27 
ridicule 
Total Interpretations 
4) Total called hostile —=.37 —.02 
5) Total called sex =.05 —.22 
6) Total called ridicule g 43 


Note.—One-tailed tests of significance are appropriate for cor- 
relations in Lines 1, 2, 4, and 5, for which directional predictions 
were made. For these a value of .296 is required for significance at 
the .05 level, and .409 at the .01 level. The two-tailed test is ap- 
Propriate for Lines 3 and 6, where a value of .349 is required for 
the .05 level, and .449 for the .01 level. 


sort all the cartoons into these four categories, putting 
as many as he wished into each group. From these 
sortings, two humor interpretation scores were obtained: 
the number of cartoons correctly assigned, i.e., the 
number of hostile cartoons categorized as hostile by 
the subject, the number of sex cartoons classified as 
sex, etc., and the total number of cartoons assigned to 
each drive area. : 

Procedure. Subjects were tested individually or in 
groups of two to four. The order of the tasks was as 
follows: incongruency (like-dislike), cartoons (humor 
preference), a sentence completion test (unrelated to 
the hypotheses reported here), incongruency (good- 
bad), and cartoons (interpretation). 


RESULTS AND Discussion 


Humor preferences. No significant relationships, 
curvilinear or linear, were found between incon- 
gruency and humor preferences. Scatterplots 
indicated clearly that the predicted curvilinear 
relationships did not exist, and this conclusion 
was supported by a test for linearity of regression 
(Guilford, 1956, p. 294). ho 

These results do not support the proposition 
that conflict is an essential variable in the predic- 
tion of humor appreciation. It is possible that the 
negative finding can be attributed to one or more 
aspects of the research design or the measuring 
instruments used. A more convincing possibility, 
at the moment, is that humor appreciation reflects 


the different types of cartoons have been deposited with 
the American Documentation Institute. Order Docu- 
ment No. 6693 from ADI Auxiliary Publications 
Project, Photoduplication Service, Library of Congress; 
Washington 25, D. C., remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks 
payable to: Chief, Photoduplication Service, Library 
of Congress, 
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" attitudes, beliefs, and other habits, but not inner 
conflict. Further research may help to clarify this 

issue. e 
Humor interpretations. As predicted, the subjects 
"appeared to interpret cartoon content in such a 
_ manner as to avoid percepts in areas of incon- 
_ gtuency. Thus, the correlation between hostile 
incongruency and number of hostile cartoons in- 
terpreted as hostile is —.41, and the analogous 
‘correlation for sex is —.32. That is, there was a 
tendency for subjects having marked incongruency 
in hostility to avoid seeing—or at least reporting— 
hostile content in those cartoons which actually 
represented hostility. The same was true, though 
toa lesser degree, for the sex area. In order to hold 
_ that the tendency to avoid seeing cartoons as 
hostile is actually due to incongruency in the 
ys area of hostility, however, it is necessary to have 
some evidence that this tendency is not also 
+ related ‘to sexual incongruency; an analogous 
Statement can be made for the sex area. In other 
words, in order to support the thesis that the 
subjects tended to avoid percepts in a given area 
_ because of incongruency in that area, it is neces- 
_ Sary not only to-show a significant negative rela- 
tionship between the number of cartoons selected 
and incongruency in the area, but also it is neces- 
‘sary that there be a low and insignificant relation- 
ship between the number of cartoons selected in 
that area and incongruency in other areas. As 
‘indicated in Table 1, the results tend to conform 
With these requirements. Thus, the correlation 
between number of hostile cartoons called hostile 
and sexual incongruency is.—.15, and the correla- 
tion between number of sex cartoons called sexual 

~ and hostile incongruency is —.10. 
The implication that the subjects tended to 
_ void percepts in areas of incongruency is further 
_ Supported by correlations between incongruency 
_ lM given areas and total number of cartoons se- 
lected for those areas. Thus, the correlation between 
hostile incongruency and total number of cartoons 
termed hostile is —.37, and the analogous correla- 
tion for sex is —.22. Further, the correlations 
between total number of cartoons selected in one 
area and incongruency in the other area are both 
low (—.02 and —.05), further supporting the idea 
the specificity of incongruency. 

Though no predictions were made concerning 
the relationships between the humor interpretation 
Variables and ridicule incongruency, pertinent 
Statistics were computed and are presented in 

Table 1. It may be noted that both measures of 
humor interpretation for the ridicule cartoons 

į ate correlated positively with both hostility and sex 
imcongruency, with the relationships considerably 
higher for hostility. In order to interpret these 
findings, it is necessary to note that the ridicule 
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cartoons can be considered as representative of 
indirect, disguised aggression, in distinction to the 
hostility cartoons, which represent direct, open 
aggression (Byrne, 1957). Thus, the results 
suggest that subjects with marked incongruency 
concerning hostility tend to interpret cartoons in 
such a manner as to obtain percepts of indirect 
aggression, Possibly such percepts tend to satisfy 
to some degree the aggressive impulses without at 
the same time being incongruent with feelings 
against aggression. The positive correlations 
between the humor interpretation scores for the 
ridicule cartoons and sexual incongruency suggest 
that the perception of indirect aggression serves 
to some degree to relieve sexual incongruency also. 


SUMMARY 


The present study was undertaken in order to 
test two hypotheses about the relationships 
between incongruency and responses to humor. 
Thirty-three male neuropsychiatric patients were 
given an incongruency test, a humor appreciation 
test, and a humor interpretation test. Each con- 
tained hostile and sexual items. The results suggest 
that (a) degree of appreciation of cartoons repre- 
senting a given drive area was not related to 
incongruency in that drive area, and (b) subjects 
tend to interpret the cartoons in such a manner as 
to avoid percepts in areas of marked incongruency. 
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HILE reconciliation between the older trait 

theories and the more recent situational 
theories of leadership is not yet complete, there is 
growing recognition (cf. Carter, 1953; Lippitt, 
Polansky, & Rosen, 1952; Rosen, 1959) that there 
are properties of people, valued by others, which 
are in fact “cross-situational” sources of power. 
That is to say, persons derive power from the 
possession of attributes that have utility for others 
in many different kinds of situations. 

For example, helpfulness in a person should be 
valued by others in a large variety of situations. 
Such an attribute, whether expressed at work, at 
play, or elsewhere, should lend its possessor 
esteem in relation to others, Ethical systems 
abound with descriptions of such properties, and 
encourage the development and manifestation of 
certain properties over others; yet systematic 
empirical knowledge regarding the identity of 
such properties is meager. Nor is there much 
scientific knowledge concerning the extent to which 
the layman perceives their existence. 

Granted that the objective sources of social 
power need not, in principle, bear a one-to-one 
relationship to the layman’s corresponding 
Phenomenological account; it is still appropriate 
in the earlier stages of theory development to 
consult the layman, for at least three reasons: 
First, ample evidence (cf. Lippitt, Polansky, & 
Rosen, 1952) exists indicating that perceptions of 
Social power and its sources applied to actual 
group situations have appreciable veridicality. 
Second, there is considerable evidence of a rela- 
tionship between perceptions and actions, vis-à-vis 
Social power and resources. Third, there is reason 
to believe that in this value-charged area the 
layman sometimes reveals more by his omissions 
than by his reporting. 

The Present note is an attempt to establish the 
identity of some cross-situational sources of power 


1 This study was conducted as part of a larger 
sacatch study under grant (M-450) from the United 
States National Institute of Mental Health. Principal 
Gator was Ronald Lippitt, Research Center for 

toup Dynamics, University of Michigan. 


as they are perceived by persons of differing 
backgrounds. It is first necessary to amplify the 
meaning of social power and its sources (Gold, 
1958; Levinger, 1959; Lippitt, Polansky, & Rosen, 
1952) as we understand them. Social power is de- 
fined as the potential ability to influence some 
other person(s). We conceive of such power as 
stemming from the possession of resources, i.e., of 
properties that have utility for the other person or 
persons. In principle, a resource may be a trait, a 
material possession, a fund of knowledge, or even 
“reflected glory.” The definition of resource is not 
restricted to properties having positive utility, but 
this distinction will not be considered here. More- 
over, this paper deals with perceived social power 
and perceived resources. 


METHOD 


A number of boys of grade-school age were first 
approached in a school setting and asked, by means of 
open-ended interviews, to name reasons why a person 
could get other persons to do things. Their responses 
led to construction of a provisional list of 25 statements 
dealing with sources of power. This list was then 
submitted to 23 high-school age boys attending a 
year-round camp, with instructions to check the items 
that seemed most relevant and to add any that might 
have been overlooked. The six most frequently checked 
items were then administered, in the course of individual 
interviews, to 59 other subjects ages 12-14, in a summer 
camp for underprivileged and somewhat emotionally 
disturbed boys. The instructions were as follows: 

As you know, some guys really know how to get 
other guys to do things. In order to get the other 
guys to listen to them and do what they want, they’ve 
got to know how to go about it. Now I'd like to show 
you some cards with different things written on 
them. I want you to tell me which of these things is 
most important for a guy to be in getting others to 
do what he wants.... Which is the next most 
important thing... 

‘The six items were: a guy who is fun to be with, 
a guy who is big and strong, a guy who knows how to 
do lots of things, a guy who plays fair, a guy who 
isn’t afraid of anybody else, a guy who is always willing 
to help out another guy. In order to facilitate discus- 
sion, these items will be designated, respectively, by 
the following terms: Sociability, Physical Strength, 
Expertness, Fairness, Fearlessness, and Helpfulness. 
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A few months later, the same six items were pre- 
sented in questionnaire form to 14 female school 
teachers enrolled in a graduate extension course. The 
instructions were similar, although the wording was 
less colloquial. 


RESULTS AND Discussion 


Before proceeding to consider the actual order 
in which the six items were ranked, let us first 
examine the degree of agreement among the 
different judges as to the relative importance of 
the six resource areas for getting other persons to 
do things. We computed Kendall’s coefficient of 
concordance, w, for the two sets of rankings 
(Kendall, 1948). The two coefficients showed 
that considerable agreement existed both among 
the 59 campers and among the 14 school teachers. 
These coefficients were .60 and .56, respectively, 
each being significant at beyond the .001 level. 

‘The mean ranking given by the male adolescents 
was in the following decreasing order of impor- 
tance: Helpfulness, Fairness, Sociability, Expert- 
ness, Fearlessness, and Physical Strength. From 
the viewpoint of the adult sample, the mean 
rank order of these items was the same, with the 
minor exception of Fairness being placed above 
Helpfulness. (Essentially the same results were 
obtained subsequently with a sample of 22 under- 
graduates.) 

The question arises as to whether the differential 
importance attributed to these resource areas 
does not reveal that some of them are more 
cross-situational than others. For example, the 
cluster represented by Fearlessness and Physical 
Strength would appear to have more circumscribed 
utility (at least in Western cultures) than the 
other four resource areas. 

It is reasonable to suppose, too, that the utility 
of Fearlessness and Physical Strength may be 
more limited in at least two ways: first, with 
respect to the variety of possible situations for 
which the properties might be relevant; and 
second, with respect to the frequency with which 
any given situations are likely to recur. Fearless- 
ness and Physical Strength presumably become 
relevant in situations of stress or emergency. 
Resembling Weber’s “charisma,” these properties 
lend their owner authority under extraordinary 
conditions (Weber, 1947). 

In order to test the inference that Fearlessness 
and Physical Strength were of less cross-situa- 
tional utility than the other four resources, the 
sample of teachers was asked to rank the relative 
importance of the six resource areas with respect 
to each of six hypothetical situations: (a) running 
a successful party for club members, (b) conduct- 
ing a successful meeting of co-workers, (c) raising 
children properly, (d) organizing a successful 
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fishing trip, (e) dissuading an angry crowd of men, 
and (f) the successful training of commandos, It 
was felt that the customary ranking assigned to 
the six resource areas with reference to the 
“general” situation would be seen as least applic- 
able in situations e and f. 

It was first established that there was consensus 
within each situation in ranking the relative 
importance of the six resource areas with regard 
to that situation (the six coefficients of concord- 
ance were all significant at beyond the .01 level). 
Spearman rho’s were then computed to compare 
the mean ranking for the general situation with the 
mean rankings of the resource areas for each of 
the specific situations. The correlations between 
the general ranking and those given for situations 
a through f, respectively, were: .60, .83, .94, .71, 
.09, and —.14. These correlations demonstrate 
that the customary ranking given to the six 
resource areas was applicable only to the first 
four situations. 

More specifically, Fearlessness was ranked as 
the most important source of power for both the 
commando training and the crowd dissuading 
situations, while Physical Strength was ranked 
3.5 and 4, respectively, for those two situations. 
These shifts indicated, as expected, that the 
relevance of Fearlessness and Physical Strength 
is increased in situations of stress. 

It is not immediately clear why the subjects 
did not assign a ranking of even greater importance 
to Physical Strength with regard to the two 
Stressful situations. One possibility is that the 
latter two situations, as depicted, were not suffi- 
ciently discriminable from the remaining situations 
to provide a more clear-cut test of the inference 
in question. Another possibility is that these 
stress-type situations were not appropriately 
geared to the characteristics of the present sample. 
Perhaps, if the life situations of these subjects had 
been closer in time (or had at least entailed ex- 
posure) to combat conditions, the subjects would 
have shown a greater readiness to acknowledge 
that Physical Strength could be a source of power 
(Berkum & Meeland, 1958). Still, another possi- 
bility is that strong pressures exist in our culture 
against acknowledging that any property capable 
of serving as a negative resource could be a source 
of power, 

It should be noted, however, that consensus 
among these subjects in ranking Physical Strength 
as least important in the general situation was 
overwhelming (12 out of 14 subjects ranked it 
sixth), and that this degree of consensus exceeded 
by far the degree of consensus obtained for any 
of the other five resource areas. 

Persons of different backgrounds thus tend to 
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agree about the relative importance of various 
sources of social power. In view of the fact that 
these results held up across two such widely 
differing samples, it is questionable whether the 
findings can be dismissed on the grounds of either 
= sampling unreliability or sampling bias. One way 
of explaining the rank order reported here is that 
the highest ranked resources seem to have the 

greatest cross-situational utility. This interpreta- 
` tion leads us to propose two general hypotheses for 
future testing: the larger the number of situations 
for which a given resource is appropriate, the 
greater the perceived importance of the resource 
as a source of social power; the more recurrent the 
kind of situation for which a given resource is 
appropriate, the greater the perceived importance 
of the resource as a source of social power. 
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RONBACH (1958) has recommended that com- 
plex “interpersonal perception” scores, such 
as measures of profile similarity, be analyzed into 
simpler components. This type of analysis would 
presumably lead to a more parsimonious interpre- 
tation of the results, and indicate the extent to 
which the “dyadic” components of these measures 
contribute useful information. By “dyadic” com- 
ponents we shall here refer to scores that involve 
the comparison of two profiles. “Assumed Simi- 
larity” (AS) scores, which have been effective 
predictors in previous studies of leadership (Fied- 
ler, 1958), quasi-therapeutic relations (Fiedler, 
Hutchins, & Dodge, 1959), and social relations 
(Fiedler, Warrington, & Blaisdell, 1952) are of 
this type. These AS scores are obtained by com- 
paring a person’s description of two others (or 
himself and another) by means of the D score 
(Cronbach & Gleser, 1953; Osgood & Suci, 1952). 
This paper briefly summarizes a reanalysis of 
certain data from a study relating AS to adjust- 
ment (Bass & Fiedler, 1959),? to determine em- 
pirically the extent to which dyadic or pattern 
scores contribute to prediction and interpretation 
of results. 


PROCEDURE 


The subjects were 200 members of small antiaircraft 
artillery units.‘ Each subject described himself, his 
“ideal self,” his “confidant” with whom he would dis- 
cuss his personal problems, and four members of his 
work group. Descriptions were made on scale sheets 
resembling the semantic differential. Each of these 
scale sheets contained 20 pairs of bipolar adjectives of an 
evaluative nature, such as: 


:— Unfriendly 
— Uncooperative 


Friendly —:—: 
Cooperative —:—: 
From these scale sheets two types of AS scores could 


1This study was supported by Research Grant 
M-1774, National Institute of Mental Health, United 
States Public Health Service. 

2 We wish to express our appreciation to S. Alexander 
for comments on the manuscript. 

* A mimeographed copy of the complete report may 
be obtained from the authors, Department of Psychol- 
ogy, University of Illinois, Urbana, Illinois. 

4 These data were originally obtained in a study by 
Fiedler, Hutchins, and Dodge (1959). 


be obtained. The first of these involves the similarity a 
man sees between himself and others in his group—i.e., 
AS by the subject /o others. The main AS score of this 
type is the ASg (mean AS between subject and group 
members). Each subject described himself and four 
members of his group; an average D score was then 
computed between the subject’s self-description and 
his description of each of the four group members, 
Each of these D scores is algebraically defined as 


follows: 
ASg = y X (Xip — Xu)’ 


Where ASg is the AS between subject and a particular 
group member, Xip is the score on Item 7 for the 
perceiver’s self-description, X;, is the score on Item ¢ 
for the perceiver’s description of the group member, and 
n is the number of items in the scale sheet description, 
In addition to ASg, two other D scores of this first type 
were computed. One was ASc, the AS between the 
subject’s self-description and his description of his 
confidant. The other was ASIc, the AS between the 
subject’s ideal-self-description and his description of his 
confidant. These AS scores are algebraically defined in a 
manner similar to ASg above. 

The second type of AS score involves the similarity 
that the subject’s fellow group members perceive be- 
tween themselves and the subject. The main score of 
this type was called ASs, the average AS between the 
group members’ own self-descriptions and their descrip- 
tions of the subject. As before, each of these D scores is 
algebraically defined as follows: 


ASS = Vv Sie ED ae 
i=l 


where ASs is AS to the subject by a particular group 
member, X;p is the score on Item i for the self-descrip- 
tion of the perceiver (in this case, subject’s fellow group 
member), Xj, is the score on Item i for the group 
member’s description of the subject, and » is again the 
number of items in the scale sheet description. In addi- 
tion to ASs, two other D scores of this second type were 
used in the present study. One of these was C’s ASs, or 
AS between confidant’s self-description and his descrip- 
tion of the subject; and the other was C’s ASIs, or AS 
between confidant’s ideal-self-description and his de- 
scription of the subject. 

AS is operationally defined by D scores. The D score 
can be algebraically analyzed into a number of simpler 
components by expanding the term under the radical, 
converting the raw scores to deviation scores and 
simplifying. This analysis indicates that: 
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= (My — Ma)? + Vi + Vz — 2 cove 
* 


where M; is the mean of the item scores in one of the two 
profile descriptions, Mz is the mean of the items in the 
second description; V; and V2 are the variances of the 
item scores within the first and seconds descriptions, 
respectively; and covyz is the covariance between the 
item scores in the two profile descriptions. The present 
analysis of AS scores utilized six components for each 
AS score: the two means, the two variances, the covari- 
ance, and the absolute difference between the means 
(the “elevation difference”). Four of these are “mo- 
nadic” components, involving only one of the two 
profile descriptions of the AS score, viz., the means (the 
“elevation components”), and the variances of the 
items within each of the two descriptions, The remain- 
ing components (the covariance and the elevation dif- 
ference) are the two “dyadic” components of the AS 
score, 

Five criterion measures indicative of the subject’s 
adjustment were obtained; a Taylor Manifest Anxiety 
(MA) scale score, a General Army Adjustment Index, 
a Self-Satisfaction measure (the self—ideal-self discrep- 
ancy), the subject’s sociometric acceptance by the 
group, and the crew leader’s rating of the subject’s ad- 
justment to military discipline. A previous investigation 
(Fiedler, Hutchins, & Dodge, 1959) had shown that 
individuals who perceived others in their group as 
similar, and those who were seen by fellow group mem- 
bers as similar (a) tended to be better adjusted than 
those who saw others, and were seen by others, as 
relatively less similar, and (b) tended to improve 
in adjustment. Although from a substantive point of 
view, æ was a relatively unimportant aspect of the 
original study, it is methodologically simpler for pur- 
poses of this paper. Hence, the present study reanalyzed 
the data relevant only to a, using components of the 
Several AS scores, 


RESULTS AND DISCUSSION 


Factor analyses of the self-description and 
description of confidant scales indicated that 16 of 
the 20 scale sheet items were highly loaded on an 
“evaluative,” first centroid factor. The present 
analyses of AS score components are based on 
these 16 items, 

Each of the various components was correlated 
With the adjustment criteria. Since there were six 
different AS scores and five different adjustment 
Criteria, 30 different sets of relationships between 

S score components and adjustment criteria 
Were involved, In 24 of these 30 relationships, one 
Or more of the AS score components correlated with 

he adjustment criteria as high as, or higher than, 
did the corresponding AS score, Of these com- 
Ponents, the profile means (or “esteem” scores) 
appeared to be the most promising as predictors. 
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TABLE 1 


CORRELATIONS BETWEEN DESCRIPTION MEANS OF 
THE AS SCORES AND THE ADJUSTMENT 
CRITERIA 


Adjustment Criteria 
M 
[ean Components FA Ary Salt Disci- | Socio- 
just-| Satis- ary | metric 
scale | ment | faction Ratings | Status 
Subjects Description 
of 
Self 270" a «21° | 13 
Ideal-Self -10 s . 14" 
Confidant 29%" | 13 27" | 00 
Group Members 37°" | 23" | 16% +00 
Confidant’s Descrip- 
tion of 
Self —.05 +05 09 21% 04 
Ideal-Self +00 07 05 |-.10 «06 
Subject —.09 04 02 132° | 358° 
Group Members’ De- 
scriptions of 
Self —-06 |—.06 06 05 02 
Subject 08 = |—.04 -02 2199 | 450% 


A Sy 
A Spurious correlations, based in part on same self- or ideal- 
self-descriptions, 
*p <05. 
"p< 01. 


In 24 of the 30 comparisons, the multiple correla- 
tions between the two description means and the 
criterion were as high as, or higher than, the corre- 
lation between the corresponding AS score and 
the criterion. 

More specifically, the subject’s self-description 
means (“self-esteem”) and the means of the 
descriptions of his group members were most im- 
portant for predicting adjustment criteria of the 
self-report type (MA scale, Army Adjustment, and 
Self-Satisfaction). The means of the group mem- 
bers’ descriptions of the subject were significantly 
related to adjustment criteria based on “ratings- 
by-others” (Sociometric Status, Discriplinary Rat- 
ings). The correlations between the elevation com- 
ponents (means) of the various AS scores and the 
five adjustment criteria are presented in Table 1. It 
appears, then, that individuals who describe both 
themselves and their fellow group members more 
favorably tend to appear as better adjusted on self- 
report measures. Subjects who are described more 
favorably by their fellow group members also tend 
to be seen as better adjusted on sociometric indices. 

To what extent do the remaining, more complex, 
nonlinear, and dyadic components of the AS score 
(the variances, covariance, and elevation differ- 
ence components) contribute to prediction of 
criterion variance beyond that already determined 
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TABLE 2 


CORRELATIONS BETWEEN VARIANCE COMPONENTS 
OF THE AS SCORES AND THE ADJUSTMENT 


CRITERIA 
Adjustment Criteria 
Variance 
Components Ma | Army | Self- | Disci- | Socio- 
scale | Adjust-| Satis- | plinary| metric 
ment | faction [Ratings | Status 
Subjects Descrip- 
tion of 
Self —.32**| —.14* “of —.12 | —.13 
Ideal-Self —.17* | —.06 —.06 | —.14* 
Confidant —.25**| —.06 | —.34**| —.16* | —.11 
Group Members —.26**) —.12 | —.36**| —.04 | —.14* 
Confidant’s Descrip- 
tion of 
Self 01 00 | —.11 | —.16 +00 
Ideal-Self —.02 | —.03 | —.06 «12 | —.13 
Subject +10 12 04 | —.09 | —.27%¢ 
Group Members’ 
Descriptions of 
Self —.12 +08 | —.08 +03 | —.14 
Subject —-20**| .10 | —.18* | —.18* | —.37%* 


a Spurious correlations, based in part on same self-descrip- 
tions. 
* p< .05, 
"p< 01, 


by the profile means? Multiple regressions were 
computed by successively adding to the two 
profile means the two variances, the covariance, 
and the elevation difference components, in that 
order, determining at each step the increment in 
criterion variance accounted for by the addition 
of that component. In only 5 of the 30 cases did 
the variance components add significantly to the 
description means in predicting adjustment meas- 
ures,’ While the profile variances tended to be 
related to the various criteria to about the same 


ë These particular results may reflect the fact that 
the elevation scores were entered into the multiple 
regressions before the variance components, If this 
procedure were reversed and the variance components 
were inserted in the multiple regression formula first, 
followed by the elevation components, we might then 
have concluded that the elevation components add little 
to criterion variance beyond that obtained with the 
variances. However, the means are obviously easier to 
interpret than are the variances or the more complex D 
score components. Therefore, our interest in this study 
was in determining whether the profile means alone are 
generally sufficient to account for about as much crite- 
tion variance as the D score can account for, and 
whether the addition of the more complex, nonlinear, 
and dyadic components of the D score to the profile 
means generally tends to increase significantly predic- 
tion of these adjustment measures. 
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extent as the profile means, the means and vari- 
ances based on the same profile tended to be 
substantially ‘intercorrelated. Therefore, the pro- 
file variances generally add little to criterion 
prediction beyond that accounted for by the 
means. Table 2 presents the correlations between 
the variance’ components of the AS scores and the 
various adjustment criteria, 

Tn four additional cases the elevation difference 
component added significantly to criterion pre- 
diction by the means and variances. The dyadic 
covariance component of the AS score tended to 
have the lowest correlations with the adjustment 
criteria, as well as the lowest regression weights in 
predicting the adjustment measures. 

These results support Cronbach’s (1958) con- 
tention that analysis of the D score into monadic 
components permits simpler and more meaningful 
interpretations of interpersonal perception scores 
than is possible with D. Such a research strategy 
should tend to yield conceptually clearer interpre- 
tations of findings obtained using these measures. 

It should be pointed out, however, that these 
conclusions are based on relationships between AS 
to specific other persons and personal adjustment 
criteria. Most previous work has dealt with the 
score ASo, the AS between the most and least 
preferred co-workers the subject has ever known. 
This score has yielded substantial correlations with 
group effectiveness criteria in a number of differ- 
ent studies (Fiedler, 1958), and the ASo score may 
well yield more powerful prediction than its com- 
ponents in relation to group effectiveness criteria 
(Hutchins, 1958; Peters, unpublished). 


REFERENCES 


Bass, A. R., & Fiepter, F. E. Interpersonal perception 
scores: A comparison of D scores and their compo- 
nents. Technical Report No. 5, 1959, Group Ef- 
fectiveness Research Laboratory, University of 
Illinois, United States’ Public Health Project 
M-1774, 

Cronsacu, L. J. Proposals leading to analytic treat- 
ment of social perception scores. In R. Tagiuri & 
L. Petrullo (Eds.), Person perception and inter- 
personal behavior. Stanford: Stanford Univer. 
Press, 1958, Pp. 353-379. 2 

Cronsacu, L. J., & Greser, Gorpine C. Assessing 
similarity between profiles. Psychol. Bull., 1953, 
50, 456-473. 

FIEDLER, F. E. Leader attitudes and group effectiveness. 
Urbana: Univer. Illinois Press, 1958. 

Fieve, F. E., Hurcnms, E. B., & Dopor, Joan S- 
Quasi-therapeutic relations in small college and 
military groups. Psychol. Monogr., 1959, 136, 
Whole No. 473). 


CRITIQUE AND NOTES 445 


FIEDLER, F. E., WARRINGTON. W. G., & BLAISDELL, F. 
J. Unconscious attitudes as correlates of socio- 
metric choice in a social group. J. abnorm. soc. 
Psychol., 1952, 47, 790-796. 

Hurcuins, E. B. Task-oriented and quasi-therapeutic 
role functions of the leader in small military groups. 


Unpublished doctoral dissertation, University of. 
Illinois, 1958. 

Oscoon, C. E., & Sucr, G. J. A measure of relation 
determined by both mean difference and profile 
information. Psychol. Bull., 1952, 49, 251-262. 


(Received January 18, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol. 62, No. 2, 446-449 


THE GENERALITY OF ACCURACY OF INTERPERSONAL PERCEPTION! 


VICTOR B. CLINE anp JAMES M. RICHARDS, Jr. 
University of Utah 


N a previous paper by the authors (Cline & 
Richards, 1960) evidence was presented indi- 
cating that accuracy of interpersonal perception is 
to some degree a general trait over diverse per- 
sons (about whom judgments are made) and 
different measuring instruments, using a procedure 
in which an interview of the person to be judged 
is presented by means of a sound color movie. 
These conclusions are considerably different from 
those reached by Gage and Cronbach (1955), 
Crow and Hammond (1957), and Stone and Lea- 
vitt (1954), among others who conclude that there 
is little or no generality in judging ability. Because 
of this divergence of conclusions, a most important 
question is how well the generality we obtained 
holds up in replication, or cross-validation, stud- 
ies. This is especially true since the filmed inter- 
views used in the previous research were selected 
on the basis of how well they discriminated good 
judges from poor judges, or, in other words, how 
well they correlated with total judging score. 
This note reports the results of four such repli- 
cation studies. 


METHOD 


The procedure used in all of these studies was for a 
group of judges to view a filmed interview, in which 
questions were asked the person in the film by a skilled 
interviewer about their personality, values, etc. The 
persons who were in the films were quite diverse with 
regards to age, social status, cultural background, and 
included both men and women. 

The judges who viewed these films filled out various 
judging instruments dealing with the person in each 
film. These instruments were constructed on the basis of 
intensive interviews, both with the person in the film 
and with five of his close associates, and a battery of 
psychological tests. The judges saw a filmed interview, 
then the projector was stopped, and the judging 
instruments for that film filled out. Then the second 
film was shown, the projector stopped, and the judging 
instrument for that film filled out, etc. The experimental 
procedures and the development of the judging instru- 
ments are described in detail elsewhere (Cline & 
Richards, 1960). 

Four different judging instruments were used in these 
replication studies. These were: the Behavior Postdic- 
tion test, where the judge was required to postdict the 
interviewee’s real life behavior on a seven-item-per-film 
multiple-choice test; the Trait Rating test, where the 
judge rated the interviewee on 25 adjectives using a six- 


1 This study was supported by Contract NONR 1288 
(04), Group Psychology Branch, Office of Naval Re- 
search. 


category Likert scale; the Opinion Prediction test, 
where the judge predicted how the interviewee had 
responded to 20 selected MMPI items; and the Adjec- 
tive Check test where the judge had to guess which one 
of a pair of adjectives each interviewee had checked as 
being descriptive of himself for each of 20 pairs of 
adjectives. On each of these four instruments items were 
designed so that the subjects could not obtain a high 
score on the basis of always checking the most socially 
desirable alternative or through some other similar 
response set. Precautions were taken to choose inter- 
viewees very heterogeneous as to age, social-cultural 
background, sex, etc, In addition, a Total Judging score 
was computed by converting scores on individual 
instruments to standard scores and adding across 
instruments. Due to difficulties in scheduling and/or 
motivating subjects, not all of these four judging 
instruments were used in all of the replication studies. 


RESULTS 


Subjects in the first replication study consisted 
of 295 students of both sexes recruited from ele- 
mentary psychology classes at the University of 
Utah in the Fall of 1958. These subjects were 
tested using the 10 films employed in our original 
study (which gave evidence for the generality of 
judging ability). In an effort to reduce the testing 
time, a shorter form of 6 films was selected from 
these 10 films, with selection again being made on 
the basis of how well the film discriminated poor 
judges from good judges. Intercorrelations of the 
accuracy scores on the different judging instru- 
ments for these subjects for both the 10-film 
form and the 6-film form are presented in Table 1. 
In this and all subsequent tables, all correlations 
between individual judging instruments and Total 
Judging are part-whole correlations corrected for 
the contribution of the part, through use of the 
usual correction formula (Gulliksen, 1950). 

Subjects in the second replication study con- 
sisted of 51 students of both sexes recruited from 
upper division psychology classes at the University 
of Utah in the Spring of 1959. These subjects were 
tested with six films. One film, however, had been 
changed from the study presented in Table 1. 
In this case, selection was mot made on the basis 
of discrimination of good judges from poor, but 
tather because the person in the film was dissimilar 
to the other five persons (films), both generally in 
personal characteristics and specifically on the 
Trait Rating instrument. At the same time 
Trait Rating was revised so that the adjectives 
included were those on which persons in the film 
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TABLE 1 
INTERCORRELATIONS BETWEEN AccuRACY SCORES ON DIFFERENT JUDGING MEASURES FOR 
University og UTAH STUDENTS TESTED IN FALL QUARTER, 1958 
(N = 295) 
10-Film Form 6-Film Form 
Trait Rating Adjective Total Judging Trait Rating Adjective Total Judging 
Behavior Postdiction 24 04 31 -36 -28 .39 
Trait Rating .31 44 AL +50 
Adjective Check 31 44 
7.05 = .11 r 01 = .15 
TABLE 2 TABLE 4 


INTERCORRELATIONS BETWEEN ACCURACY SCORES ON 
DIFFERENT JUDGING MEASURES FOR UNIVERSITY 
or UTAH STUDENTS TESTED IN SPRING 
QUARTER, 1959, Use Revisep SIX 


Fum Form 
(N = 51) 
+, |Opinion| Adjec- 
Trait f re Total 
Rating | Predic-| tive [Judging 


Behavior Postdiction 16 f2 afak 8T 


Trait Rating -31 | .13 | .28 
Opinion Prediction E LN | eae 
Adjective Check .36 
705 = .27 r 01 = 35 
TABLE 3 


INTERCORRELATIONS BETWEEN ACCURACY SCORES ON 
DIFFERENT JUDGING MEASURES FOR IDAHO 
Stare HOSPITAL PATIENTS TESTED 
with Revisep Srx-Firm FORM 


(N = 32) 
i jective Total 
ty aes tay 
Behavior Postdiction AL 61 67 
Trait Rating 31 -37 
Adjective Check 57 
r 05 = .35 r 01 = 45 


were relatively dissimilar. Both of these changes 
make it less likely that any generality of judging 
ability found using these films and this revision of 
Trait Rating can be attributed to real similarity 
of the persons judged to each other. 

: The correlations for these subjects using these 
Six films are presented in Table 2. 

Subjects in the third replication study consisted 
of 32 mental patients at the Idaho State Hospital 
South, Blackfoot, Idaho. These patients repre- 
Sented a variety of diagnostic categories, and were 
Selected primarily on the basis of being sufficiently 


INTERCORRELATIONS BETWEEN ACCURACY SCORES ON 
DIFFERENT JUDGING MEASURES ror T Group 
MEMBERS TESTED WITH REVISED SIX 
Fum Form 


(N = 118) 


ait opea Adjec- 


Tı lon Adi 
Rating |Predic-| (tive, [Judging 
Behavior Postdiction 428. | .36 | .36 | .52 
Trait Rating 27 | .03 | 26 
Opinion Prediction .07 | .47 
Adjective Check +35 
7.05 = 17 r Ol = .23 
TABLE 5 
RELIABILITIES OF JUDGING INSTRUMENTS 
(N = 149) 
Behavior Postdiction 37 
Trait Rating 60 
Opinion Prediction 199) 
Adjective Check 61 
Total Judging 54 


intelligent, motivated, and in contact with reality 
to complete the judging task, They were tested 
using the revised six-film form used in the second 
replication study. K 

The correlations for this group are presented in 
Table 3. 

Subjects in the fourth replication study con- 
sisted of 118 members of “T groups” obtained 
from rather heterogeneous sources. By T groups we 
mean groups of people, 8-15 in number, who had 
participated in special training in group dynamics. 
Thirty-six subjects were students from T groups 
at the Brigham Young University, Provo, Utah; 
17 subjects were adults from T groups at the 
Intermountain Training Laboratory at Cedar City, 
Utah; and 65 subjects were adults from T groups 
at the National Training Laboratory at Bethel, 
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Maine? All of these subjects were tested using the 
revised six-film form used in the second and third 
replication studies. 

Correlations for these groups, combined into a 
single group, are presented in Table 4. 

The data presented so far have dealt with gen- 
erality across judging instruments. Another im- 
portant question is to what extent is there general- 
ity within instruments across persons, or, in other 
words how reliable are the various judging instru- 
ments. To put it another way, is a judge who is 
accurate in judging one person also accurate in 
judging other persons on the same instruments? 
In order to answer this question, reliability coeffi- 
cients were computed using as subjects 149 judges 
from the second replication University of Utah 
student group, the Bethel National Training 
Laboratory, and Brigham Young University T 
groups. These subjects were tested with all four 
judging instruments using the revised six-film 
version of our judging test. These reliabilities are 
presented in Table 5. In Table 5 the reliabilities of 
Behavior Postdiction, Opinion Prediction, and 
Adjective Check are Kuder-Richardson Formula 
21 coefficients, and the Trait Rating reliability is a 
Spearman-Brown coefficient based on a first three 
films-second three films split. Because of the 
computational labor involved in computing a 
split-half reliability for Total Judging, this relia- 
bility was estimated from the formula presented 
by Mosier (1943) for determining the reliability of 
a composite, using unit weights for the parts and 
disregarding the cor.elations between the parts. 


Discussion 


One of the most important scientific criteria for 
evaluating research findings is whether it is 
possible to replicate the results. In practice, 
however, this criterion is somewhat difficult to 
apply to the results of the present studies, since 
not all of the individual correlations are statisti- 
cally significant, These difficulties are compounded 
by the fact that there is some question as to 
whether Type I error (rejecting the null hypothe- 
sis when it is true) or Type II error (accepting the 
null hypothesis when it is false) is more serious in 
the reported investigations of the generality ques- 
tion. Nevertheless, in all of the studies reported 
here there seems to be a consistent pattern of low 
positive correlations between different judging 
instruments, and somewhat larger positive corre- 
lations between judging instruments and Total 
Judging (where the contribution of the particular 
instrument has been eliminated or partialed out). 
The average interinstrument correlation for the 


2 Great appreciation is expressed to Jack Gibb for his 
assistance and cooperation in collecting this data. 
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six-film form (used in the second, third, and fourth 
replication studies), computed through use of 
Fisher’s z coefficient, is .25 and the corresponding 
average part-whole correlation is .41. Both of these 
correlations are significant at beyond the .001 
level. It will also be noted that all 38 correlations 
reported here are positive. While this result is 
obviously relevant to the question of whether or 
not there is generality among the various judging 
instruments, we know of no really satisfactory 
way to evaluate it statistically. As a rough approxi- 
mation, the binomial probability of obtaining all 
one sign in a random distribution of signs with 38 
cases is also much less than .001. 

The reliabilities obtained for the judging instru- 
ments are not extremely high. It should be noted, 
however, that Kuder-Richardson Formula 21 relia- 
bilities are a lower bound estimate of the reliability 
(Gulliksen, 1950), and that the Trait Rating and 
Total Judging reliabilities obtained in this research 
are also lower than the maximum reliabilities for 
the instruments involved, since no attempt was 
made to weight in such a way as to maximize 
reliability. We therefore consider these reliabilities 
to be consistent with the correlations between in- 
struments. 

The relatively low reliability for Postdiction can 
be substantially attributed to the fact that it has 
only about one-third as many items (42 vs, 120) 
as the other instruments, The standard formula for 
relating test length to reliability (Gulliksen, 1950) 
indicates that if Postdiction were three times as 
long, and thus had approximately as many items 
as the other judging instruments, its reliability 
would be .51, a value more comparable to the 
reliabilities of the other judging instruments. 

We regard these results as confirming the pre- 
vious findings of generality of accuracy of inter- 
personal perception. The consistency of results 
over the four replications is stronger evidence than 
is a statistical test of the significance of correlation 
coefficients within a given replication. The absolute 
size of the correlations between judging instru- 
ments has decreased from that obtained in the 
original study (Cline & Richards, 1960), as could 
be expected in a cross-validation such as this. 
The correlations between judging instruments are, 
for the most part, less than the limits set by their 
reliabilities, a finding that tends to confirm the 
view that judging ability is factorially complex. 

In the course of our research considerable effort 
was devoted to the treatment of scores on the 
Trait Rating in an analytic way suggested by 
Cronbach (1955), since our original study had 
suggested the importance of this type of analysis 
in studies of the generality of accuracy of intet- 
personal perception. Two separate programs havé 
been written for the IBM 650 computor, one for 
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obtaining the total score and the four basic com- 
ponents in difference score form, and one which 
performs the same computations ‘but in addition 
breaks down two components into a correlation 
term and a variance term. These programs were 
tailored to our instrument, and were not written 
in the general case. Work with these programs 
indicated that due to some of the special charac- 
teristics of this analytic treatment of scores, and 
the particular form of the Trait Rating test used 
in these studies, not all of the components could 
be meaningfully interpreted. Because of these 
limitations, we made no further analysis of these 
components. These limitations do not, however, 
appear to prevent the total score on Trait Rating 
from being meaningful, and this score was retained 
in the reported studies. Trait Rating has subse- 
quently been revised to eliminate these difficulties 
of interpretation, and we are currently completing 
a study of the characteristics and interrelations of 
the various components on a new group of subjects. 


SUMMARY 


Results of our previous research indicated that 
there is some degree of generality in accuracy of 
interpersonal perception or judging ability. Four 
replication studies of the original research were 
carried out, with the measures of judging ability 
being somewhat revised after the first of these 
Studies. In the three subsequent studies using the 
Tevised measures, the average interinstrument 
correlation between different measures of judging 
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ability was .25 and the average corrected part- 
whole correlation between individual judging 
instruments and a Total Judging score was .41. 
Since both of these correlations are significant at 
the .001 level, we conclude that these studies con- 
firm the previous findings of a low but consistent 
degree of generality of judging ability. The relia- 
bilities of the judging instruments, while not ex- 
tremely high, also tend to indicate that there is 
some generality of judging ability. 
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ANTI-SEMITISM AND THE EXPRESSION OF FANTASY AGGRESSION! 


DONALD WEATHERLEY? * 
University of Colorado 


CCORDING to the scapegoat theory of preju- 
dice, the manner in which a person copes 
with his aggressive urges has an important bear- 
ing on the degree to which he exhibits prejudicial 
reactions toward members of minority groups. 
The prejudiced person is assumed to be appre- 
hensive about directly aggressing toward the 
real instigators of his hostility and as a conse- 
quence tends to express aggression in indirect 
forms and/or against substitute objects—i.e., he 
displaces aggression. This tendency to displace 
aggression is considered an important factor in 
producing and maintaining ethnic prejudice (e.g., 
Dollard, Doob, Miller, Mowrer, & Sears, 1939). 
Tf there is in fact a functional relationship be- 
tween a tendency to displace aggression and de- 
gree of prejudice we should expect to find that in- 
tolerant persons have a greater tendency than 
tolerant persons to displace aggression. Studies 
bearing directly on this proposition (Berkowitz, 
1959; Lindzey, 1950) have produced inconclusive 
results, The pertinence of these studies to the 
scapegoat theory involves the assumption that 
prejudice is but one aspect of a more general 
tendency to displace aggression on the part of 
intolerant individuals, As Lindzey (1950) first 
suggested, however, the prejudiced person may 
be one who has acquired a specific tendency to 
displace aggression selectively toward members of 
certain minority groups. Thus, for example, the 
highly anti-Semitic individual may have a strong 
tendency to displace aggression onto Jews, but 
may not necessarily show other evidence of a 
strong tendency to displace aggression. The pres- 
ent study was designed to test this hypothesis, 
Subjects differing in degree of anti-Semitism 
were compared with respect to their readiness to 
express experimentally aroused aggression in sub- 
sequent fantasy productions, By a special manipu- 
lation of the instructions for the fantasy measure, 
it was possible to obtain measures of aggression 
expressed toward fantasy characters with Jewish 
or non-Jewish names, As a test of the hypothesis 
that High A-S subjects have a greater specific 
tendency than Low A-S subjects to displace ag- 


1 This paper is based on a doctoral dissertation done 
at Stanford University. The work was done while the 
author was on active duty in the Army Medica] Service 
Corps, enrolled in the Army Senior Psychology Train- 
ing Program. 

2 The author gratefully acknowledges the aid given 
by J. K. Adams, Robert Sears, and C. L. Winder. 
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gression selectively onto Jews, it was predicted 
that as a function of aggression arousal: High A-S 
subjects should express more fantasy aggression 
toward Jewish-named characters than Low A-S 
subjects; and High A-S subjects, as compared with 
Low A-S subjects, should direct more aggression 
toward fantasy characters with Jewish names than 
toward characters with non-Jewish names, The 
second prediction, it can be seen, refers to an inter- 
action between the variables of A-S level and char- 
acter name. 


Mernop 
Subjects and Procedure 


The data were obtained from 100 male, non-Jewish 
college students, selected on the basis of extremely 
high and extremely low scores on the Levinson Anti- 
Semitism (A-S) Scale. Twenty-five High A-S subjects 
and 25 Low A-S subjects were subjected to an aggression 
arousing situation, and a specially designed fantasy test 
was then administered. Control subjects, individually 
matched with the angered subjects on the basis of A-S 
scale scores, took the fantasy test without prior aggres- 
sion arousal. 

The data were collected in small group sessions. In 
Arousal group sessions aggression arousal was accom- 
plished by making highly insulting and depreciating 
comments to the subjects in the course of administer- 
ing a brief paper-and-pencil task to them. The insulting 
was done by the author. Immediately after this proce- 
dure a different experimenter administered the fantasy 
test. The situation was arranged so that the subjects 
would not perceive a relationship between the aggres- 
sion arousing situation and the subsequent fantasy test. 
Following the fantasy test and before the subjects were 
told of the deception employed, they were asked to 
complete an open-ended questionnaire that gave them 
an opportunity to express hostility toward the author. 

The same procedure was followed for the control 
subjects, with the exception that the paper-and-pencil 
task was administered in a friendly, nonprovoking 
manner. 

A more complete description of this procedure has 
been reported elsewhere (Weatherley, 1956). 


Fantasy Test 

Eight original pencil sketches, each depicting four 
male figures involved in an ambiguous situation, were 
successively projected on a screen for 20 seconds in 
constant order in all sessions.* The subjects were given 
4 minutes to write a story about each picture. 


* Photographs of these pictures have been deposited 
with the American Documentation Institute. Order 
Document No. 6741 from ADI Auxiliary Publications 
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In addition to the usual group TAT instruction (Mc- 
Clelland, Atkinson, Clark, & Lowell, 1953) the sub- 
jects were given the age, name, and;occupation of a 
fictional character at the time each picture was shown. 
‘They were instructed to tell what was happening to 
that character in their story about the picture. Four 
characters were named and described to the subjects. 
Thus each character was presented twite in the series 
of pictures. 

Two of the characters were given Jewish sounding 
names: Sam Goldblatt and Herb Rosen, The other two 
characters were called Ken Taylor and Jim Brooks. 
These were considered non-Jewish names. The choice 
of names was based on empirical results reported else- 
where (Weatherley, 1956). Each character was not 
only named, but described to the subjects as either a 
28-year-old college graduate student or a 16-year-old 
high school student. Each of these two descriptions was 
combined equally often with Jewish and non-Jewish 
names, 


Analysis of Fantasy Data 

For each subject, a count was made of the number 
of aggressive acts directed toward each of the four 
characters which the subjects had been told to include 
in their stories, From this count, two total scores were 
derived for each subject. These scores corresponded to 
the total number of aggressive acts directed toward 
Jewish and non-Jewish characters, respectively. The 
data were then reduced to difference scores by sub- 
tracting the two scores of each control subject from the 
comparable scores obtained by his matched partner in 
the Arousal group. Since the Arousal and Control groups 
differed only in that the former was subjected to an 
Aggression arousing experience prior to taking the fan- 
tasy test, these A-C difference scores reflected the net 
effect of aggression arousal on fantasy aggression. 


RESULTS 
Effectiveness of Aggression Arousal 


The questionnaire administered immediately 
after the fantasy test and before the subjects were 
told of the deception provided one basis for com- 
Paring the reactions of the angered subjects with 
that of control subjects. As evaluated by three 
raters, 65% of the Arousal group subjects were 
judged to have expressed hostility toward the 
experimenter who had insulted them, while none 
of the subjects in the Control group did so. (There 
Was not a significant difference within the Arousal 
group between High and Low A-S subjects on this 
Measure.) In addition to this questionnaire, at the 
end of the Arousal group sessions the subjects 
Were asked to describe in writing how they had 
felt when insulted earlier in the session, All but 6 


Project, Photoduplication Service, Library of Congress; 

fashington 25, D. C., remitting in advance $1.25 for 
microfilm or $1.25 for photocopies. Make checks pay- 
able to: Chief, Photoduplication Service, Library of 
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TABLE 1 
A-C MEAN DIFFERENCES IN NUMBER or AGGRESSIVE 
ACTS TOWARD CHARACTERS WITH JEWISH AND 
Non-Jewisn Names 


Object of Aggression it between 
jewish Characters a 
'on-Jewish Characters 19 


t between Means 


* p <.05, using a one-tailed test. 
-p< 05. 
89 < 01. 


of the 50 Arousal group subjects admitted that 
they had felt angered by the comments of the 
experimenter. In addition, all of these subjects 
stated that they did not perceive a relationship 
between the aggression arousing situation and the 
subsequent fantasy test. Thus both in the report 
of their subjective feelings and in an objective 
measure of hostility expressed, the Arousal group 
subjects gave convincing evidence that aggression 
arousal had been effective, 


Effect of Aggression Arousal upon Fantasy Ag- 
gression 

The net effect of aggression arousal upon fan- 
tasy aggression is reflected in the A-C mean differ- 
ences reported in Table 1. It can be seen that for 
High A-S subjects, aggression arousal produced a 
slight increase in fantasy aggression toward 
Jewish-named characters and a slight decrease in 
ion toward characters with non-Jewish 
names, Neither of these A-C means differed sig- 
nificantly from zero, They did, however, differ 
significantly (p < .05) from one another when a 

one-tailed test was used, 
For Low A-S subjects aggression arousal re- 
sulted in a very significant decrease in fantasy 
directed toward Jewish-named charac- 


ables of anti-Semitism and “Jewishness” of the 
characters selected as targets of fantasy aggres- 
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sion. These obtained differences between High 
and Low A-S subjects were in accord with the 
predictions made.’ 


Discussion 


Despite evidence that the aggression arousing 
situation was successful in making the subjects 
angry, it did not result in a significant increase in 
fantasy aggression—none of the A-C means was 
significantly positive. Displacement of aggression 
in fantasy form was thus not formally demon- 
strated to have occurred. In general the reverse 
was true, especially in the case of the Low A-S 
subjects. Lindzey (1950), however, was able to 
demonstrate an increase in the fantasy aggression 
of both tolerant and intolerant individuals fol- 
lowing a frustating situation. This discrepancy 
between Lindzey’s results and the present findings 
suggests that situational factors may play an 
important role in determining the net effect of 
aggression arousal upon subsequent, indirect ex- 
pressions of aggression. 

In contrast to the frustration procedure used by 
Lindzey, the present study used a technique that 
evoked anger directed toward an easily identified 
instigator, implicitly involved strong sanctions 
against counteraggression, and evoked anger in 
subjects to an extent that made it relatively easy 
for them to recognize their own hostile feelings. 
Factors such as these may well have led to in- 
hibiting influences upon subsequent fantasy ag- 
gression, thus reducing the average level of fan- 
tasy aggression in the angered subjects. In any 
event, the results dramatically demonstrated that 
aggression arousal did produce differences in reac- 
tion to the subsequent fantasy test. 

In terms of the hypothesis under study, an im- 
portant finding was that aggression arousal re- 
sulted in the High A-S subjects directing signifi- 
cantly more aggression toward Jewish characters 
than they did toward non-Jewish characters. This 
finding suggests that highly anti-Semitic persons 
do tend to select Jews over non-Jews as targets for 
aroused aggression. Furthermore, as predicted, it 
was shown that High and Low A-S subjects dif- 


4The procedure of comparing High and Low A-S 
subjects in terms of A-C differences insures that any 
significant differences between them are a function of 
aggression arousal. It is of interest, nevertheless, to 
examine the data for possible differences in fantasy 
aggression between High and Low A-S subjects under 
control conditions. An analysis of variance of the fan- 
tasy data obtained from the Control group indicated 
that the High and Low A-S subjects who were not 
subjected to the aggression arousal situation did not 
differ significantly in the amount of fantasy aggression 
they directed toward the four fantasy characters. 
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fered reliably in this respect. Also as predicted, 
High A-S subjects directed significantly more ag- 
gression toward the Jewish-named characters than 
did the Low A-S subjects as a function of aggres- 
sion arousal. These results then support the hy- 
pothesis that persons high in anti-Semitism have a 
greater specific tendency than persons low in anti- 
Semitism to displace aggression selectively onto 
Jews. 

Another aspect of the data serves to underscore — 
the specific nature of the differences found between _ 
prejudice groups. High and Low A-S subjects did 
not differ significantly as a function of aggression — 
arousal in the amount of aggression expressed — 
toward the non-Jewish characters. The implica- 
tion is that at least in a situation where Jewish 
objects are available as targets for aggression, the 
High A-S subject is not any more likely than the 
Low A-S subject to direct aroused aggression 
toward non-Jewish substitute objects. This finding — 
argues against an explanation of prejudice as 
representing merely one facet of a more general 
tendency to displace aggression diffusely. 

So far, Low A-S subjects have been discussed 
only in connection with the differences found be- 
tween prejudice groups. However, the reaction of 
the Low A-S subjects to the aggression arousing 
situation deserves attention in its own right, Low { 
A-S subjects in the Arousal group directed signifi- — 
cantly less aggression toward the Jewish-named 
characters than did their matched partners in the 
Control group. Thus they showed a definite in- 
hibition of postarousal fantasy aggression with re- 
spect to Jewish-named characters. Even more 
noteworthy is the fact that as a function of 
aggression arousal Low A-S subjects directed sig- 
nificantly less aggression toward Jewish-named ` 
characters than they did toward the non-Jewish 
characters. This differential response to the fantasy 
characters is the reciprocal of that found in the 
case of High A-S subjects. In effect, the tolerant 
subjects gave evidence of a specific tendency to 
avoid expressing aroused aggression toward Jews. 

This finding challenges the traditional image of 
the very tolerant individual as one who is a bias 
free neutral in his reactions to minority group 
members. The Low A-S subjects clearly demon- 
strated a response bias when angered that was 
consonant with their expressed attitudes of ex 
treme tolerance. The finding agrees with the propo- 
sition that extreme attitudes in any direction are 
likely to entail an appreciable emotional involve- 
ment. True emotional neutrality, in the case of 
prejudice as with any other dimension of attitude, 
may be found most often in persons who lie be 
tween the positive and negative extremes. 
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SUMMARY 


In order to test a hypothesis derived from the 
scapegoat theory of prejudice, two groups of male 
college students, chosen on the basis of extremely 
high and extremely low scores on the Levinson 
Anti-Semitism Scale, were subjected to an aggres- 
sion arousing situation. Following aggression 
arousal a specially designed fantasy test was ad- 
ministered, which yielded measures of aggression 
expressed toward fantasy characters with Jewish 
or non-Jewish names. Control subjects, matched 
with Arousal group subjects on the basis of A-S 
scale scores, took the fantasy test without prior 
aggression arousal. 

Aggression arousal produced significant differ- 
ences in the fantasy aggression of High and Low 
A-S subjects. The nature of these differences sup- 
ported the hypothesis that persons high in anti- 
Semitism have a greater specific tendency than 
persons low in anti-Semitism to displace aggres- 
sion selectively onto Jews. In addition, evidence 
was obtained suggesting that individuals very 
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low in anti-Semitism have a specific tendency to 
avoid expressing aroused aggression toward Jews. 
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SOME COMMENTS ON THE EPPS EGO DISJUNCTION SCORE 


EDWARD F. GOCKA! ann VITALI ROZYNKO* 
Veterans Administration Hospital, American Lake, Washington 


N ego disjunction score (EDS) derived from 
four pairs of needs in the Edwards Personal 
Preference Schedule (EPPS) was reported to relate 
positively, in general, to degree of psychopathol- 
ogy (Trehub, 1959). The groups tested could be 
ranked on their mean EDSs, from low to high, 
in the following order: college students, adoles- 
cents, neurotics, character disorders, and schizo- 
phrenics, A statistical analysis yielded three sig- 
nificantly different subgroups. The first subgroup, 
with the lowest mean score, consisted of the col- 
lege students alone, The second consisted of ado- 
lescents, neurotics, and character disorders, 
whereas the third subgroup, with the highest mean 
score, consisted of schizophrenics alone, The score 
did not discriminate at a significant level within 
the second subgroup. Several comments need to 
be made concerning this score, 

The need pairs selected by Trehub (1959) were 
Aggression-Deference, Succorance-Nurturance, Au- 
tonomy-Abasement, and Order-Change, An ego 
disjunction indicator for a particular need pair 
was obtained by summing the standard score 
units for each need, The value of 100 was then 
subtracted from this sum, and positive residuals 
were considered as indicating the presence of dis- 
junction. The degree of disjunction for each need 
pair was considered to be reflected in the magni- 
tude of the positive residual. Thus, the total EDS 
for each subject consisted of the sum of his posi- 
tive residuals on the disjunction indicators. 

If we are to make comparisons between dis- 
parate classifications of people in terms of some 
personality score based on standard units, then we 
must be certain that contingency factors involving 
variables such as sex, age, education level, occu- 
pational status, and others of a demographic na- 
ture are controlled or accounted for in the analysis. 
Failure to control such contingency variables could 
lead one to impute some exotic discriminating 
power to a measure when, in reality, it was con- 
tingent on an ordinary demographic characteristic, 

Trehub (1959) used the standard score distribu- 
tion of the college normative sample published by 
Edwards (1954) in order to compute his EDSs. 
That these norms differ from the norms for an 


1 The authors wish to acknowledge the assistance of 
Hildegund Holloway in data analysis and John B. 
Marks, chief research psychologist at American Lake, 
for his constructive review of the manuscript. 
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adult sample is evident from results obtained by 
Koponen (1957) and published by Edwards (1959) 
in the revised edition of the EPPS manual. If | 
now consider only the demographic di 
within a particular group like the general 
population, we also expect to find 
demographic differences reflected in the 
scores. Some data reported by Koponen (19 
for 3,882 male household heads illustrate this 
When Koponen divided his subjects into 
groups, significant differences (.001 level) 
means were found for 13 of the 15 EPPS nec 
Similarly, when this sample was divided in | 
of grade school, high school, or college education 
levels, 11 of the 15 EPPS needs gave sig n 
differences at the .001 level. When the sample Wi 
divided into four income groups, 10 of the 1 
needs gave significant differences at the ‚001 
When divided by region of the country 
central, south, west), 10 of the 15 needs ga’ 
same general level of differences. Similar 
occurred with other demographic variables 
a female sample numbering 4,922 subjects, 
In this critique, we will focus only on the m 
lationship of age to EDS in order to demonstrat 
our point, Before turning to the analysis of a 
effects, from which we may roughly predict 0 
EDS increments for groups of different ages, Wi 
will consider some specific facts relating K ¥ 
general adult population to the college no 
sample, If we consider differences in age, We 
that 67.8% of Koponen’s male group is of 
or above; in contrast the college normative 
has only 0.9% of its male subjects with an 
40 or above. For the females, the percen' 
67.3 and 2.4, respectively. When consid 
Koponen’s overall data, especially for those 
making up an EDS, we find that six of the 
mean need values for the general adult male p 
lation are higher than the mean need values t 
college normative sample; the average raw soot 
increase over the college normative group 
2.44 per ego disjunction pair. Likewise, the ai 
raw score increase over the college normative 
per ego disjunction pair for the general 
male population equals 2.50. This is becau 
of the cight mean need values are higher 
general adult female population. The di 
standard deviations for the two pop 
pear to have neither the consistency nor the s6 
shown by their population means. The 
adult male population standard deviatio 
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sample by an average of .04 per ego disjunction 
pair. At best, for the present samples, we can say 
that the differences between standard deviations 
are so small that neither group would be favored 
greatly toward a larger mean EDS from the nature 
of these differences. 


means for all ego disjunction pairs for the adult 
population, Indeed, if we take the raw means for 
each group and compute EDSs based on the 
parameters of the college normative sample, we 
obtain a measure of the expected increase in the 
average EDS for the particular group compared 


puted from these means will give us some estimate 
of how much the actual average EDS will differ 
from the actual average EDS computed for the 
normative sample, Of course, if we use the means 
of the normative sample to compute its EDS we 
get a score of exactly zero, An actual average 
EDS for this group, based on the raw scores them- 


Parameters, and these standard scores were com- 
bined into an EDS. Table 1 gives these expected 

. For example, we would expect a differ- 
nce of 14 in the average EDSs computed for the 
male college norms and the general adult male 
Sample between the ages 30 to 39, provided that 
the standard scores for the general adult group 
Were based on the parameters of the male college 
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asked orally in fixed sequence, about an inflated balloon 
held by the tester. Five of the questions concerned the 
child’s desire to “pop” the balloon himself to or see 
the tester do it (e.g., “Do you think it would be fun 
to see me ‘pop’ this balloon?”; “If I gave one to you, 
would you take it home and ‘pop’ it, or would you 
just play with it?”). Subjects were not actually given an 
opportunity to break the balloon, however. The child’s 
aggression score was simply the number of times he 
chose the “popping” response. Although the possible 
range of scores was 0-5, the actual range was 0-2. 

The testers, who did not know anything about the 
experimental treatments the children had experienced 
prior to the test, were also asked to check any of the 
following adjectives that they felt described the sub- 
ject’s behavior during the testing period: angry, excited, 
tense, relaxed, cooperative. At the conclusion of the 
experiment, all children who had been frustrated were 
given balloons to take home and permitted to see 
another, nonaggressive cartoon as rewards for their 
cooperation. 


RESULTS 


The results of analysis of variance of the ag- 
gression test data, summarized in Table 1, indicate 
that the only significant effect was the main effect 
for the cartoon variable. Table 2, which gives the 
mean aggression scores of the six groups, shows 
that subjects who saw the aggressive cartoon were 
more willing to express aggressive impulses in this 
setting than those who saw the nonaggressive car- 
toon or no cartoon at all. This finding supports 
the hypothesis that viewing violence in a cartoon 
may actually stimulate or intensify the child’s 
aggression in a subsequent permissive situation. 
There was no evidence in these data that the sexes 


TABLE 1 
ANALYSIS OF VARIANCE FoR AGGRESSION-TEst DATA 


Source 


Cartoon 
Frustration 
Sex 


Frustration X Sex 

Frustration X Cartoon 

Sex X Cartoon 

Frustration X Sex X 
Cartoon 

Within cells 


TABLE 2 
Grove MEANS FOR AGGRESSION-TEST DATA 


Cartoon Treatment Frustrated Nonfrustrated 
Aggressive Cartoon 1.67 2.00 
Nonaggressive Cartoon -67 -83 
No Cartoon -50 -67 
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differed from each other in their reactions to the 
cartoon. q 

It is interesting to note that, contrary to what 
would be predicted on the basis of the frustration- 
aggression hypothesis, the frustrated and non- 


to aggressive expression as measured here. The 
data from the adjective check lists, filled in by the 
testers, indicated, however, that feelings of anxie 
and tension were more likely to occur subsequent 
to seeing an aggressive cartoon than after the other 
treatments. Eight of the 12 subjects (67%) who 
saw the aggressive cartoon were described 
“tense” while among the 24 other subjects, only 
8 (33%), 4 who had seen the nonaggressive car- 
toon and 4 who had not seen any cartoon, were 
rated in this way (x? = 3.94, p < .05). 
These findings may be interpreted as evidence 
that reaction to frustration did not increase overt 
aggression but rather heightened feelings of ten- 
sion—i.e., that the frustration-aggression hypothe- 
sis did not hold in this situation. On the other 
hand, it is also possible that, in spite of the ra- 
tionale underlying the use of this frustration pro- 
cedure, the task assigned to the “frustration” 
group was not actually frustrating. In view of these 
considerations, it is difficult to make any generali- 
zations about the possible effects of experimentally- 
induced frustration or subsequent aggression. 


Discussion 


The experimental results provide evidence sup- 
porting the basic hypothesis of the study which 
stated that film-mediated fantasy aggression may 
intensify aggressive expression in permissive play. 
Since the present findings may be directly applica- 
ble only to the restricted situation of permissible 
aggressive play, it is difficult to evaluate their 
implications, if any, for the validity of the catharsis 
hypothesis. This hypothesis is generally inter- 
preted, as it was in the Feshbach and Siegel” 
studies, to refer to interpersonal aggression sub- 
sequent to other aggressive activities, such as 4 
fantasy aggression. There is no evidence in the 
data to suggest that, in an unselected group of 
subjects, those who increase in the measure 0 
aggression used here become more hostile toward — 
other children, although pretests show that this 
measure differentiated at the .02 level the 
children rated high and low in aggression by theit ” 
nursery school teachers, Nevertheless, it should be ~ 
noted that our findings are entirely consistent 
with the results of Feshbach’s second experiment 
in which children’s “inappropriate aggression” l 
increased during play with aggressive toys. If, a 
is generally assumed, a child’s aggression is largely | 
a function of the strength of his aggressive drive 
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and his internalized inhibitions against aggression, 
then increases in aggression must be due to either 
temporary intensification of drive “or reduction of 
inhibitions, or both. In the present study, intensi- 
fication of instigation to aggression may have re- 
sulted directly from identification with aggressive 
cartoon characters with accompanying assumption 
of their motives. In Feshbach’s study, the aggres- 
sive toys may have provided cues eliciting associa- 
tions with real or fictional aggressive identificands, 
and consequently, increased aggressive motivation. 

The second possible, perhaps more parsimoni- 
ous, explanation is that the permissive “fun” 
context of animated cartoons or play with ag- 
gressive toys leads to a reduction of inhibitions 
against aggressive expression, particularly in sit- 
uations where aggressive play is not likely to be 
punished. It has frequently been observed that 
aggression in doll play increases from session to 
session, presumably because “the deliberate per- 
missiveness of the experimental procedure progres- 
sively acts to reduce the inhibitions about ag- 
gression the child has heretofore acquired” (Levin 
& Sears, 1956, p. 149). Moreover, Feshbach 
(1956) found that, particularly among boys in- 
itially low in aggression, aggressive behavior 
increased following permissive free play experi- 
ences, The child may regard these situations as 
“permission-giving,” with respect to aggression 
(Siegel & Kohn, 1959); i.e., he may infer that 
aggression is distinctly permissible and appropriate 
in these situations. The response of behaving 
aggressively may then generalize to other relatively 
permissive situations, e.g., doll play or a balloon- 
breaking test. 

These considerations may also help to explain 
the discrepancy between the findings of the present 
study and those of Siegel who found that children 
did not display significantly more aggression to- 
ward peers after watching an aggressive cartoon 
than they did after watching a control cartoon. 
The two studies appear to differ most importantly 
in the indices of aggression employed.? In the test 
used in the present study, a relaxed tester asked 
questions phrased in a way that permitted, or even 
encouraged, the free expression of aggressive feel- 
ing, not overt aggression. The test setting con- 


2 While Siegel’s subjects were somewhat younger 
than ours, this age difference could hardly explain the 
discrepancy in the results. It is also possible that 
differences in the cartoon stimuli account, at least 
partially, for the differences in the findings. For ex- 
ample, compared with those used by Siegel, the aggres- 
Sive cartoons of the present study may have provided 
better (i.e., more immediate or intimate) identification 
models or a more “permission-giving” context for 
aggression. There is no evidence to support such an 
explanation, however. 
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tained no apparent threat of counteraggression and 
there were no strong cues evoking internalized 
inhibitions against aggression, Increased aggressive 
feelings, stimulated by the aggressive cartoon, were 
therefore likely to be generalized and expressed in 
a permissive test situation. 

Siegel’s measure of aggression, on the other 
hand, was based on overt aggression in social play 
in the absence of adults. This is a situation in which 
strong inhibitions against aggression are likely to 
be evoked (Siegel & Kohn, 1959). Siegel and Kohn 
(1959) have pointed out that “lacking any adults 
to define the social situation and to express ex- 
pectations of his behavior in it, the child will . . . 
rely ... on his own learned standards of conduct 
... Which, in the middle-class child, will typically 
be that aggression is unacceptable in social play” 
(p. 134). It may be reasonably hypothesized that 
there was a reduction of inhibition against ag- 
gression among Siegel’s subjects, as among ours, 
after they viewed an aggressive cartoon, but this 
effect was not apparent in a test situation that 
contained cues that elicit powerful inhibitory 
responses, 


SUMMARY 


The purpose of this study was to test the hy- 
pothesis that exposure to aggressive fantasy in an 
animated cartoon may intensify children’s im- 
pulses to aggression, Subjects were 36 first grade 
children, 18 girls and 18 boys, of middle class 
origin. The intensity of the child’s aggressive im- 
pulses was inferred from his responses to questions 
concerning desire to “play with” or “pop” a large 
yellow balloon held by a tester. 

The experimental findings clearly supported the 
major prediction that exposure to aggressive fan- 
tasy in an animated cartoon would stimulate 
children’s aggressive behavior in play. Two possible 
explanations for these results were discussed: that 
intensification of instigation to aggression may 
have resulted from the child’s identification with 
the aggressive cartoon characters, with accom- 
panying assumption of their motives; that the 
relaxed, “fun” context of animated cartoons may 
lead to a reduction of inhibitions against aggressive 
expression in a permissive play situation. 
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RELATIONSHIPS BETWEEN HOPE OF SUCCESS AND FEAR OF FAILURE, 
ANXIETY, AND NEED FOR ACHIEVEMENT 


E. EDWARD REITMAN! anp CARL D. WILLIAMS? 
University of Miami 


n issue regarding the nature of what is meas- 
A ured by McClelland’s TAT test of need for 
achievement (n Achievement) has developed from 
a study by Clark, Teevan, and Ricciuti (1956). 
On the basis of a level-of-aspiration question- 
naire, they placed the subjects, on whom they 
had obtained an Achievement score, along a con- 
tinuum from hope of success (HS) to fear of failure 
(FF). They report that, as compared to the middle 
third of the continuum, the HS and FF groups had 
significantly lower n Achievement scores and 
significantly more positive subscores and fewer 
negative subscores on the n Achievement measure. 
The first finding of Clark et al. (1956) is not in 
agreement with previous work summarized in 
McClelland, Atkinson, Clark, and Lowell (1953). 
They stated that the n Achievement score could 
not be used as an indication of the HS-FF con- 
tinuum and attributed their discrepancy in results 
to the difference in the pictures used to elicit the 
Stories. 

By relating their findings to McClelland’s 
(McClelland et al., 1953) formulation, Clark et al. 
(1956) indicated that their HS group would be 
expected to have had little anxiety, the middle 
group some anxiety, and the FF group great 
anxiety over goal attainment. After thus suggest- 
ing that the HS and FF groups were genotypically 
different in regard to anxiety over goal attainment, 
they indicated that the two groups were pheno- 
typically similar as measured by McClelland’s n 
Achievement test. 

A directly opposite interpretation of these 
Tesults has been suggested by Atkinson (1958). 
He regards the TAT score of n Achievement as a 
Measurement of genotypic and the level-of-aspira- 
tion questionnaire as an expression of phenotypic 
Motivation, 

The measurement of anxiety, in addition to 
level-of-aspiration and n Achievement, may per- 
mit the issue to be resolved. It would follow that 
if the proposal by Clark et al. (1956) is correct, 
the FF group should be more anxious than the 
middle group, which in turn should be more anx- 
ious than the HS group. If Atkinson (1958) is 


1 Now at the University of Houston. 
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correct in regarding both the HS and the FF groups 
as being genotypically fearful of failure, then these 
groups should be more anxious than the middle 
group but should not differ in anxiety between 
themselves. 

The present investigation had three objectives: 
first, to attempt to clarify the contradictory results 
by repeating the Clark et al. (1956) study using 
pictures similar to those used in the earlier studies; 
second, to explore the generality of the Clark et al. 
findings by employing two additional samples of 
subjects; third, to contribute to the understanding 
of what is measured by McClelland’s TAT meas- 
ure of need for achievement and level-of-aspira- 
tion. 

METHOD 


The 113 subjects in this research were male students 
attending the University of Miami summer sessions. 
Seventy-two were enrolled in two introductory psy- 
chology courses taught during one semester by the 
same professor, using identical textbooks and examina- 
tions. Forty-one students were in one of three more 
advanced undergraduate psychology courses taught 
by three different professors. 

The procedure followed was as reported by Clark 
et al. (1956). The group TAT measure of n Achievement 
was administered early in the semester. In all, five 
individual testing sessions were required in order to 
administer the TAT to the 113 subjects. This was 
done under neutral conditions using the same instruc- 
tions, experimenter, and physical location with the 
following exceptions. One of the introductory groups 
was not tested in the same room and a new experimenter 
administered the TAT during a testing session for one of 
the advanced groups. Statistical tests of the TAT 
scores showed no significant difference between 
means of the two introductory groups or between 
means of the three advanced groups. The four 

ictures used to elicit the stories were obtained 
from McClelland,’ who stated that they were 
“roughly equivalent” to those used in the original 
studies. The pictures are numbered 33, 26, 24, and 9 
in Atkinson (1958). 

The TAT protocols were independently scored by 
two persons having validity coefficients of .85 and .88 
with regard to the material included in the manual 
provided by ‘Atkinson (1958). On the present set of 
scores, a rho of .87 was obtained between the ratings 
of these same persons. Later in the semester, the sub- 
jects were given the Taylor Manifest Anxiety (MA) 


scale. 
‘The level-of-aspiration questionnaire developed by 
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Clark et al. (1956) was administered immediately before 
the final examination. Again the analysis of the data 
obtained from the level-of-aspiration scale was similar 
to that used by Clark et al. In each case the range of 
the raw scores for each index was converted into seven 
class intervals and the value of seven was assigned in 
such a manner so as to insure that the HS subjects 
always received the highest scores. 


RESULTS 


Table 1 presents the intercorrelations between 
the four indices of the questionnaire for the in- 
troductory students and for the more advanced 
students. The correlations for the advanced group 
are very similar to those reported by Clark et al. 
(1956). The correlations for the introductory 
group are somewhat higher than those for the 
advanced group, very probably indicating the 
greater degree of homogeneity in this group. 
Both sets of correlations are positive and highly 
significant indicating the internal consistency of 
these indices. The four indices were summed and 
the resulting distribution divided into three parts, 
The upper third constituted the hope of success 
group and the lower third the fear of failure group. 

Table 2 presents the mean n Achievement score 
for the various groups. The mean for the intro- 
ductory students is significantly higher (6 < .01) 
than that for the advanced students, None of the 
six F tests between the HS, middle, and FF cate- 
gories were significant. 

Table 3 shows the mean MA scale scores for the 
HS, middle, and FF categories. For both the intro- 
ductory and advanced students the mean MA scale 
score increases from the HS group to the FF group. 
In neither case does this trend reach significance. 
However, for the advanced students, the difference 
between the FF group and the HS and middle 
groups combined is significant (p < .05). 

Two additional analyses reported by Clark et al, 


TABLE 1 


RELATIONSHIP AMONG Four INDICES oF Hore or 
Success AND FEAR oF FAILURE 


(Each index is correlated with the sum of the 


other three) 
Correlation with 
Three Remaining 
Indices* 
Index REF 
mony” Advanced 
Goan Toup 
Upper Limit minus Settle For .90 -82 
pba Limit minus Settle for Range .93 -86 
ect over Settle For +94 -85 
Affect Upper Limit minus Affect <91 -80 


Settle For 


= All of the correlations are significant beyond the .01 level. 
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TABLE 2 
TOTAL n ACHTEVEMENT SCORES FOR THE Ho) 
Success, Mipvie, AND FEAR oF FAILURE G] 
(Fı = Between groups/within groups; Fa = Mi 
and combined HS, FF group/within groups; Fy = 
FF and combined HS, Middle 
group/within groups) 


Introductory Group 
Group 
N , Mean n N , Meann 
Achievement Score Achievement § 
HS 20 6.15 15 
Middle 28 5.21 12 
FF 24 7.35 14 
Totals 72 6.18 41 
Fi, Fa, Fs-ns 
TABLE 3 


TOTAL MA Scare Scores ror THE Hore or Suc 
MIDDLE, AND FEAR or FAILURE Groups 

(Fı = Between groups/within groups; Fs = M 
and combined HS, FF group/within groups; Fa = 

and combined HS, Middle group/within gro 


Group 
N 
HS 20 12.85 
Middle 28 13.11 12 
FF 24 15.42 14 
Fi, ns F, ns | Fi, ns 
Fans Fz, ns 


(1956) were also made on the present data. 
ever, the relationships between the positi 
negative n Achievement subscores for the 
middle, and FF groups, and the median 
of HS-FF in relation to frequency of positi 
negative categories which were presented in 
et al.’s Tables 4 and 5 have been eliminated 
none of the comparisons proved statisticall: 
nificant. 


Discussion 


The intercorrelations obtained among th 
dices of the level-of-aspiration questionnaire, 
though slightly higher, closely parallel tho 
Clark et al. (1956). These results are not too 
prising since individuals within both groups 
quite homogeneous and, moreover, the very nal 
of the correlated items themselves tends to 
some expected degree of relationship. 

It might be noted from Table 2 that both 
of protocols, although similar in length and 
by the same judges, differed markedly in 
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mean n Achievement scores. However, the TAT 
scores for the introductory psychology students 
were significantly higher than for éhe more ad- 
vanced psychology students. Any of several fac- 
tors may have been responsible for the difference. 
First, the sets of protocols, although scored inde- 
pendently by the same two judges, were scored at 
different times. However, the high degree of cor- 
relation between their ratings makes this alterna- 
tive seem doubtful. More probably, the nature of 
the two groups may have contributed to the result. 
We may assume the more advanced students to 
be more sophisticated and therefore more guarded 
in their answers. 

Contrary to Clark et al.’s (1956) data and to 
the early work summarized in McClelland et al. 
(1953), the present results do not reveal any 
relationship between the HS-FF continuum and n 
Achievement scores. The fact that the pictures 
used in the present study were roughly equivalent 
to those in the earliest studies strongly suggests 
that the different results among the various studies 
are probably due to variables other than or in addi- 
tion to the pictures used. Thus the present results 
lend strong support to the statement by Clark 
et al. (1956) that “the level of n Achievement 
score cannot be used as an index of the FF-HS 
continuum” (p. 186). 

The MA scale was administered in an attempt 
to resolve the issue between Clark et al. (1956) and 
Atkinson (1958) concerning the genotypic vs. 
phenotypic status of the level-of-aspiration ques- 
tionnaire and McClelland’s TAT test, as measures 
of motivation. Analysis of the relationship between 
MA scale scores and the HS-FF continuum tend 
to support the position of Clark et al. that the HS 
and FF groups, as measured by a level-of-aspira- 
tion questionnaire, differ in level of anxiety. It was 
found that the higher the hope of success, the less 
the anxiety; and conversely, the greater the fear 
of failure, the greater the anxiety. Therefore 
Atkinson’s view that the FF and HS groups are 
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only phenotypically differentiated and both are 
characterized by underlying basic fear of failure is 
not supported by these results. 


SUMMARY 


Previous research has raised a question as to 
whether McClelland’s TAT test of n Achievement 
measures genotypic or phenotypic motivation. The 
present investigation measured level-of-aspiration, 
manifest anxiety, and need for achievement in 
college students in an attempt to resolve the issue. 
The major findings follow: 

1, Students in introductory psychology classes 
made significantly higher n Achievement scores 
than students in more advanced psychology 
classes, 

2, There were no differences in n Achievement 
among groups characterized as being hopeful of 
success, middle, or fearful of failure. 

3. For students in advanced psychology classes, 
those fearful of failure showed significantly more 
manifest anxiety than those in the middle or hope 
of success groups combined. 

Generally these results showed no relationship 
between the HS-FF continuum and TAT scores, 
introduced level of sophistication of the subjects 
as a variable that seems to affect n Achievement 
level, and did not support the view that McClel- 
land’s TAT test of need for achievement measures 
basic or genotypic motivation. 
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THE RELATIONSHIP BETWEEN ANXIETY AND ORAL WORD 
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ARASON (1959) has recently presented evidence 

that subjects differing in anxiety level respond 
differentially to a group Word Association test. 
His highly anxious subjects tended to respond with 
greater variability to the same stimulus items on 
their second administration and tended to give a 
greater number of rare responses, according to the 
Minnesota norms (Russell & Jenkins, 1954), than 
did subjects lower in anxiety. These trends were 
markedly increased under conditions of ego threat 
in which the subject was informed that the Word 
Association test was a sensitive measure of per- 
sonality. Sarason concluded that these results 
support the habit or interfering-response theory of 
anxiety rather than a drive theory (Spence, 1958; 
Taylor, 1958). 

Before one can accept such a conclusion, how- 
ever, it is necessary to determine whether these 
findings are not also congruent with a drive theory. 
In the theory explicated recently by Taylor (1958), 
the relationship between anxiety and performance 
is conditional upon the structure of the subject’s 
habit hierarchy upon entering the experimental 
situation. If the response to be learned (or emitted, 
as in the Word Association test) is strong in the 
hierarchy and involves few competing responses, 
high anxious (HA) subjects should perform more 
rapidly or more efficiently than low anxious (LA) 
subjects. On the other hand, if there is no strong 
dominant response and many competing response 
tendencies of approximately equal strength, HA 
subjects should perform more slowly and less 
efficiently than their LA counterparts. 

Both of these predictions are based on the Hul- 
lian notion that drive acts indiscriminately to 
raise the reaction potential of all habits in the 
hierarchy. Following this line of reasoning, it is 
difficult, without some prior knowledge of the 
subject’s hierarchy of responses to the stimulus 
words on a Word Association test, to make any 
prediction from drive theory discriminable from 
one deduced from a habit or interference theory. 
If the words used by Sarason (1959) elicited a 
number of competing verbal responses with none 
dominant over the others, then his finding of 
greater discrepancy in responses from one adminis- 


1 This study was carried out on a grant from the 
Faculty Research Committee at UCLA. The author 
would like to express his appreciation to Samuel 
Himmelfarb for running subjects and analyzing the 
data. 
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ASSOCIATION PERFORMANCE! 
} 


tration of the test to the others is in keeping with a 
drive theory of anxiety. Even the enhancing effects 
of experimental instructions could be accounted 
for by drive theory if such instructions are viewed 
as raising the subject’s drive level. If, on the other 
hand, the words used by Sarason consisted of 
stimulus items with strongly associated responses 
(i.e., Black-White) and only minimally competing 
responses, then Sarason’s results support a habit 
theory of anxiety. 

The present study is concerned with examining 
the relationship between anxiety and word associa- 
tion responses with the subject’s habit hierarchy 
taken into consideration. Laffal (1955) has sug- 
gested that the number of different responses (D) 
given to a stimulus word by an aggregate of sub- 
jects may provide some crude statistical estimate 
of the habit hierarchy for each individual subject. 
For example, the word Boy tends to elicit few dif- 
ferent responses in a group of subjects; most re- 
spond with Girl. According to Laffal, we can infer 
from such group data that the stimulus Boy is 
associated with a single, dominant response for 
most subjects and that there are few competing 
responses present. On the other hand, the word 
Flatter tends to elicit almost as many different 
responses as there are subjects responding to it. 
This kind of stimulus word has no dominant re- 
sponse associated with it, but evokes a number of 
competing responses of low strength in each sub- 
ject. Laffal marshals some impressive data on 
reaction times and number of reproduction faults 
to support his inference that group observations of 
the number of different responses reflect individual 
habit hierarchies. According to drive theory, the 
relationship between anxiety and word association 
performance is a direct function of the D value of 
the stimulus word (number of different responses 
stimulated by the word). For low D words, high 
levels of anxiety should result in shorter reaction 
times, fewer reproduction disturbances, and fewer 
rare responses than lower levels of anxiety. For 
high D words, the reverse prediction applies. 
the drive theory of anxiety is correct, we should 
find a significant interaction between anxiety an 
D level of the words, whereas a habit or inter- 
ference theory predicts a significant main effect 
due to anxiety (regardless of the D level of the 
words) and no significant interaction. The pre 
study attempted to examine these differenti 
predictions. 


f 
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A second and less important aspect of this 
study involves the oral administration in individual 
sessions of word association material. Although 
group forms of such tests are valuable for studying 
verbal habits, the investigation of personality 
factors in verbal behavior requires proper atten- 
tion to the possibility that untimed group forms 
may primarily reflect the subject’s pace of working 
and his tendency towards censoring his responses 
as much as it does his verbal habits. For a more 
critical test of the drive vs. habit theories, the 
individual, orally-administered method was used 
in the present study. 


METHOD 


Subjects. Subjects were students in the introductory 
psychology course at UCLA who volunteered as part 
of their course requirement. Each subject was admin- 
istered the 50-item form of the Heineman (1953) 
forced-choice version of the Taylor scale of anxiety 
(HMA scale) in class, From the total distribution of 
scores, 20 HA and 20 LA subjects were selected. Since a 
preliminary analysis of the scores revealed a 
significant sex difference, subjects were chosen 
from the distribution of their own sex. Each 
anxiety group contained 10 males and 10 fe- 
males. HA males scored 20 or above on the HMA 
scale and HA females 24 or above; LA males scored 
10 or below on the HMA scale, while LA females 
scored 7 or below. 

Procedure. Approximately one month after the 
administration of the HMA scale, the classes were told 
that a random sample had been drawn, and the sampled 
people were asked to participate in an experiment. 
So far as could be determined, no subject was aware of 
a connection between his HMA scale score and his selec- 
tion for the experiment. These subjects were given a 99- 
item Word Association test similar to that used by 
Laffal (1952), the only change being that 12 words were 
Substituted for homophones in the Laffal list. The 
subject was instructed to respond to each stimulus 
word with the first word which came to mind, On the 
second administration, he was asked to give the same 
word which he gave on the first administration. From 
the total of 99 words, 10 were selected as high, and 10 
as low D words on the basis of the number of different 
responses given by the total number of 40 subjects. 
Because this step involved the creation of norms on 
the very sample to which they were to be applied, a 
check was made at the stability of the D values by 
correlating those obtained here with the values for all 
overlapping words in the Laffal study. A correlation of 
.75 was found for the 87 words, suggesting considerable 
Stability from sample to sample. 


RESULTS 


Reaction time data. Each subject’s reaction time 
to the first administration of the Word Association 
test was recorded to the nearest tenth of a second. 
In order to correct these scores for skewness, a 
reciprocal transformation was applied to them 
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TABLE 1 
MEANS AND STANDARD DEVIATIONS OF RECIPROCAL 
OF REACTION TIMES BY ANXIETY AND D LEVEL 
OF THE ST™ULUS WORD 


High D Words Low D Words 
M SD M SD 
High Anxiety 
Males 455 -069  — .907 194 
- Females .550 .072 906. 133 
Low Anxiety 
Males -560  .131 1.020 211 
Females -580  .162 957 113 


(Edwards, 1950). Table 1 presents the means and 
standard deviations of the raw reaction times for 
HA and LA subjects as a function of the D level 
of the stimulus word. The trend for both D levels 
is for HA subjects to react more slowly than the 
LA subjects. Analysis of variance of the reciprocal 
reaction times revealed the anxiety variance to be 
nonsignificant (F = 3.42, .10 < p >.05), the 
variance for D level to be highly significant (F = 
25.25, <.001), and the Anxiety X D interaction 
to be lacking in statistical significance. 

Reproduction faults. A count was made of the 
proportion of times the subject failed to give the 
same response to the same stimulus word on the 
second administration. Table 2 presents the means 
and standard deviations of these reproduction 
discrepancy scores by anxiety and D level. Analysis 
of variance yielded significance only for D level 
(FE = 95.01, p <.001). The Anxiety X D interac- 
tion was not significant, and an inspection of 
Table 2 reveals that LA subjects tended to react 
with a larger number of reproduction faults to the 
high D words, contrary to both a drive or a habit 
interference theory. 

Communality of response words, Sarason (1959) 
notes that anxiety level relates positively to the 
number of rare responses emitted. No norms 
comparable to those used by Sarason were avail- 
able, but an analogous method of measuring com- 
munality of response was used, taking the D level 
of a word as an index, If a word tends to be re- 
sponded to with a small number of dominant 
responses by the total group (a low D word), then 
we can determine whether HA subjects give more 
different responses from LA subjects. The same 
analysis can be done for high D words. The means 
and standard deviations of the proportion of dif- 
ferent responses given by each anxiety group are 
presented in Table 3. With 10 subjects in each 
subgroup, it was possible for the group to give 
from 0-10 different responses to each stimulus 
word. Analysis of variance failed to reveal any 
significant differences except for D level (F = 
365.55, p < .001, for 1 and 72 df). 
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TABLE 2 
MEANS AND STANDARD DEVIATIONS OF THE PER- 
CENTAGE OF REPRODUCTION FAULTS BY ANXIETY 
AND D LEVEL OF THE STMULUS WORDS 


High D Words Low D Words 
M SD M SD 
High Anxiety 
Males 3:2 -98 1.3 -90 
Females 3.3 1.19 8 fhe} 
Low Anxiety 
Males 3.6 1.74 6 +66 
Females 3.5 2.06 8 68 
TABLE 3 


Means AND STANDARD DEVIATIONS OF THE PROPOR- 
TION OF DIFFERENT RESPONSES GIVEN TO EACH 
SrmuLUs WORD BY ANXIETY AND D LEVEL oF 
THE StimuLus WorD 


High D Words Low D Words 
“u SD “u SD 
High Anxiety 
Males a 077 .28 141 
Females 91 070 37 149 
Low Anxiety 
Males 90 118 29 +145 
Females 87 118 .32 .166 


Analysis over all 99 words. Although not as 
important theoretically, the same analysis was 
done over all 99 words on the word association 
task to test whether Sarason’s (1959) results are 
tied to the group administered, untimed version 
of the Word Association test. It is possible that 
Sarason’s results on anxiety reflect factors such as 
censorship of responses or differences in the pace 
of responding more than interference with verbal 
habits. An analysis of reciprocal reaction times, 
done on average RT over all 99 words, revealed a 
similar trend for slower reaction times for HA 
subjects (F for anxiety level = 2.914, .10 < p> 
.05 for 1 and 95 df) to that reported above. An 
analysis of variance on the total number of re- 
production faults given by the subject to all 99 
words, a measure which Sarason found related to 
anxiety level, showed no significance for anxiety 
level. A final analysis was done on the equivalent 
of Sarason’s communality score described in the 
section above. This score, the number of different 
responses given to each stimulus word, also failed 
to reveal any relationship to anxiety (F = 1.1135, 
p > .10). These analyses fail to support Sarason’s 
findings in any respect. 


Discusston 


The present study, although failing to replicate 
Sarason’s (1959) findings on the group Word 
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Association test, do tend to support his conclusion 
concerning a habit interference theory of anxiety, 
The presencę of slower reaction times to words 
with either simple or complex habit hierarchies is 
not compatible with drive theory and could in- 
directly be derived from a theory which assumes 
greater interfering response tendencies in HA in- 
dividuals. This conclusion rests, however, on an 
assumption which warrants closer scrutiny. 

The assumption is that the group data which 
yield D scores for stimulus words accurately reflect 
individual habit hierarchies. Some support for this 
assumption is provided from the reaction time 
and reproduction fault data, which indicate that 
we are dealing with two distinct classes of words. 
High D words always have longer reaction times 
than low D words and are much more likely to 
elicit response faults than low D words. Both 
findings are compatible with the idea that high D 
words do not have any strong dominant response 
associated with them and that the responses which 
are associated with them are strongly competing 
ones. The rapid reaction times and infrequent 
response faults to low D words suggests the reverse 
conclusion about the habit hierarchy associated 
with these words. These conclusions are supported 
more directly by a study by Rosen and Russell 
(1957) in which it was found that chains of associ- 
ations given by individuals to repeated presenta- 
tions of the same stimulus word parallel the cul- 
tural frequencies collated from group data. The 
assumption of different habit hierarchies associated 
with high D and low D does seem to be a reasonable 
one. 

The failure to replicate Sarason’s (1959) find- 
ings, then, is difficult to interpret. The present 
study involved a different form of the Taylor scale 
and a different group of stimulus words, factors 
which could account for the discrepancy. In spite 
of these differences in technique, however, it does 
seem possible that the factor of censorship over 
responses could most reasonably explain Sarason's 
findings. If HA subjects are more threatened by 
any testing situation than LA subjects, then they 
may be more likely to inhibit their responses and 
to censor them more carefully. These attempts to 
censor or inhibit responses could interfere with the 
recall of the first response on the second adminis- 
tration of the test. From this point of view, the 
present results are not incompatible with Sarason Be 
It is probably more difficult to censor or inhibit 
responses on the oral type of Word Association 
test. Nevertheless, the trend toward longer reat- 
tion times for HA over LA subjects may reflect an 
attempt in that direction. With the group admin- 
istration, particularly under the untimed condition 
used by Sarason, such attempts at censorship 
could be more actively pursued, resulting 
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greater discrepancy scores and a greater number 
of rare responses for HA subjects. 

The results reported here sugge8t that under 
certain conditions, HA subjects emit more inter- 
fering response tendencies than LA subjects. But 
there are several studies, in which the drive theory 
of anxiety is clearly substantiated (Spence, Farber, 
& McFann, 1956; Taylor, 1958), which seriously 
question the validity of the dichotomous view in 
which the Taylor scale either measures drive level 
or interfering response tendencies. It probably 
measures both reaction tendencies, and the diffi- 
culty posed here and elsewhere is that there is no 
way of anticipating on a theoretical level when the 
interfering responses are prominent and when the 
enhanced drive is more likely to influence per- 
formance. Until a more explicit statement of the 
telation between drive level (D) and responses 
learned to the drive stimuli (sp) is forthcoming, 
the predicted relationships between anxiety and 
performance will be frequently in error. 


SUMMARY 


The present study was concerned with the oral 
word association responses of subjects varying in 
T anxiety level. Two classes of words, believed to 
1 vary in the simplicity of their associated habit 
hierarchies (low D and high D words), were as- 
sumed to interact differently with anxiety level. 
T The results were as follows: 
` 1. HA subjects tended to give longer reaction 
times in the Word Association test, independent 
of the D value of the word. 

2. No relationship between anxiety level and 
_ humber of reproduction faults or number of rare 
Tesponses was found, contradicting earlier findings 
‘by Sarason (1959), 

3. Over all 99 words, constituting the total 
Word Association test, it was found that HA sub- 
Jects had longer reaction times than LA subjects. 

The results were interpreted as supporting a 
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habit interference view of anxiety, although it was 
suggested that a dichotomous view of the drive vs. 
habit issue is not necessary. Sarason’s (1959) 
earlier findings were interpreted as being due more 
to response censorship than to genuine differences 
in verbal behavior. 
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Analyses of the sociometric measures indicated 
that there was no significant Group effect; there 
were significant sex and grade differences, but 
these were not consistent between different meas- 
ures. 


Summary AND CONCLUSIONS 


Lipsitt’s (1958) Self-Concept and Ideal-Self 
scales and two Sociometric scales were given to 
fourth, sixth, and eighth grade subjects. 

1. Acceptance of others, acceptance by others, 
and acceptance by best friends were curvilinearly 
related to self-concept scores, with highest accept- 
ance in a group with moderate self-concept scores 
and lowest in a group with low self-concept scores. 

2. The trends were not significantly different 
in different grades or sexes, 

3. Acceptance by others was more strongly re- 
lated to self-concept than was acceptance by best 
friends. 

4, The sociometric measures were not signifi- 
cantly related to the discrepancy between ideal- 
self and self-concept scores, suggesting that the 
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discrepancy score obtained by subtraction may not 
be a valid measure of self-acceptance. 
xp 
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[HE assumption that people tend to de- 
fend their beliefs by avoiding exposure 
tocontrary points of view hasbeen widely 
accepted (Festinger, 1957; Hovland, 1954; 
Janis, 1957; Klapper, 1949, 1957-58). Experi- 
mental support for this view is not as impres- 
sive as one might wish, considering the extent 
that it has been relied on for theoretical pur- 
poses, but several studies—summarized by 
Festinger (1957, Ch. 6, 7) or inspired by his 
dissonance theory (Brodbeck, 1956; Cohen, 
Brehm, & Latané, 1959; Ehrlich, Guttman, 
Schönbach, & Mills, 1957; Mills, Aronson, & 
Robinson, 1959)—have provided some experi- 
mental corroboration. Since a person’s prac- 
tice of defending his beliefs by avoidance 
should tend to leave him unmotivated and in- 
experienced in developing supporting argu- 
ments or in refuting counterarguments, it is 
not surprising to find that while beliefs prove 
highly vulnerable in situations of forced ex- 
posure, they tend to remain relatively un- 
affected by persuasion campaigns that allow 
self-selected exposure (Hovland, 1959). 
Besides suggesting a reason for the vulner- 
ability of beliefs, the assumption of selective 
exposure has implications regarding the rela- 
tive effectiveness of various techniques for 
mmunizing beliefs in advance against forced 
exposure to massive doses of counterargu- 
. Ments. Resistance to such counterarguments 
would be analogous to the resistance to disease 
of a person brought up in an aseptic environ- 
ment, It has been found (Lumsdaine & Janis, 
1953; McGuire & Papageorgis, 1961), in keep- 
ing with this analogy, that a person’s belief is 
More effectively immunized against a later 
Massive dose of counterarguments by a re- 
Sistance-stimulating pre-exposure to the coun- 
terarguments, weakened by explicit refutation, 
than by prior familiarization with arguments 
Supporting the belief. 
The earlier study by the present authors 
1 This research w: t from the 
National Science Aiea, of Social Sciences. 
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demonstrated heightened resistance to subse- 
quent strong forms of the same counterargu- 
ments as those to which the person had been 
pre-exposed. The present study tests the hy- 
pothesis that such pre-exposure to weakened 
counterarguments produces resistance to sub- 
sequent strong forms of alternative counter- 
arguments as well. There are at least two 
reasons for predicting the generalized immuni- 
zation. First, the experience of having seen 
some counterarguments against the belief 
effectively refuted may lower the impressive- 
ness even of alternative counterarguments. 
Second, a person’s pre-exposure experience 
may make him more aware that his belief is 
indeed vulnerable to attack and therefore 
motivate him to develop supporting arguments 
that make his belief more resistant even to 
alternative counterarguments presented later. 
A secondary purpose of the study was to test 
the validity of each of these two conjectured 
mechanisms of the hypothesized generalized 
immunity effect. 


METHOD 


General Procedure 


The experiment was presented to the subjects as an 
attempt to obtain standardization data for a newly 
developed test of sci c information and aptitude 
with emphasis on verbal skills. Each subject took part 
in two half-hour experimental sessions one week apart. 
The first session was designed to expose the subject to 
an immunizing treatment defending a cultural truism, 
and the second, to strong counterarguments attacking 
this (or another) truism. 

First session. In the first, or “immunizing,” session 
each subject was required to read an immunizing mes- 
sage refuting a pair of counterarguments against a 
truism. The subject was allowed 6 minutes in which to 
read the 500-550-word refutational message and to 
select and underline the crucial clause in each para- 

Our main interest was implied to be in how the 
subject performed at this selecting and underlining task. 
At the end of the first session, the subject was asked to 
complete an attitude questionnaire measuring 22 beliefs 
including the experimentally crucial ones. The attitude 
scales are described more fully below. The belief scores 
obtained on the immunized issue during this first session 
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furnished a measure of the direct effect of the “im- 
munizing” treatment itself, prior to the strong counter- 
argument message which was presented in the second 
session. It was stressed that the subject was to indicate 
his own belief and not necessarily what the essay as- 
signed to him might have argued. Subjects were told 
that these opinions were being measured because an 
incidental purpose of the study was to determine if a 
person’s skill in reading comprehension was affected 
by his beliefs regarding the material that he read. 
Upon completion of this session, the subject was asked 
to refrain from any discussion of the test and to return 
the following week. 

‘Second session. One week later, the subject took part 
in the second session which included the presentation 
of the message containing strong counterarguments 
against his belief. To test the main experimental hy- 
pothesis (that the prior refutation of counterarguments 
tends to immunize the beliefs not only against the 
specific counterarguments refuted, but also against 
other counterarguments directed at the same belief) 
some subjects received strong versions of the same 
counterarguments that had been refuted in the previous 
session, while other subjects received strong forms of 
previously unmentioned alternative counterarguments. 
The subject was again allowed 6 minutes in which to 
read strong counterarguing messages (again between 
500 and 550 words in length) and to select and under- 
line the crucial clause in each paragraph, his perform- 
ance on this underlining task once more being repre- 
sented as our main interest. He then filled out again the 
opinion questionnaire used in the first session and was 
told to indicate his present personal belief, regardless 
of what had been argued in any of the passages that 
he had read. The beliefs that the subject indicated on 
the second administration provided the measure of the 
relative effectiveness of the strong counterarguments 
on immunized and nonimmunized beliefs. 

After this task, the subject was asked to list all the 
arguments that came to mind in favor of the critical 
truisms. In addition, he was asked to rate the strong 
counterargument message he had just read in the second 
session with respect to several evaluative dimensions 
(the validity of its conclusion, the amount of evidence 
presented, etc.). The purpose of these two tasks was to 
test the conjectured mechanisms of the generalized 
immunity effect. N 

Finally, the subject was asked to answer five ques- 
tions designed to determine the extent to which he had 
guessed the true purpose of the experiment, had dis- 
cussed the experiment with others, and felt that the 
time allowed for the various tasks had been adequate. 

At the end of the experiment, considerable effort 
was made to inform the subject of the nature of the 
experiment and of the deceits used. Particular stress 
was put on the fact that both immunization and coun- 
terargument messages were constructed for propaganda 
purposes and that no weight should be given to any 
argument simply because it had been included in one 
of these propaganda messages. 


Materials 


Issues. Two issues dealing with public health were 
selected (on the basis of data from a prestudy) for 
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extremeness and homogeneity of precommunication 
beliefs within the student population being sampled. 
The beliefs selected were the following: “Everyone 
should get a chest X ray each year in order to detect 
any possible tuberculosis symptoms at an early stage” 
and “Everyone should brush his teeth after every meal 
if at all possible.” 

The subjects indicated their beliefs about these 
statements by marking an “X” in the appropriate 
space on a 15-interval graphic scale labeled periodically 
from “Definitely false” on the extreme left to “Def- 
initely true” on the extreme right. In computing the 
results reported below, the intervals were scored 1, 2, 
..., 14, 15, going from left (false) to right (true). The 
preliminary data showed that the mean belief on each 
of the two issues was over 13, with the mode at 15 on 
the 15-point scale. 

Messages. Immunizing and strong counterargument 
messages? were prepared on each belief. The immuniz- 
ing messages consisted of three paragraphs: the first 
mentioning the two “misleading” counterarguments 
against the belief and the second and third containing 
refutations of these two counterarguments. Two im- 
munizing messages were prepared on each belief, each 
refuting a different pair of counterarguments, so that 
for some subjects the belief could be subsequently 
attacked by strong forms of the previously refuted 
counterarguments and for other subjects, by strong 
forms of new counterarguments. 

The strong counterargument messages were similar 
in structure to the immunizing messages. Each con- 
tained three paragraphs, the first of which briefly de- 
scribed two counterarguments against the belief, and 
the following two paragraphs strongly elaborated these 
counterarguments with specific evidence for their 
validity. Here again to implement the experimental 
design, there were two strong counterargument mes- 
sages on each issue, each message elaborating an 
alternative pair of counterarguments. All the messages, 
both immunizing and strong counterarguing, were be- 
tween 500 and 550 words in length. 

Design 

There was a total of 16 experimental conditions 
representing all possible combinations of the four im- 
munizing messages in the first session and the four 
strong counterarguing messages in the second. All these 
combinations were required to test the effect of some 
experimental treatment, to counterbalance material, or 
to obtain control data. Since some of the conditions 
entered into more frequent or more crucial comparisons 
than others, the subjects were not distributed in equal 
numbers over the conditions (see Table 1). In addition 
to conditions required for counterbalancing the 
materials, the design provided final opinion scores after 
each of five different hypothesis-relevant treatments: 


2The complete text of the eight communications 
used in this study has been deposited with the American 
Documentation Institute. Order Document No. 6769 
from ADI Auxiliary Publications Project, Photodupli- 
cation Service, Library of Congress; Washington 25, 
D.C., remitting in advance $1.25 for microfilm oF 
$1.25 for photocopies. Make checks payable to: Chief, 
Photoduplication Service, Library of Congress. 
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immunization, then presentation of the strong forms 
of the same counterarguments; immunization, then 
presentation of different strong counterarguments; 
immunization only; strong counterarguments only; and 
neither immunization nor counterarguments. 


Subjects 1 


A total of 73 subjects, all students enrolled in various 
undergraduate courses in the summer session at the 
University of Illinois, took part in both sessions of the 
experiment, Data from 9 additional subjects who took 
part in the first (immunization) session but did not 
appear for the second (strong counterargument) ses- 
sion are omitted from the analysis. 


RESULTS AND DISCUSSION 
Adequacy of the Experimental Conditions 


Post-experimental questionnaire results. An 
attempt was made to disguise the inductions 
of opinion change as a test of verbal skills. 
In response to a final question concerning what 
else the test might have been used to measure, 
16 of the 73 subjects gave responses indicating 
that they might possibly have perceived that 
the experiment involved persuasion. Actually, 
fewer subjects may have been suspicious at the 
time the opinions were being measured, since 
between the opinion measure and this final 
question there intervened the requirements of 
listing favorable arguments and rating the 
quality of the strong counterargument mes- 
Sage, tasks that may well have increased the 
perception of persuasive intent. At any rate, 
the data from these 16 “suspicious” subjects 
show no reliable differences from those ob- 
tained from the remaining 57 subjects. 

The responses on the post-experimental 
questionnaire also indicated good compliance 
with the instructions not to discuss the test 
between sessions. Only 8 subjects admitted 
any discussion and this was largely of an ir- 
relevant type such as telling others how long 
the session lasted. The subjects also indicated 
that the time was adequate for all parts of the 
test, except for the section requiring the list- 
ing of all the supporting arguments they could 
think of about the truisms; 10 subjects com- 
Plained that the allowed time for this section 
(5 minutes) was inadequate. 

Adequacy of the opinion items. The control 
data indicate that the initial beliefs on the 
two critical issues were, as intended, quite 
extreme in the sample. The first session mean 
belief score in the no immunization condition 
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was 13.29 on the 15-point scale for the two 
crucial issues. The submeans for the tooth 
brushing and chest X ray issues were, respec- 
tively, 14.03 and 12.53, and the modal belief 
was 15 for each issue. In the control condition 
where neither immunization nor strong coun- 
terarguments were communicated on a given 
issue, there was a slight decline in belief from 
first to second session, the 13.29 mean of the 
first session declining to 13.23 in the second, a 
fluctuation that falls far short of conventionally 
accepted levels of statistical significance. 

To measure the reliability of the type of 
opinion scale used in this study, one opinion 
item was repeated twice on different pages of 
the questionnaire. This item dealt with the 
contagiousness of mental illness and was se- 
lected as the reliability check because a pre- 
vious study (McGuire & Papageorgis, 1961) 
found that it behaved very similarly to the 
two crucial issues used in this study. Since the 
check item appeared twice in each session, 
each subject responded to it four times, The 
two means on the 15-point scale in the first 
session were 13.51 and 13.67; in the second, 
13.51 and 13.63, none of which differences 
remotely approach conventional levels of 
statistical significance. The test-retest coeffi- 
cient of correlation in the first session was 0.81 
and in the second 0.89. The higher reliability 
in the second administration (.10 > p > .05, 
based on a test of difference between z trans- 
formations) may reflect a practice effect that 
reduced random errors in responding. 

Effectiveness of the strong counterarguments. 
The two beliefs, despite their high initial ac- 
ceptance, proved quite vulnerable to the strong 
counterarguments in the second session. 
Where strong counterarguments were not pre- 
ceded by first session immunization, the be- 
liefs were reduced to a mean of 5.73 (see Table 
1), representing a drop of 7.50 points from the 
13.23 second session mean in the control group 
where neither immunization nor strong coun- 
terarguments were given (p < .001). As 
pointed out earlier, this high vulnerability of 
the beliefs in the absence of preparatory treat- 
ments can be explained by assuming that the 
subject normally minimized cognitive con- 
flict by avoiding exposure of his beliefs to 
counterarguments, and thus is unprepared to 
deal with them when exposure is enforced by 
the experimental conditions. 
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TABLE 1 
BELIEFS BEFORE AND AFTER EXPOSURE TO IMMUNIZING AND/OR STRONG CoUNTERARGUMENT MES 
(Numbers in the cells show mean belief strength on a 15-point scale, 19 indicating strong adherence to 
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ure to Refuted 


Immunizing Immunizing Pre-expo- 
Coi lowed by E; Control—Ni 
ema SENE eah Sees Fem cv MR neime tea 
0 e n- 
immunization Same Counter- SAt a Siepie 
First = om — (a) 14.3 
(73)* 
Second (©) 5.73 (d) 9.25 (e) 8.70 6) 14.15 
(26) (20) (27) (26) 


Note,—Significance levels 


ns; d vs. g, p < .001; e vs. g, p < .001. 


a Numbers in parentheses give number of cases on which the cell mean is based. 


Effects of the Prior Defensive Treatments 


Direct effect. The immediate effect of the re- 
futational message (prior to the presentation 
of the strong counterarguing message) has 
particular interest because of the possibility of 
a “boomerang” effect. A priori, such a defense 
might actually weaken the belief since it in- 
volves acquainting the subject with some 
novel counterarguments (albeit refuted) 
against a belief he had regarded as a truism, 
beyond attack. The results indicate that the 
direct effect of the refutational defense is, on 
the contrary, to strengthen the belief: the 
beliefs at the end of the first session averaged 
14.34 points in the refutational defense con- 
dition and 13.29 points in the control condi- 
tions that involved no immunization (Table 1). 
This direct gain of 1.05 points, small in ab- 
solute magnitude, is impressive when we con- 
sider the low “ceiling” under which it occurred. 
A gain of this magnitude represents a change 
of over 61% of the remaining scale distance 
and is significant beyond the .02 level. (Sig- 
nificance levels in this report are based on 
two-tailed analysis of variance tests.) Hence 
fears that pre-exposure to the refuted counter- 
arguments might boomerang seem unfounded. 

Immunizing effects. Aside from this direct 
strengthening effect, the refutational defense 
was found to increase considerably the person’s 
resistance to the subsequent strong counter- 
argument messages. The overall effect of the 
strong counterarguments in the prior immu- 
nization conditions was to reduce the beliefs 
to 8.94 points, representing a drop of 4.29 
points (p < .001) below the belief level in the 


of theoretically relevant comparisons (probability values shown are all based on analysis of variance | 
a two-tailed test): a vs. b, p = .02; a vs. f, ns; b vs, g, ns; CVS. d, p < 01; cvs. e, p < .05; cvs. f, p < 001; c vs. g, ? < «001; d vs. ¢ 


no immunization, no counterargument ¢ 
trol conditions. Prior immunization, whil 
from completely effective, left the beliefs 3.21 
points higher (p < .01) than the 5.73 poini 
level to which they were reduced when 
strong counterargument had not been | 
ceded by prior defense. The immuniza 
procedures, therefore, attenuated the effect 
the strong counterarguments by 43%. The 
direction of these differences, and even 
absolute magnitudes, follow closely 

found in the previous McGuire and Papag 
gis (1961) study. 

To test the main hypothesis that a 
eralized immunization effect was produc 
the pre-exposure, some subjects were presen! 
in the second session with strong forms of th 
same counterarguments that had been rel 
in the first session, while other subjects 
ceived strong forms of alternative coun 
arguments against the belief. The resistane 
developed against the same counterarg 
was slightly greater than that against 
alternative counterarguments, the beliefs 
both immunization and strong counte 
ments being 9.25 and 8.70, respectively, in 
two conditions (see Table 1). The 
of this difference (which does not app! 
the conventional levels of statistical sigi 
cance) indicates that the general 
developed by the pre-exposure is only slig 
if at all, less than the specific immunity. 


the alternative-counterargument 
tion conditions was significantly greater (0: 
p < .05) than the 5.73 level to which 
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counterarguments reduced the beliefs in the 
no immunization condition. In view of the 
triviality of this difference betwéen resistance 
to same and to alternate counterarguments, 
there would seem to be little necessity, in 
practice, to guess beforehand what counter- 
argument might be used at a future time 
against a belief we wish to immunize: the prior 
refutation of some counterarguments develops 
almost as much resistance to subsequent 
strong forms of alternative counterarguments 
as to the same counterarguments. 

A second test of the generalized immunity 
hypothesis is provided by additional data 
collected in the experiment. Besides the gen- 
eral opinion question, there were four addi- 
tional questions on each issue, each of the 
four measuring belief in one of the four coun- 
terarguments. Hence at the end of both ses- 
sions it was possible to measure not only the 
subject’s belief on the general issue, but his 
specific beliefs on the four counterarguments— 
the two he heard refuted as well as the two 
that had not been previously refuted. In con- 
ditions where the subject received neither 
immunization nor strong counterarguments 
on an issue, the mean tendency to disagree 
with the specific counterarguments was 10.62 
on the 15-point scale. When the subject re- 
ceived the strong counterarguments without 
any previous immunization on the given issue, 
the mean tendency to disagree with the coun- 
terargument was reduced to 6.41 (p < .001). 
When the strong counterarguments came 
after the same pair of counterarguments that 
had been refuted, the mean tendency to dis- 
agree was 8.46; and where the strong counter- 
arguments had been preceded by refutation 
of the alternative pair of counterarguments 
this mean was 8.60, both of which means are 
Significantly greater (p < .01) than the 6.41 
level in the no immunization conditions. Hence 
the prior experience of hearing some counter- 
arguments refuted actually produced (trivi- 
ally) more resistance—8.60 as compared to 
8.46—to subsequent strong forms of the al- 
ternative counterarguments than to the very 
Same counterarguments. These data on the 
agreement with the specific counterarguments 
corroborate the findings on the main issues 
as regards a highly generalized immunization. 

The generalizability of these results regard- 


479 


ing immunization against persuasion is limited 
by several considerations. Immunization has 
been demonstrated only with respect to spe- 
cific preselected beliefs and not across issues, 
Also, the persistence of the immunity con- 
ferred by pre-exposure to refuted counter- 
arguments remains to be investigated: the 
present study demonstrates only that appreci- 
able immunization remains in effect after one 
week. Furthermore, the results of this study 
were obtained on culturally accepted truisms, 
issues of a type for which previous defense by 
avoidance of exposure to counterarguments 
seems most probable. It is possible also that 
the results of this study, obtained on college 
students, may not be applicable to less “select” 
samples. Questions might also be raised regard- 
ing the independence of the two sets of counter- 
arguments in interpreting the finding of gen- 
eralized immunity. 


Hypothesized Mechanism for the Generalised 
Immunity Effect 


Counterargument ratings. One hypothesized 
mechanism of the generalized immunity effect 
was that pre-exposure during the first (im- 
munization) session to impressive refutations 
of some counterarguments would cause the 
subject to be less impressed by any subse- 
quent counterarguments against the belief. 
To test whether the strong counterarguments 
presented in the second session did indeed seem 
less credible to subjects who had the experi- 
ence of witnessing refutations of alternative 
counterarguments against the same belief, 
additional ratings were obtained from the 
subjects. Each subject was asked to rate on a 
seven-point scale the quality and credibility 
of the counterargument message presented in 
the second session. There were 10 questions 
touching on different aspects of the counter- 
arguments so that the most favorable possible 
score was 70 and the least favorable 7. Without 
immunization, the mean rating was 53.69 
points; with immunization, 45.96 points. The 
difference between these two means is signifi- 
cant beyond the .01 level. In the immuniza- 
tion conditions the ratings of the alternate 
(not previously refuted) counterarguments 
were lowered to 46.52, which is almost as low 
as the 45.20 rating of the very counterargu- 
ments that had been refuted. (The difference 
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between these two means is not significant.) 
The ratings of the strong counterarguments 
became significantly lower (p < .05) than 
the 53.69 rating in the no prior immunization 
condition. Hence one way that immunization 
“generalizes” is by causing strong forms of 
counterarguments against a belief to appear 
less impressive to the subject after he has seen 
refutations of some counterarguments against 
the same belief, whether these strong counter- 
arguments are the same or different from those 
previously refuted. 

Number and quality of favorable arguments. 
A further mechanism hypothesized to account 
for the generalized immunity effect predicted 
that pre-exposure to the counterarguments in 
refuted forms would make the subject realize 
that his “truism” could be attacked and thus 
stimulate him to think up more arguments in 
support of his belief. At the end of the second 
session the subjects were asked to list argu- 
ments favoring the initial belief on each issue. 
With immunization, the mean number of 
arguments listed was 2.62, whereas without 
immunization the mean dropped to 2.32. This 
difference, although in the predicted direction, 
did not attain the conventional levels of sta- 
tistical significance (.10 < p < .20). The argu- 
ments listed by the subjects were also rated as 
to their quality on a 0-5 scale by an independ- 
ent rater, a high school teacher unfamiliar 
with the experiment. With immunization, the 
mean quality rating was 3.19 whereas in the 
controls it dropped to 3.01, another difference 
which though not significant is in the pre- 
dicted direction. The lack of significance may 
be partially due to a failure to allow the sub- 
jects sufficient time for listing the arguments. 
Since the specific time complaints made in the 
post-experimental questionnaire were directed 
exclusively against this part of the “test,” the 
lack of time may have allowed an excessive 
contribution to the total variance of individual 
difference variables unrelated to the experi- 
mental manipulation, such as intelligence, 
speed of writing, etc. 

The results of the experiment, therefore, 
give clear indication of a generalized immunity 
conferred by pre-exposure to some refuted 
counterarguments against the given belief. 
They also indicate that one mechanism of the 
generalized resistance is the tendency of pre- 
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exposure to reduce the subsequent impressive- 
ness even of new counterarguments against the 
belief. Not ruled out as an additional mech- 
anism is stimulation by pre-exposure of the 
development of supporting arguments. 


2 SUMMARY 


A previous study showed that strong initial 
beliefs are more effectively immunized against 
persuasion by pre-exposing them to counter- 
arguments weakened by refutation than by 
pre-exposure to supporting arguments. The 
present study tested the hypothesis that pre- 
exposure to refutations of some counterargu- 
ments against the belief would have a gener- 
alized immunization effect, making the beliefs 
more resistant to strong doses not only of the 
specific counterarguments refuted but also of 
alternative counterarguments against the given 
belief. Such a generalized effect was hypothe- 
sized to be the result of lowering the perceived 
credibility of later attacks against the belief 
and stimulating the person to think up sup- 
porting arguments for his belief. 

The 73 college student subjects participated 
in two experimental sessions, one devoted to 
pre-exposure to refuted counterarguments, 
and a second, one week later, to the presenta- 
tion of strong counterarguments. The design 
allowed subjects to be exposed to counter- 
arguments only, immunization only, or im- 
munization followed either by the identical ot 
alternate counterarguments. 

As expected, the beliefs proved highly vul- 
nerable to the strong counterarguments when 
there was no prior immunization. Immuniza- 
tion had a direct strengthening effect on the 
beliefs and also substantially reduced the effect 
of the subsequent strong counterarguments. 
Pre-exposure to refutations of some counter 
arguments produced significant resistance tO 
later strong forms of, not only the same coun- 
terarguments, but alternative counterargu- 
ments as well, in accord with the predicted 
effect of generalized immunity. Regarding the 
hypothesized mechanisms of such generalize 
immunity, the quality and credibility of the 
strong counterarguments were rated lower by 
subjects who had received immunization 
by those who had not, even where the strong 
counterarguments were different from those 
previously refuted. There was also a tendency, 
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short of statistical significance, for immunized 
subjects to be able to think up more support- 
ing arguments, and ones of higher quality, for 
their beliefs than nonimmunized subjects. 
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INCIDENTAL VERSUS FOCAL PRESENTATION OF 
DRIVE RELATED STIMULI: 


FRED PINE 
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me contribution made by consciousness 

to processes of stimulus intake and sub- 

sequent thought is of central importance 
to dynamic theories of thinking—witness the 
roles of conscious and nonconscious processes 
in creativity (cf. Kris, 1952) and the role of 
incidental stimuli in dream formation (Freud, 
1955). Much current research has demon- 
strated that stimuli with various, marginal 
relationships to consciousness have effects on 
thinking processes. These effects have been 
shown in studies of subliminal stimuli (Eagle, 
1959; Paul & Fisher, 1959; Smith, Spence, & 
Klein, 1959), of partially seen stimuli (Fisher, 
1957; Shevrin & Luborsky, 1958), and of 
incidental stimuli (Pine, 1960). Klein (1959) 
has extensively explored the implications of 
these studies for a theory of consciousness. 

Effects shown in studies of marginal stimuli 
have often been distinctively different from 
the more veridical effects of consciously re- 
ceived and adaptively relevant stimuli. The 
present study deals with the contrasting effects 
of incidental and focal stimuli upon subsequent 
thought processes. It follows from an earlier 
one (Pine, 1960) in which incidental stimuli 
appeared in subsequently produced stories in 
transformed ways while focal stimuli appeared 
in more direct ways. The stimulus passages 
used had been written to emphasize symbolic 
qualities: a description of a cow emphasizing 
its oral-passive aspect and description of a hook 
emphasizing its phallic-aggressive aspect. 
Transformed effects of the incidentally pre- 
sented cow passage included, for example, an 
increase in passive and nurturant interpersonal 
relationships. 

The role of the drive relatedness of incidental 
stimuli is of particular interest. Drive repre- 
sentations are the principal thought contents 
that undergo symbolization, and drives are 
principal activators of defensive and disguise 


1 This study is one of a series supported by a grant 
from NIMH (Project Code Number: M-782, Cognitive 
Processes below the Level of Awareness). Principal in- 
vestigators: George S. Klein and Robert R. Holt. 


processes. Ini psychoanalytic theory, the rela- 
tionship of day residues (including incidental 
stimuli) to unconscious impulses is critical to 
an understanding of their use in the dream 
(Freud, 1955). The present study is specifically 
concerned with variations in the effects of 
incidental and focal stimuli as a function of the 
drive relatedness of the stimuli. Both drive 
and nondrive stimuli were given under inci- 
dental and focal conditions and the effects in 
all cases compared. The issue explored was 
whether or not the results obtained earlier 
with symbolic drive stimuli (transformed effect 
of incidental and direct effect of focal) would 
be repeated with an explicit (nonsymbolic) 
drive stimulus and with a neutral stimulus. It 
was hoped that the findings would shed some 
light on the relative importance of drive re- 
latedness and of incidentalness in producing 
transformations of the stimulus content. 


METHOD 


Subjects (Ss) were 24 male undergraduates seen 
individually in one hour sessions. Each first wrote two 
stories. Then, two stimuli were presented simultane: 
ously: a “focal” one that S read and an “incidental” 
one, produced in an adjoining room, that was experi- 
enced by S as irrelevant “noise.” The stimuli were 
verbal descriptions of (a) the dependent tie of the male 
child to his mother (the “drive” stimulus) and (b) the 
workings of a large-scale industrial economy (the 
“neutral” stimulus). Each was incidental for half the 
Ss. Following this, Ss made up two more stories. 

An oral, passive, mother-child theme was selected 
for the drive stimulus because of its universal signifi 
cance in personality formation. It read as follows: 

Dependent upon his mother from the moment of 
his birth—this is a man’s infancy. Still often feeling 
helpless in adulthood—this is a man’s secret. Grow- 
ing up is not easy. At the very beginnings of life, 
all people receive the protecting, cradling, feeding 
love of their mother. You too long ago felt warm 
and secure in your mother’s arms. You too have 
sucked ravenously until your belly was full with 
soothing warm milk. You too have experienced the 
pleasures of relying completely upon your mother 
who fulfilled your every wish. And then you find 
that growing up is not easy. In spite of growiné 
older, the tie to mother remains. Somehow wishing 
to be mothered even by other women in your adult 
years. The little boy within, always hiding behind 
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the appearance of the growing up man. Feelings of 

helplessness, of aloneness. The wish to be taken care 

of, and trying to hide this wish. Often feeling like a 

little boy. Wanting mother. 

In the neutral stimulus, an attempt was made to 
minimize libidinal and aggressive drive implications. 
Though all such implications could not be fully elimi- 
nated, the stimulus used was relativelyefree of drive 
content. It read as follows: 

Machines have taken a place of enormous im- 
portance in the workings of present day civilizations. 
It is amazing to think of how much machines and 
their products enter our lives every day. From morn- 
ing to night, every person makes some use of the 
products of industry. For example, transportation 
by car, plane, or boat is taken for granted today. 
Communication by telephone, printing press, or 
radio has become a natural part of our lives. The 
worlds of entertainment, commerce, government, 
and science all are largely based upon machine 
products, To a great extent, modern industry and 
the present day economy require the continued use 
of machine products by the public. Employment for 
millions of people can be found in the many factories 
throughout the country which manufacture such 
products. The demand for new goods is a constant 
spur to invention. The economic wealth of a nation 
is indicated by its accomplishments in industry. 

The two stimuli differ in ways other than their 
drive relatedness since they were written to make the 
drive stimulus more forceful in general. The oral drive 
stimulus has more intrinsic interest. It describes a rela- 
tion of person to person while the other describes a 
relation of person to machine. There are also differ- 
ences in writing style. The drive vs. neutral difference 
is, however, certainly a prominent one and the other 
differences (interest, relationships) are in the main 
intrinsically related to it. 

The procedure was as follows: S and the experi- 
menter (E) met in an office where extraneous noise was 
minimized. S$ had been told only that the study had to 
do with “imagination and some other things.” To be- 
gin with, S was given a blank card and told to imagine 
a picture on it and to make up a story about the pic- 
ture. This was repeated for a second story, with the 
additional instruction that the story be different from 
the first one. Instructions were given to S in writing 
to eliminate any need for an inquiry, and no mention 
was made of including people in the stories (to avoid 
bias in favor of the drive stimulus). Stories were hand- 
written by S to a length of 14 page, as suggested by E. 

Following the storywriting, S was told that he 
would get a rest from imagination tasks while Æ got a 
measure of his “ordinary reading rate” when reading 

Just for comprehension.” Instructions were for S to 
tead a passage silently several times, announcing each 
time a reading was completed so that Æ could note 
the time. S was encouraged to read at a steady rate 
€ven as he got increasingly familiar with the material 
after the first reading; he was told that the aim of the 
study was to explore “consistency” in reading rate. 
S was given either the drive or the neutral passage to 
read. Ten seconds after he began reading, the other 
Passage was broadcast from an adjoining office. The 
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reading instructions succeeded in keeping Ss well fo- 
cused on the reading task even though the incidental 
stimulus was quite audible. Æ had S read the passage 
repeatedly until the broadcast of the incidental stimu- 
lus was completed; 18 Ss read the focal stimulus three 
times, 4 read it four times, and 2 read it five times.? 
After the reading, E briefly and casually reviewed the 
salient points of the passage for S (to maximize its 
focal quality). 

The incidental stimulus was tape recorded by a 
male voice (not E’s) and played at a constant volume; 
it could be heard in the experimental room at close to 
ordinary speaking level. The incidental stimulus took 
under 2 minutes to read, and it was read once. The 
various devices used to increase the likelihood that S 
would experience the incidental stimulus as irrelevant 
are described in Pine (1960). 

Following the reading task, § was told that he 
would now “return to the imagination experiment” 
and was instructed to write stories once again: Two 
more stories were obtained, under conditions identical 
with those described earlier; S was asked for a different 
story each time. 

Memory and free association material were next 
collected (tape recorded). S was asked to recall all he 
could of the material he had read. He was then given a 
recognition test consisting of 29 pairs of closely 
matched phrases, with one member of each pair hay- 
ing appeared in the passage; 5 checked off the one in 
each pair that he thought was correct. Next, S was 
given the focal passage to read through once more and 
was then asked to free associate to it (for approxi- 
mately one minute), saying anything that came to 
mind about the passage or about his own life. Finally, 
S was reminded of the interfering “noise” that he had 
heard earlier; he was then given the same recall, recog- 
nition, and free association procedures for the inci- 
dental stimulus. 

Generally, Ss were quite surprised when asked about 
the incidental stimulus; by and large they had consid- 
ered it irrelevant, and their memory for it was poor. 


RESULTS 


Memory 

Recall scores were assigned each S by divid- 
ing each stimulus into information units and 
scoring the number of units recalled. The 
recognition score was simply the number of 
phrases that were correctly identified from the 
29 pairs. Average scores and comparisons of 
recall and recognition for incidental and focal 
stimuli appear in Table 1. 

Recall and recognition of the incidental 
stimuli are generally low. Recall and recogni- 
tion of the focal stimulus are significantly 
higher in each comparison, but still not at a 
very high absolute level. This is partly because 


2 There was no relationship evident between num- 
ber of readings and nature of effects. 
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TABLE 1 
Memory For INCIDENTAL AND FOCAL STIMULI 
Drive Neutral 
Aces Focal ro Focal 
Recall: 
Total number 38 38 40 40 
units 
Average units a 9.8 0.8 9.3 
recalled®* 
Range 0-5.5 | 7.5-13 | 04.5 | 5-18 
t 10.83 7.20 
p 001 -001 
Recognition: 
Total number 29 29 29 29 
units 
Average units 15.6 | 23.5 18.4 | 20.7 
recognized® 
Range 9-21 | 18-26 | 15-21 | 15-25 
t 5.91 1.77 
pe -001 -05 
ee eee u 
® Full credit given for near perfect recall of a unit; }4 credit for 
partial recall. 
b One-tailed tests. 


© The average recognition by chance alone would be 14.5 units 
in the forced-choice situation with 29 pairs of phrases. 


Ss were not pressed to give everything they 
could recall on the free recall, and because, in 
the recognition tests, the matched pairs of 
sentences were so similar that confusions were 
made even though the passages had been read 
focally? 


Stimulus Effects 


All stories were studied intensively in an 
attempt to develop scoring categories appro- 
priate to the raw data. Z went through half of 
the stories noting all possible derivatives of 
either the drive or the neutral stimulus 
(whether a single word or a broad the 1e). In- 
dications of direct as well as transformed 
emergence of stimulus content were recorded. 
The derivatives were then grouped into the 
categories that seemed best to describe them 
(separately for the two stimuli). Finally, Æ 
rated the other half of the stories by placing 
possible derivatives directly into the already 
formed categories. 

The rater knew neither which stories were 
pre and which post nor which stimulus was 
incidental and which focal. (Ss had written out 


8 The memory scores showed no consistent relation- 
ship to the stimulus effects to be described below. 
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TABLE 2 
Carecory Times AND RATINGS For Errecrs or 
THE DRIVE STIMULUS 


————— 


nciden- | Focale 
Catagory Titles and Groups 

2 kA kA 

ElE\ElE 


Direct: 

1. Significant stimulus words used in stories | 14 | 6 |16| 13 
with initial meaning retained (e.g., “he 
grew older”) 

2, Assorted specifics from the stimulus used | 42 | 37 | 39] 71 
directly (e.g., eating, birth, social dating, 
tie to parents, etc.) 

3. Childhood (and oral) objects and activi- | 8| 8| 6] 11 
ties (e.g., dollplay) 


Total 64 | 51] 61 | 95 


Middle: 

1. Phrases parallel to (not identical with) | 2 | 6| 2 

stimulus phrases and used in similar way 

(e.g., “he enjoyed himself greatly’’— 
from, pleasures of infancy theme) 

2. Oral aggression (e.g., “lion hunters heard | 1| 1 1 

a roar”) 

3. General theme: personality (e.g., story of | 1] 2] 0 

psychology lecture) 

4, Egocentrism: world centered on self (e.g, | 4| 2| 0 

daydream—self as hero) 

5. Miscellaneous: related to various stimu- |101 | 92 103 | 94 

Jus qualities in various ways 

DE eS en a 


Total 109 |103 |106 | 99 


Indirect: 

1. Significant stimulus words used in stories | 2 3| 3 

with altered meaning (e.g., “full of life” 
—irom, belly full) 

2, Phrases parallel to (not identical with) | 2] 2| £ 
stimulus phrases and used in altered ways 
(e.g, “he was a hard worker’’—from, 
growing up is not easy theme) 

3. Oral words used with nonoral meanings | 1 
(e.g., rock and roll) 

4. Condensations: story words combining | 0| 1] £ 
two or more stimulus words or concepts 
(eg, a “little women” story—from, 
young and female) 

5, Puns: story words viewed as plays on 6) 7|7 
stimulus words (e.g., “he felt a minor hes- 
itation’’) 

6. Slips: errors in story writing in which 4) 2) 1 
stimulus content comes through (c.g., 
“he couldn’t loose the game’’—from, 
theme of inability to break tie to mother) 

7. Negations and reversals of stimulus con- | 15 11 
tent (e.g., “it was a cold and uncomfortable 
day”) 

8. Symbols (of mother, of birth) (e-8., diving 
into water) 

9. Concrete representations of oral acts or | 9 3 
feelings (nonuniversal symbols) (¢.8-, “he 
was attached to his seat and couldn’t 


36 | 24 | 24 


move”) 
SSeS 
Total 7s | oo |47| 5 
LP coe 
Grand total zas (244 [214 | 22? 


® Total number of ratings per category for 12 Ss. 
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stories on precoded pages that were shuffled, 
typed, and re-coded before being seen by any 
rater.) Thus, at the time stimulus “deriva- 
tives” were selected, the rater had no way of 
knowing whether in fact these were in pre or 
post stories; his global guesses of which stimu- 
lus had been incidental were at a chance level. 
Sample reliability studies were carried out by 
two separate raters—each one rating half the 
stories for a single category. One of the cate- 
gories selected included relatively direct 
stimulus effects (reference to social dating—a 
derivative of the drive stimulus) and the other 
(oral symbols) included relatively indirect 
effects. With 24 Ss, rho between the ratings 
by Æ and the reliability rater for the dating 
category was .76 and for the symbol category 
was .79. Brief category titles and some ex- 
amples are presented in Tables 2 and 3. 
Although grouped in the tables, the “direct” 
and “indirect” categories are conceived of as 
extremes of a continuum. Direct categories are 
defined by the use of stimulus material in the 
stories in much the same form and content as 
it appeared in the stimulus; the content is 
such that a layman could readily recognize the 
link to the stimulus. Indirect categories were 
formed and grouped on the basis of clinical 
impression guided by psychoanalytic theory of 
the forms of thinking in the primary process 
(Freud, 1955). They all reflect transformations 
of the stimulus content based upon fluid 
associations, word play, and processes of 
symbolization.4 The middle categories, a re- 
sidual group, were those which did not fall as 
clearly into one or another of the extreme 
groups. The categories are not mutually ex- 
clusive. A single story could get ratings for 
direct and indirect categories for both stimuli. 
In the following, four sets of findings are 
presented, deriving from: (a) incidental drive 
stimulus, (b) incidental neutral stimulus, (c) 
focal drive stimulus, and (d) focal neutral 
Stimulus. Since each S got two stimuli, the 
analyses are not based on four independent 
groups, but on two. Nonparametric tests (two- 


4 “Direct” and “indirect” for the drive stimulus 
Were not defined in terms of the directness of expres- 
sion of oral content, but in terms of the directness of 
expression of stimulus content—oral or not. Linguis- 
tic, as well as drive, links to the stimuli were rated. 
Later exploratory analyses in terms of the handling 
Ka as dependent drive content per se showed no re- 
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TABLE 3 
CATEGORY TITLES AND RATINGS FoR EFFECTS OF 
THE NEUTRAL STIMULUS 


'Inciden- 
talë 
Category Titles and Groups 


Post 
Pre 
Posi 


g 
À 


Direct: 

1. Significant stimulus words used in stories | 4| 4| 0| 6 

with initial meaning retained (e.g., “he 
went to an employment office”) 

2, Assorted specifics from the stimulus used | 83 | 74 | 62 | 72 
directly (e.g., transportation, work, com- 
munication) 

3. Extensions of assorted specifics from the | 32 | 21 | 26 | 24 
stimulus—extensions consistent with the 
initial content (e.g., taxicabs rather than 
transportation) 

4. Commentary on the machine age (e.g, | 0| 0} 0 1 
Brave New World theme) 


Total 119 | 99 | 88 | 103 


Middle: 
1, Phrases parallel to (not identical with) | 1| 6| 4 5 
stimulus phrases and used in similar way 
(e.g., “everybody was working”’—from, 
employment-for-millons theme) 

2, Personalization: stimulus content usedin | 0| 2] 5 
self-reference context (e.g., “I see myself 
as a scientist”) 

3. Miscellaneous: related to various stimu- | 9 | 5 | 24] 17 
lus qualities in various ways 


Total 10 | 13 | 33 | 26 


Indirect: 
1. Significant stimulus words used in stories | 0| 0| 0 0 
with altered meaning 

2. Phrases parallel to (not identical with) | 1] 0j 2 0 
stimulus phrases and used in altered ways 
(e.g., “the warrior thrust his sword””— 
from, spur-to-invention) 

3. Altered use of specific stimulus content| 8| 7| 11] 12 
suggesting out-of-context associations 
(e.g, story of baseball team—from, 
many-people theme) 

4, Puns: story words viewed as plays on | 1| 7| 2 5 
stimulus words (e.g, “time spent in 
school’’—from, role of consumer) 

5. Slips: errors in story writing in which | 3| 0| 0 
stimulus content comes through (e.g., 
“the fish brakes water”’—from, machine 
and transportation themes) 

6. Negations and reversals of stimulus con- | 7| 2] 3| 6 
tent (e.g., a medieval setting—from, pres- 
ent-day theme) 

7. Concrete representations: symbol-like | 0| 0] 1 0 
representations of the stimulus content 


co 


Total 20 | 16 | 19 | 23 


Grand total 149 |128 |140 | 152 


à Total number of ratings per category for 12 Ss, 


tailed) were used to study group differences 
and pre-post differences in the stories. Com- 
parisons were made for the over-all groupings 
of direct and indirect categories and for the 
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total of all categories. Tables 2 and 3 give the 
total numbers of ratings for all categories and 
category groupings. Individual scores, from 
which rank orders were formed, were derived 
from each S’s number of ratings in the cate- 
gories. 

Incidental drive stimulus. In a previous 
study (Pine, 1960), it was found that an inci- 
dental symbolic drive stimulus emerged in 
indirect ways in subsequent stories. Effects of 
an incidental explicit (nonsymbolic) drive 
stimulus were studied here. The total number 
of ratings on the two pre and the two post- 
stories were computed for each S and ranked 
for the direct, the indirect, and the total drive 
stimulus categories. Pre to post changes were 
then compared by means of the signed rank 
test (Siegel, 1956). Instead of the expected 
increase in indirect derivatives, there was a 
significant decrease in the total number of 
derivatives (p: = .04). Though the decrease 
does not appear in every category (see Table 
2), it appears in the total for each subgroup 
of categories (the subtotals alone being in no 
case statistically significant). Difference scores 
(changes from pre to post stories) for the 
drive stimulus derivatives were also com- 
puted for each S$ and ranked; Ss who had 
received the drive stimulus incidentally were 
compared to Ss who had received it focally. A 
Mann-Whitney test (Siegel, 1956) showed the 
difference for the total number of derivatives 
to be significant (p = .04), with a decrease in 
the drive incidental group and an increase in 
the drive focal group.® 

Incidental neutral stimulus. To ascertain 
whether or not the neutral stimulus produced 
an effect, an analysis exactly parallel to the 
above was carried out (excepting that cate- 
gories for the neutral stimulus were used). No 
significant pre-post differences were obtained 
on direct, indirect, or total categories for the 
group of Ss that received the neutral stimulus 
incidentally; nor were there any significant 


5 The decrease is not a result of changes in story 
length, which remained constant for the pre- and post- 
stories (by Z’s instructions to S). Nor is it a result of a 
shift away from explicit oral content (which did not 
change) towards content related to the neutral stimu- 
lus (which also did not change). The change was pre- 
dominantly in terms of language usage (see Footnote 
4), which decreased in its relatedness to the drive stim- 
ulus in the poststories as compared to the baseline of 
the prestories. 
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differences between groups of Ss (neutral-in- 
cidental vs. neutral-focal). There were fewer 
derivatives rated in total for the neutral than 
for the drive stimulus (compare Tables 2 and 
3); this reflects the ease with which a clinically 
oriented rater finds oral-related content in a 
TAT-like story, and is not a true stimulus 
effect. 

Focal drive stimulus. In the earlier study 
(Pine, 1960), a focal symbolic drive stimulus 
seemed to emerge in direct ways in subsequent 
stories. Present findings with the focal explicit 
drive stimulus are similar. There was an in- 
crease in direct derivatives of the drive stimu- 
lus from the pre to the poststories in the group 
of Ss that received the drive stimulus focally 
(signed rank test; p = .054). Difference scores 
(pre to post changes) across groups (drive focal 
vs. drive incidental) also showed a significant 
difference (Mann-Whitney test; p = .04) with 
an increase in direct derivatives in the drive 
focal group and a decrease in the drive inci- 
dental group. No other changes from the pre 
to the poststories for the Ss in the drive focal 
group were statistically significant. 

The question arose early whether explicit 
drive stimuli, if threatening to S, would un- 
dergo some disguise as a function of defense 
even when focally presented. With this in 
mind, an oral-passive drive stimulus had been 
selected as one that would be likely to be 
threatening for the Ss used—a group of col- 
lege freshmen and sophomores presumably in 
the midst of an adolescent dependence-inde- 
pendence crisis. Though disguise processes 
(e.g., transformation of derivatives) were not 
evident, the data do suggest some defensive 
operations. The free associations to the drive 
stimulus were rated for anxiety; two raters, 
neither one aware of the data on stimulus 
effects, carried out the ratings jointly. The 
rank order for anxiety correlated inversely, 
though not significantly, with effects for the 
Ss in the drive focal group; rho (V = 12) was 
—.25 with direct effects, —.41 with indirect 
effects, and —.47 with total effects. The trend 
is, the more anxiety, the less of any effect. ih 
addition, it was striking that most of the in- 


®Free associations to the drive stimulus suggest 
that it was indeed somewhat threatening; the associa- 
tions varied, not from pleasant to unpleasant, but 
rather from effective to ineffective handling of €351- 
tially unpleasant content. 
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crease in direct derivatives from pre to post- 
stories reflected the cast of characters in the 
drive stimulus (women, boys—not necessarily 
ina relationship to each other) and the adult 
relationships suggested in the stimulus (seeking 
adult ties—marriage, dating); there, was almost 
no increase in (more threatening) direct deriv- 
atives concerning the dependent relationship 
of the infant to its mother. 

Focal neutral stimulus. Few if any effects of 
a focal nondrive stimulus were expected to 
emerge in the stories (since unthreatening 
material in full awareness is ordinarily kept 
from spilling over onto other thought prod- 
ucts). Analysis of derivatives showed no sig- 
nificant differences in pre-to-post or in cross- 
group comparisons. 

Since the rating categories used were empiri- 
cal ones and in no sense exhaustive, there 
remains a possibility that other categories 
would have shown different results. This 
problem is especially true for derivatives of the 
neutral stimulus; less theory is available to 
provide guidelines for the formation of “in- 
direct” categories there. Comparison of Tables 
2 and 3 indicates that there were in fact many 
fewer indirect derivatives rated for the neutral 
than for the drive stimulus, and many more 
direct derivatives. The data were analyzed in 
sufficient detail so that one can feel relatively 
certain that no gross group effects were missed. 
Nevertheless, one can neither rule out nor 
confirm the possibility that both stimuli, in 
either focal or incidental conditions, had a 
variety of idiosyncratic effects based on stimu- 
lus-personality interactions not controlled in 
the design, 


Discussion 


In an initial study of the contrasting effects 
of incidental and focal stimuli (Pine, 1960), 
Tesults indicated an increase in transformed 
derivatives of the incidental symbolic drive 
Stimulus in subsequently produced stories. 
Bach (1960), using a quite dissimilar design, 
Teports the same essential finding. In his 
Study, the words “room” and “water” pre- 
Sented subliminally or incidentally led to 
mother-related word associations—an effect 
quite distant from the original stimulus words 
but in accord with their conceptual value as 
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mother symbols.” Given the phenomenon of 
transformed effects of symbolic drive stimuli, 
one of the most curious findings of the present 
study is the decrease in all derivatives with an 
explicit drive stimulus. 

This result is in line with other studies that 
point up the significance of stimulus differences 
(notably symbolic vs. explicit drive) for the 
phenomenon. Klein, Spence, Holt, and Goure- 
vitch (1958), comparing subliminal stimuli of 
symbolic and explicit drive content, found 
group effects of the symbolic stimulus (the 
response options did not differentiate between 
direct and transformed effects) but a more 
variable pattern (individual consistency but 
no group effects) for the explicit drive stimulus. 
Eagle (1959), using an explicit drive stimulus 
presented subliminally, found stimulus effects 
moving both towards and away from the 
stimulus content. Thus, it may be that the 
effects of an explicit drive stimulus are more 
variable than the (typically transformed) ef- 
fects of symbolic drive stimuli. 

This variation of effects and the symbol- 
nonsymbol differences mesh in an interesting 
way with a variety of recent suggestions 
(Fisher & Paul, 1959; Klein & Holt, 1960; 
Pine, 1960) that marginal stimuli are governed 
more by the laws of primary process (non- 
logical, drive dominated thought) and focal 
stimuli by the laws of secondary process (ra- 
tional, adult thought—cf. Freud, 1955). Sym- 
bolization disguises drive content so that its 
threat is not felt (as in the dream). On the 
fate of threatening thought content in the 
psychic apparatus, Freud (1955) writes: “As 
a result of the unpleasure principle, then, the 
first y-system [primary process] is totally in- 
capable of bringing anything disagreeable into 
the context of its thoughts. It is unable to do 
anything but wish” (p. 600). He goes on to 
say that this makes much material that is 
necessary for coping with reality unavailable 
to thought. The secondary process develops as 
a way of making threatening thought contents 
available so that adaptive behavior can be 
maximally effective. Yet, “the second system 

7 The present writer has just repeated this finding 
in still a different design. The word “cow,” presented 
subliminally to a group of Ss, led to an increase in 
mother-related, oral-passive content in subsequent 
images—but nof to an increase in images related to 
cows, farm animals, food animals, or the like. 
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[secondary process] can only cathect an idea if 
it is in a position to inhibit any development 
of unpleasure that may proceed from it” (p. 
601). 

We can hypothesize that a “disagreeable” 
drive stimulus, incidentally presented, is ex- 
cluded from thought (reduced derivatives in 
the stories) in accord with the pleasure 
principle that guides the primary process 
stream of thought. This contrasts with the 
fate of incidental symbolic drive stimuli which 
are utilized in subsequent thought products 
(stories), symbolized drive representations 
typically being available in primary process 
thought products (¢.g., the dream), Since the 
assumption is that focal stimuli tend to be 
governed by the laws of the secondary process, 
it is striking that Ss who had the drive stimu- 
lus focally gave stimulus derivatives that “in- 
hibited the unpleasure” arising from the dis- 
turbing thought content. That is, they did not 
use the more threatening infantile dependent 
material, but instead used much material re- 
lated to social dating—for this sample, a kind 
of content that is congruent with the interests 
and activities of their peers, its threat, thus, 
reduced by its very normality. The hypothesis 
can be tested empirically that threatening 
stimuli, marginally presented, lead to effects 
that include inhibition, 

An essential problem in studies of effects of 
marginal stimuli is the role of consciousness in 
determining the course a stimulus takes in the 
mental apparatus (cf. Klein & Holt, 1960). A 
stimulus is “put in” and “comes out” of the 
person without his full awareness, intake and 
output being couched in and, thus, hidden by 
other intake and output processes going on 
simultaneously. For the drive stimulus at least, 
the effects obtained repeat a central aspect of 
results found earlier (Pine, 1960)—that inci- 
dental and focal stimuli work differently. The 
focal stimulus led to an expression of direct 
derivatives (as found earlier) and the incidental 
stimulus led to an inhibition of a wide range of 
derivatives (both direct and transformed). 
These data suggest that differences in the 
course stimuli follow in the psychic apparatus 
stem from differences in their mode of intake. 

Recent studies suggest that differences arise 
from variations on the response side as well. 
Thus, a linked pair of studies by Goldberg and 
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Barthol (1960), show effects of sub m 
stimuli on imaginative processes (image 
story production) but not on Bie 


kind of nlite is clearly superior for 
focal stimuli; but this difference does not ci 
over into more imaginative thinking (s 
creating). It may be that stimuli that are ta 
in in a way that evades conscious attent 
and adaptive relevance follow a meandei 
course in the mental apparatus, and 
imaginative response tasks are more sensi 
to picking up the end products of this 
dering course than are response tasks 
emphasize realistic thinking. 
It was hoped that the study would illuminate 
the respective roles of incidentalness and 
drive relatedness in determining effects o 
cidental stimuli. Zuckerman (1960) 
effects of neutral stimuli, subliminally 
sented. At first glance, present results u 
seem to contradict Zuckerman’s; since | n 
incidental stimulus had an effect when i it 


be concluded that drive related. a 
stimuli stand a better chance of aff 
thinking. 

Two important methodological consi 
tions modify this conclusion, however, 
raise interesting questions in themselves. í 
first stems from the fact of simultaneous pi 
tation of the focal and incidental stimuli. 
drive stimulus is not only more drive related 
but also more interesting than the new 
stimulus. The lack of effect of the n 
stimulus presented incidentally may 
function of Ss’ simultaneous attention 
more absorbing focal stimulus; in contrast, 
incidental drive stimulus was given in | 
context of attention to a relatively more b 
stimulus.’ Activities going on in full av 
at the time when incidental stimuli impi 
S may, thus, be partial determinants 6 
effect of the incidental stimuli. To 
hypothesis, one would have to pi 
incidental neutral stimulus along with 
interesting focal stimulus. 


SI am indebted to George S. Klein for | 
out this problem. 


+ 
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The second qualification comes from the 
role of the congruence (Eagle, 1958) between 
the response task and the effécts to be ob- 
served. Congruence of the response task with 
the stimulus was perhaps greater for the drive 
than for the neutral stimulus; TAT-like stories 
are better potential carriers of dtive content 
than of industrial themes (witness the smaller 
number of ratings for the neutral stimulus). 
This observation parallels the suggestion raised 
earlier that the response task must be sensi- 
tive to the kind of effects expected. 


SUMMARY 


The present study continues previous re- 
search on the contrasting effects of incidental 
and focal stimuli upon subsequent thinking. 
‘The significance of stimulus content (notably 
drive-nondrive) for these contrasting effects 
was specifically explored. The larger theoretical 
context grows from concern with the role of 
consciousness and attention in determining 
the course a stimulus takes in the mental 
apparatus. 

Twenty-four male Ss were seen individually 
for a study of “imagination.” Each told four 
Stories to a blank card—two pre-experimental 
(control) and two post-experimental. The in- 
tervening experimental condition was the 
simultaneous presentation of two stimuli. One, 
the “focal” stimulus, was a written passage 
that S was asked to read as a measure of his 
“ordinary reading speed.” The other, the 
“incidental” stimulus, was a similar passage 
broadcast through the wall from an adjoining 
room and characteristically experienced by S 
as irrelevant “noise.” One was drive 
related (the dependent tie of infant to mother) 
and the other was more neutral (a description 
of an industrialized economy). Each was in- 
cidental for half the Ss (while the other was 
simultaneously focal). 

Main effects were determined by compari- 
Sons of the pre and the poststories. Effects were 
Categorized in terms of their direct or trans- 
formed relation to the initial stimulus. The 
incidentally presented drive stimulus produced 
an inhibition of direct and transformed con- 
tent that was related to the stimulus, while 
the same stimulus focally presented led to an 
increase in specific kinds of direct stimulus 


| N Content. The neutral stimulus produced no 


n 


observable effects in either the incidental or 
the focal conditions, 

The results were discussed in the light of 
current studies of the often unique effects of 
marginal stimuli (incidental and subliminal) 
upon contrasting kinds of thought (reality- 
attuned vs. imaginative), Suggestions regard- 
ing the contrasting nature of the effects of 
incidental and focal stimuli, and the signifi- 
cance of the drive relatedness of incidental 
stimuli, were drawn from the psychoanalytic 
theory of primary and secondary process 
thinking. 
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MPIRICAL relationships between recogni- 
tion threshold and such properties of 
G the stimuli as prior familiarity or 
 affectivity have been well-established, as well 
as the interaction of these stimulus variables 
_ with motivational variables or subjects’ (Ss’) 
E personality characteristics (Jenkins, 1957; 
Postman, 1953). The usual interpretation of 
_ these data has been in terms of perception. 
_ hat is, differences in threshold between 
f groups of Ss or types of stimulus material are 
i _ attributed to the relative ease with which the 
= stimuli are “seen,” or in the case of auditory 
= presentation, “heard.” These results have been 
_used in an attempt to extend our knowledge of 
“perceptual processes and, by many investi- 
gators, to support explanations of personality 
‘phenomena based on perceptual theory. 
Recently Goldiamond (1958) has challenged 
the perceptual interpretation of many of these 
_ Studies on both methodological and empirical 
grounds. He points out that the performance 
_ Measure obtained from the recognition thresh- 
old technique in which S is required (by the 
method of ascending limits or constant stimuli) 
to identify a stimulus under perceptually in- 
adequate conditions is particularly susceptible 
__ to the influence of variables extraneous to the 
= Properties of the presented stimulus. Particu- 
s larly important are what Goldiamond terms 
‘response biases”; i.e., under conditions in 
which a stimulus is only partially perceived, 
‘Ss are more likely to guess or say certain 
‘Tesponses than others due to past learning, 
Motivational factors, the experimenter’s (E’s) 
instructions, etc. 

A restatement of the Goldiamond (1958) 
Position, as it is understood by the present 
Writers, may be made via the following ex- 
amples. If S were presented, at a suprathresh- 
old level, with a stimulus pattern such as C-T 
and instructed to report what he saw, he 
would report C-T; the verbal responses elicited 
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by these particular instructions, it could be 
agreed, indicate or reflect S’s perceptions. The 
S could also be asked to fill in the missing 
letter by saying the first complete word that 
occurred to him (e.g., CAT). One would not 
conclude in this instance that S’s behavior 
reflected his perception (e.g., that he saw CAT 
rather than C-T). It is quite reasonable to 
assume that in this type of situation, such 
variables as the prior frequency of usage of the 
various response alternatives, their affective 
connotations, Ss’ motivation and set, etc. in- 
fluence which response alternative is elicited. 
Were these empirical relationships to be ob- 
tained, one would not conclude that they 
constitute evidence that these variables de- 
termine perception. Rather they would be 
interpreted in other terms. 

The technique of obtaining recognition 
threshold by the ascending method of limits 
contains elements of both of the procedures 
described above, one of which elicited be- 
havior that can be agreed to reflect perception 
and the other of which does not. In the typical 
experiment S is asked to report after each 
exposure not merely (or sometimes, not even) 
what he sees but also to guess at what the 
word is on the basis of whatever cues are 
available to him. Obviously, S’s verbal re- 
sponses at subthreshold levels of exposure are 
determined to some degree by his perceptions, 
i.e. they are determined by the immediate 
properties of the stimulus. But it is also well- 
known that recognition of complex stimulus 
patterns like words is not all-or-none (Bricker 
& Chapanis, 1953); partial perceptions of in- 
creasing degrees of completeness and accuracy 
occur as exposure values increase toward supra- 
threshold levels. When S guesses at a whole 
word on the basis of a partial perception (thus 
paralleling the suprathreshold study described 
above) it is difficult to maintain that S’s verbal 
behavior indicates only what he sees or per- 
ceives. 

With respect to frequency of prior usage 
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several studies indicate that this variable does 
influence Ss’ verbal responses when partial 
perceptions are known and held constant. In 
two of these (Goldiamond, 1959 unpublished; 
Goldiamond & Hawkins, 1958) Ss were shown 
blank slides that allegedly contained stimulus 
items. “Thresholds” were established by count- 
ing the number of exposures necessary for S to 
guess the response E had previously decided 
upon as being “correct.” These pseudothresh- 
olds were found to be related to prior familiar- 
ity with the stimulus materials in the same 
manner as in threshold studies in which 
stimuli were actually presented (e.g., Howes & 
Solomon, 1951; Solomon & Postman, 1952). 
A similar phenomenon has also been reported 
by Zeitlin (1953) who manipulated the rein- 
forcement history of the stimuli. The results 
of such studies strongly suggest that the em- 
pirical relationship between recognition thresh- 
old and at least one variable, frequency of 
prior usage, is due at least in part to guessing 
preferences. At lower exposure values that 
provide only minimal information (and thus 
permit a number of response alternatives) S 
will tend to say (guess) familiar words more 
often than less familiar ones. This phenomenon 
has been reported by McGinnies, Comer, and 
Lacey (1952) and by Rosenzweig and Post- 
man (1957). Should the actual stimulus item 
happen to correspond to his guess, S’s “thresh- 
old” is low. Should the stimulus happen to be a 
less familiar word, additional exposures, which 
provide additional information that narrows 
the range of possible responses, are required 
before the correct response is elicited. 

In the present investigation an attempt was 
made to show a relationship between frequency 
of prior usage and verbal report, using a pro- 
cedure better able than the recognition thresh- 
old method to minimize the biasing effects of 
guessing preferences. The experimental ar- 
rangement was a modification of the forced- 
choice psychophysical technique employed by 
Blackwell (1952) and Goldiamond (1954) As 
employed in the present study, S was pre- 


2The forced-choice technique has been used by 
several investigators (Blum, 1954; Dulaney, 1957; 
Nelson, 1955) in studies of the affective nature of the 
stimulus materials. In whole or in part, however, these 
investigators required S to indicate the location of the 
“most outstanding” stimulus, a procedure which 
Goldiamond (1958) contends is subject to the same 
criticism as the ascending method of limits. 
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sented with several words simultaneously and 
required to indicate the spatial position of one 
of the words ‘vhich had previously been identi- 
fied by E. This method restricts severely the 
verbal responses that are elicited; bias due to 
any differential preference of one response 
(spatial position) over others can be minimized 
by assigning high and low frequency words 
equally to all spatial positions. 


Exrerment I 


Method 


Subjects. Undergraduate psychology students at 
Northwestern University were tested in groups with 
the forced-choice method. The study was replicated 
three times, the groups containing 17, 18, and 19 Ss, 
respectively. Four additional Ss were tested but their 
records were excluded because they failed to reach a 
performance criterion (a minimum of 75% correct re- 
sponses at the highest illumination level). 

Apparatus and materials. Eight pairs of six-letter 
nouns (a total of 16 individual words) were selected, 
four pairs with high frequency of usage and four with 
low, as indicated, respectively, by Thorndike-Lorge 
(1944) counts of 100 or more and 7 or less per million. 
Within each pair, each word began with a consonant 
and no common letter appeared in the same position, 
An attempt was also made to equate the frequency 
with which individual letters appeared in the high and 
low frequency lists, 

Slides were prepared for the word pairs in which 
one word appeared directly above the other. Two such 
slides were made for each pair, the spatial position of 
the words being reversed. The slides were projected 
onto a screen at a constant .10 seconds duration inter- 
val and at six different levels of illumination. Illumina- 
tion was varied by having the power supply (110 volts 
60 cycle ac) for the Keystone slide projector, which 
contained a 1000 watt lamp, pass through an Ohmite 
rheostat. 

Procedure. Ss were tested as a group in a semidark- 
ened room, being seated 10 to 18 feet from the screen, 
in rows of five. Ninety-six stimulus presentations were 
given by the method of constant stimuli. Prior to each 
exposure one of the pair of words was announced by 
E and Ss were required to indicate on printed data 
sheets the spatial position, top or bottom, in which 
the given word appeared. The pairs were presented at 6 
illumination levels, Ss being asked to locate each of 
the 16 words once at each level. Each individual word 
to be identified appeared equally often in the two 
spatial positions, as did the 16 different words within 
each illumination level. The order of the 96 exposures 
was random with respect to order of word, word fre- 
quency, illumination level, and correct spatial position. 


Resulis and Discussion 


For each group separately, the mean percent- 
age of correct responses for high and low 
frequency words at each illumination level was 
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first obtained. Inspection of these data revealed 
that a full psychophysical function! was ob- 
tained in each group; further the curves for 
the high and low frequency were essentially 
parallel. For this reason, the data chosen for 
statistical comparison were the means of per- 
centage of total correct responses for the two 
types of words. As may be seen in Table 1, 
the means of the high and low frequency lists 
are essentially equal for one of the groups but 
greater for the high frequency words in the 
other two. Matched-pairs ts computed for 
each of the groups failed to reach the .05 level 
of significance although that for Group 3 fell 
just below it. A similar ¢ for all groups com- 
bined yielded a value of 1.81 (.05 < p < .10). 
Thus there was no clear-cut evidence that the 
positions of high frequency words were more 
often correctly identified than those for low 
frequency words. 

Although the two lists of words employed 
in the present study differed widely in Thorn- 
dike-Lorge (1944) word count, it could be 
argued that all of the words were sufficiently 
familiar to college students to prevent the 
appearance of differences in perceptibility. To 
check this possibility recognition thresholds 
were determined tachistoscopically for the 16 
individual words by the usual ascending 
method of limits. The results obtained from 
15 Ss revealed highly significant differences 
between the high and low frequency lists (¢ = 
5.15, b < .001). Thus the failure to find sig- 
nificant differences with the forced-choice 
technique could not be attributed to the 
relative frequency levels of the stimuli. 

A second factor that should be considered in 
evaluating these results is the arrangement of 
the word pairs with respect to common letters. 
Since some decision had to be made as to the 


TABLE 1 
Means or PERCENTAGE TOTAL Correct RESPONSES 
AND } VALUES FOR THE HicH AND Low 
FREQUENCY Lists 


(Experiment I) 
Group 1 Group 2 Group 3 
eao Wee (N = 19) (N = 18) 
M o M o M o 


High | 69.36] 9.90 | 77.85| 11.39] 74.31| 13.10 
Low | 69.73| 9.65 | 76.54] 23.12] 70.25| 14.60 
t -16 1.03 2.01 


493 


number of common letters in the same position 
in each word that would be permitted, the 
least arbitrary number, zero, was chosen. In 
effect, then, S could be correct in his spatial 
identification if he perceived correctly the 
position of one letter in either of the two 
words. Although Ss were not informed of this 
arrangement, many undoubtedly based a 
number of their responses on extremely limited 
information. Since the frequency of individual 
letters was held constant between the two 
lists of words, it is perhaps not surprising that 
only minimal differences were found between 
high and low frequency words. 

Were the number of stimuli increased, the 
probability of the occurrence of correct re- 
sponses based on such limited information 
would decrease sharply. An additional experi- 
ment was therefore performed, one in which 
Ss were presented with sets of four high or low 
frequency words. 


EXPERMENT II 


Method 


Subjects. Ss were 66 students enrolled in introduc- 
tory psychology classes at Northwestern University. 
The Ss were randomly divided into two equal groups, 
each being given a different but parallel set of stimu- 
lus materials. Thus, in effect, the experiment was 
replicated twice. All Ss were tested individually. 

Materials and apparatus. Two lists of 16 five-letter 
nouns were prepared, both according to the same rules. 
Each list contained eight high frequency and eight 
low frequency words, as defined by Thorndike-Lorge 
counts of 100 or more and 4 or less per million, respec- 
tively. Each list was further subdivided into two sets 
of four high or four low frequency words. Within each 
set, each word began with a consonant and no com- 
mon letter appeared in the same position. The fre- 
quency with which individual letters appeared in the 
high and low frequency lists was equated as closely as 
possible. 

Slides of each of the word sets were prepared, the 
words appearing in capital letters within a 1” X 34” 
rectangle. Four different slides were made for each set 
by selecting one order of the four words (UR, LR, UL, 
LL) and rotating the position of the words for each of 
the other three orders so that each word appeared 
once in each spatial location. 

The slides were presented by means of a Gerbrands 
Mirror Tachistoscope. Illumination was held constant 
and the materials exposed at eight duration levels: 10, 
20, 30, 40, 50, 60, 80, and 100 milliseconds. Both expo- 
sure and pre-exposure fields were white and of equal 
illumination. A fixation point on the pre-exposure field 
marked the center of the rectangle in which the block 
of words occurred. 

Procedure. Each S was tested with only one of the 
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TABLE 2 


MEANS or PERCENTAGE TOTAL Correct RESPONSES 
AND $ VALUES FOR THE Hicu AND Low 
FREQUENCY Lists 


(Experiment IT) 


Group 1 Group 2 
Frequency 
M o M o 
High 72.58 15.53 75.90 13.91 
Low 66.09 13.73 72.16 13.03 
t 4.61 3.57 


two parallel sets of words, Ss being assigned alternately 
to each set as they appeared to be tested. Each S was 
given 128 stimulus presentations by the method of 
constant stimuli. Prior to the exposure of each slide, 
E announced one of the four words and required S to 
indicate the spatial position, UR, UL, LR, LL, in 
which the given word appeared. S was asked to iden- 
tify each of the 16 individual words once at each of 
the eight duration levels. Each individual word to be 
identified occurred equally often at each spatial posi- 
tion as did the 16 words within each duration level. 
The order of the 128 presentations was random, a 
different order being used for each S. 


Results 


An analysis of mean percentage of correct 
responses at each duration level indicated that 
the high frequency words were quite consist- 
ently located more accurately than the low 
frequency words in both groups. This superi- 
ority is reflected in the means of percentage 
of total correct responses shown in Table 2, 
The ¢ testing the significance of the difference 
between these means was significant in each 
group (p < .01). 


Discussion 


In conducting research on the recognition 
thresholds of verbal materials it is possible to 
have a purely empirical interest in isolating 
stimulus and subject variables that contribute 
to verbal behavior (or more exactly, influence 
the exposure value at which Ss’ verbal report 
is congruent with the presented stimulus). 
More frequently, however, investigators have 
used the data from such studies to support 
hypotheses about “perception,” an underlying 
state of the organism. Perception is thus con- 
ceived, implicitly or explicitly, as a hypo- 
thetical concept inferred from behavior rather 
than as a convenient term for labeling the 
behavior observed in a particular experimental 
setting. 
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Investigators who use the term “perception’ 
as an explanatory concept have been handi 
capped by laek of a generally accepted defini- 
tion of the concept. Typically (visual) percep- 
tion appears to refer in some manner to “what 
S sees,” although how this is to be inferred 
from behavior is left unspecified. Perhaps as a 
result, many investigators have appeared to _ 
assume, for all practical purposes, that the — 
systematic variance in verbal reports can be — 
attributed to variations in Ss’ “perception”; 
thus all variables shown to influence Ss’ b 
havior in the recognition threshold situation 
are almost automatically interpreted as sup- 
port for a hypothesis concerning perceptio 
The assumption of a 1-to-1 corresponde 
between behavior and perception has led to 
series of criticisms, each suggesting or demon- 
strating that a given set of empirical results 
could be attributed to nonperceptual factors. 
For example, the tabu word studies were ques- 
tioned because of the possibility of “responsi 
suppression”; Ss might see (perceive) tab 
words as easily or quickly as neutral words bui 
hesitate to say them, thus producing higher 
thresholds (Howe & Solomon, 1950). The “see 
vs. “say” issue is raised once more in Goldia- 
mond’s recent articles, Goldiamond pointing — 
to the particular susceptibility of the response 
used in the method of limits technique to the N 
influence of all sorts of variables that affect 
performance via nonperceptual processes. 

The forced-choice technique would appi 
to overcome Goldiamond’s methodologi 
criticisms of the method of limits. In 
present investigation evidence was obtain 
that indicated that word frequency is relat 
to verbal report even when response bias 
controlled. The survival of the empirical re 
tionship between performance and prior 
quency of usage with still another source 
extraneous variance reduced or eliminated may 
give added confidence to the perceptual inter- 
pretation. But even in the forced-choice — 
situation S is still perceiving partial cues a 
guessing as to the spatial location of the 
designated stimulus. Thus the question 
mains unanswered as to whether perception 
influenced by frequency of prior usage Of 
whether fewer partial cues are needed to 
identify more familiar materials. 
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partial perception would seem to be inevitable 
if complex patterns are to be employed. The 
search for a “pure” measure of, perception 
would thus appear to be a futile one. The latter 
conclusion is hardly novel. For example, Post- 
man (1953) has suggested that an inference 
about a perceptual response disposition can 
safely be made only if it is based on two or 
more classes of responses. A similar approach 
has been taken by Garner, Hake, and Eriksen 
(1956) who urge a program of convergent 
operations. 

What is sought by multiple measurements is 
a consistent empirical relationship between 
performance and the variable under considera- 
tion, one that is common to all sets of opera- 
tions rather than unique. If the selection of 
the sets of operations is such that only the 
perceptual hypothesis is adequate to explain 
the observed consistency of results, the per- 
ceptual hypothesis is thus confirmed and all 
alternate hypotheses (e.g., response suppres- 
sion, linguistic preferences) are disconfirmed. 
However, as was indicated above, all experi- 
mental arrangements which involve testing of 
hypotheses about the perceptibility of com- 
plex stimuli (e.g., words) seem inevitably to 
evoke overt behaviors that are responses to 
partial perceptions. Thus one does not, by 
using multiple measurements, rule out all 
alternatives but the perceptual hypothesis. For 
example, the results of the present investiga- 
tion suggest that the empirical relationship be- 
tween recognition thresholds and word fre- 
quency found with the ascending method of 
limits may not be attributed solely to biases 
in emitting more familiar responses when a 
number of response alternatives are possible. 
But they are still susceptible to a nonper- 
ceptual interpretation: fewer partial cues are 
necessary to identify familiar stimuli. It has 
been the former source of contamination, it is 
Interesting to note, which has occupied the 
attention of Goldiamond, Postman, Garner 
et al. and not the latter. 

There is no reason, however, why attempts 
at verification of perceptual hypotheses con- 
cerning threshold data need to be restricted to 
Investigations within this limited area. For ex- 
ample, there is already a body of literature 
(e.g., Osgood, 1957) concerning the identifica- 
tion of stimulus patterns presented in partial 


495 


but suprathreshold form. If a variable that has 
been shown to influence threshold behavior can 
be shown mot to influence response identifica- 
tion when partial cues are presented at a supra- 
threshold level, a firmer basis would be pro- 
vided to support a perceptual interpretation of 
the former. Conversely, the variables shown to 
affect thresholds might be found to operate in 
the same manner to determine response identi- 
fication of partially presented stimulus pat- 
terns. If such were the case, one might conclude 
that the unique contribution of the scores of 
threshold studies that have appeared over the 
past 15 years has been to demonstrate that 
short durations of exposure, low illumination 
levels, blurred carbon copies, etc. are all meth- 
ods for obtaining partial perceptions. 


SUMMARY 


A number of studies have suggested that 
word thresholds obtained from the ascending 
method of limits are particularly susceptible to 
linguistic biases and hence may indicate little 
about perception. In the present investigation 
a forced-choice technique was employed in 
which S was presented with several words and 
required to identify the spatial location of a 
given word. With this technique, which re- 
duces or eliminates response bias, the same 
positive relationship between frequency of 
prior usage and performance as had been found 
with the ascending method of limits was ob- 
tained. The implications of a perceptual inter- 
pretation of these findings were discussed. 
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tients were asked to dream about a briefly 

exposed picture many items not men- 
tioned in deliberate attempts at recall ap- 
peared in distorted form in the patients’ 
dreams. Over the years there were several re- 
ported confirmations of Poetzl’s findings 
(Malamud, 1934; Malamud & Linder, 1931), 
but the current revival of interest in this phe- 
nomenon has been mainly due to the work of 
Fisher (1954, 1956, 1957). While these studies 
have been interpreted as confirming Poetzl’s 
original finding, they have lacked much in 
terms of methodological sophistication and pre- 
cision in assessment of the amount of “uncon- 
scious recall.” 

Recently Shevrin and Luborsky (1958) have 
attempted to establish the “Poetzl phenom- 
enon” on a firmer basis, primarily through im- 
proved quantification in the assessment of the 
unconscious or preconscious recall effects. Sub- 
jects (Ss) were shown a scene by means of a 
tachistoscopic exposure and were asked to de- 
scribe the scene and recall the items contained 
in it. After a second attempt at deliberate re- 
call they were instructed to dream that night 
and report the following day for a second ses- 
Sion. Since a large proportion of the Ss did not 
bring dreams to the second session, they were 
asked to conjure up a daydream or image. The 
Ss’ images and dreams as well as their inten- 
tional recalls were then rated for the presence 
of items contained in the stimulus picture. 
These ratings were in terms of conceptual 
similarity to items contained in the original 
Picture rather than in terms of a criterion of 
Specific identity. 

When the dreams and images were corrected 
for the number of items mentioned in inten- 
tional recall, Shevrin and Luborsky (1958) 
found a significant number of conceptually 
Similar items in the dreams and images over 
and above that contained in intentional recall. 

*This research was supported by Research Grant 


M-1206 from the National Institute of Mental Health, 
United States Public Health Service. 


I 1917 Poetzl reported that when his pa- 


This they ascribed to the effect of preconscious 
perception of the stimulus items during the 
tachistoscopic exposure of the picture and the 
subsequent use of these preconscious percep- 
tions by the primary process in dreams and 
images. 

While Shevrin and Luborsky (1958) intro- 
duced some important refinements in quantifi- 
cation of the Poetzl effect, their study lacks 
several basic controls that are imperative if 
one wishes to explain the results in terms of 
preconscious perception and preconscious re- 
call. Since a criterion of conceptual similarity 
was used in comparing the stimulus picture 
with the images (a truck in the picture is 
considered recalled if the dream contains a 
bicycle, since they are both modes of convey- 
ance) rather than specific identity, a very 
careful estimation of the chance level at which 
various kinds of objects appear in dreams and 
images of Ss who have not been exposed to the 
picture needs to be obtained. The images from 
these Ss would have provided an estimate of 
the base rate necessary to evaluate whether 
significant preconscious recall occurred among 
the experimental Ss, 

A second source of artifact in their data 
arises from their use of a cohesive scene as the 
stimulus picture. Since the stimulus used is a 
natural representation of the environment, 
objects contained therein have a high proba- 
bility of being found together in our everyday 
experience. This could have resulted in a 
spurious preconscious recall effect in the follow- 
ing manner. Let us assume that after a tachis- 
toscopic exposure of a scene the only thing an 
S can deliberately recall is the impression of a 
lake. Now, if shortly thereafter he is asked to 
form an image, it is not unreasonable to suspect 
that the image or daydream is in part deter- 
mined by his associations with a lake. And his 
associations to a lake, furthermore, would not 
be expected to be random. It would seem that 
such things as boats, wharves, trees, birds, etc. 
would occur with a high probability relative 
to such items as marbles and calculators. Since 
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it is also probable that a stimulus picture of a 
cohesive naturally occurring scene containing 
a lake will also have some of these objects in it, 
5S is likely to receive credit for preconscious re- 
call of objects the presence of which is the 
result of prior learned associations and inde- 
pendent of any preconscious perception or 
memory. 

There are several methods by which this 
artifact due to prior learned associations can be 
eliminated or evaluated in preconscious recall. 
One method would be the use of a control 
group that is not exposed to the stimulus pic- 
tures. Instead Ss in this control group are 
randomly paired with experimental Ss and 
each control S reads over and studies the 
original recall of his matched experimental S. 
Images are then obtained from the control 
group and the images are evaluated for the 
occurrence of items conceptually similar to 
those in the stimulus picture that were not 
mentioned in the deliberate recall of the ex- 
perimental S. A second method would be to use 
stimulus pictures composed of unrelated ob- 
jects. One could use a random method of se- 
lecting a list of objects and then randomly as- 
signing a given number of them to a stimulus 
card. This would serve to preclude spurious 
recall effects due to prior learned associations 
among the objects. 

Finally, the Shevrin and Luborsky study, 
as have most of the other studies on the Poetzl 
phenomenon, made use of Ss who had an un- 
specifiable amount of prior contact and asso- 
ciation with the Zs. This uncontrolled associa- 
tion makes it impossible to specify the exact 
nature of the experimental conditions and 
instructions to the S. One cannot know the 
degree to which Ss may be contaminated by 
the expectations of the Es and the degree of 
nonexplicit verbal and nonverbal communica- 
tion that constitutes part of the real instruc- 
tions given to the S. 

The present study is an attempt to assess 
the amount of preconscious recall when these 
sources of artifact have been eliminated. Using 
a method similar to that of Shevrin and 
Luborsky (1958) we have assured not only 
independence between the judges’ ratings of 
the unintentional recall material and their 
knowledge of the experimental conditions, but 
in addition have used a control group to assess 
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the base rate with which conceptually similar 
objects appear without the pictorial stimula- 
tion. A control group was also planned to 
assess the effects of prior learned associations 
among the objects in the pictures but this 
proved to be unnecessary. The unconscious re- 
call effect was eliminated completely when 
correction was made for base rate. 


METHOD 


Procedure 


Three groups of Ss, two experimental and one con- 
trol, were employed. The Ss were assigned to the 
three treatment groups alternately in terms of the 
order of their appearance for the first experimental 
session. All experimental Ss were seen individually at 
both sessions and in the first session approximately 5 
minutes was spent in conversing with S$ in order to 
establish rapport. The nature of a tachistoscope was 
explained to them and they were told that on the count 
of three a picture would appear briefly in the tachisto- 
scope. The stimulus was then exposed for .02 seconds 
at an illumination level of 1.80 foot-candles as deter- 
mined by a McBeth Illuminometer. For Ss in Experi- 
mental Group I the stimulus was an archeological 
scene (Stimulus A) and for Experimental Group II an 
early historical waterfront scene (Stimulus B). 

Following the exposure S was asked to draw and 
describe as fully as he could what he had seen. En- 
couragement was given to overcome hesitancy in 
drawing and to reassure him that drawing talent was 
not being assessed. Upon completion of the drawing 
and verbal description the drawing was removed and 
S was then asked to draw and describe again what he 
had seen. He was asked to start from the beginning 
and to draw and describe it as fully as possible. Follow- 
ing this, S was asked how frequently he had dreams. 
He was then asked if he would remember and write down 
upon awakening any dreams that he might have that 
night, and to bring these dreams with him to the 
session that was then scheduled for the next day. 

At the second session S was asked whether he had 
dreamed and, if so, he was instructed to draw and de- 
scribe the dream fully. Following the description of the 
dream S was asked to sit back, relax, and close his 
eyes, and to let his mind wander or daydream until 
some sort of picture came to mind. He was then asked 
to draw and describe as fully as he could the mental 
image. Ss who did not report dreams were asked to go 
through the image procedure twice. Following these 
reports of dream images, the Ss were asked to draw 
and describe as fully as possible the picture that had 
been shown to them the previous day. 

The control Ss participated only in this second ses- 
sion and were treated the same as the experimental 2 
with the exception that approximately 5 minutes cE 
this second session were devoted to the establishing © 
rapport, and, of course, the third recall attempt was 
not required. 


RE-EXAMINATION OF THE POETZL PHENOMENON 


Apparatus and Stimuli 


The stimulus pictures were presented in a Dodge 
mirror tachistoscope. The size of thé pictures was 
8” X 8” at a distance of 24” from the eye. The arche- 
logical scene (Stimulus A) used with Experimental 
Group I depicted a large pyramidal structure in a semi- 
tropical setting. The scene contained a great deal of 
vegetation suggesting a dense surrounding jungle. 
Human figures appeared as only a minor part of the 
picture. The second picture (Stimulus B) was a water- 
front scene depicting a Phoenician city in which a 
number of dark skinned persons are prominently en- 
gaged in construction work while buildings, water, 
and boats appear less prominently in the background. 
The experimental sessions were recorded on tape and 
the recall, dream, and image descriptions were trans- 
cribed for evaluation. 


Subjects 

Forty-five male undergraduates randomly drawn 
from the introductory psychology course at the Uni- 
versity of Illinois were evenly divided among the three 
treatment groups. None had previously participated in 
experiments of this nature and they had had no previ- 
ous contact with the £. 


RESULTS 


Only 10 of the 45 Ss brought dreams to the 
experimental session. In order to have a meas- 
ure that was comparable across all Ss it was 
decided to analyze only the first image for each 
S. This decision was abetted by the Shevrin 
and Luborsky (1958) finding that the precon- 
scious recall effects were more prominent in the 
Image material than in the dreams. 

For use in scoring and evaluating the images 
as well as the intentional recalls, a check list 
was constructed for each of the stimulus pic- 
tures enumerating the items contained in the 
Picture. Stimulus A was broken down into 21 
fairly discrete categories and Stimulus B was 
similarly divided into 29. The drawings and 
verbal descriptions for both the images and the 
intentional recalls were evaluated by four ad- 
vanced graduate students of clinical psy- 
chology, Each pair of these judges evaluated 
the material of two-thirds of the Ss. Prior to 
evaluating the protocols, the judges were given 
a group training session during which scoring 
criteria were discussed and several practice 
Images and drawings were evaluated. In 
evaluating the protocols, however, the judges 
worked independently. Their first task was to 
Tate all of the images on the check list for 
Stimulus A and then B. After the images had 
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been rated the judge evaluated the descrip- 
tions of the stimulus or the intentional recalls. 
These evaluations were made on the same 
check lists of items. 

In evaluating the images the judges were 
instructed to rate in terms of conceptual simi- 
larity as was done by Shevrin and Luborsky 
(1958), Thus, a spear that appeared in Stimu- 
lus B was credited as being present in the image 
if an S reported any type of weapon in the 
image. Particular attention was given to this 
criterion of conceptual similarity during the 
training session. Although the judges had 
available to them the two stimulus pictures, 
their evaluations were made independent of 
the knowledge of the treatment group to which 
an S had been assigned. 

The reliability of the judges’ evaluation of 
the images was computed by means of Pearson 
product-moment correlations between all pos- 
sible pairs of the four judges. The average cor- 
relation was .66. 

In order to facilitate comparison of our re- 
sults with those of Shevrin and Luborsky 
(1958) we have adopted their nomenclature for 
the different recall measures computed. In- 
tentional recall consists of all the separate items 
mentioned by an S in his three deliberate 
attempts to recall the stimulus picture. Related 
imagery items are all the items mentioned in an 
S’s image that are conceptually similar to items 
in the check lists for either Stimulus A or B. 
The related imagery score is further subdivided 
into two scores. First, there is the preconscious 
recall score which comprises the imagined items 
that are related conceptually to items in the 
stimulus picture and have not been reported in 
any of the three attempts at intentional recall. 
Second, the recalled again score represents those 
imagined items that also have appeared in one 
or more of the intentional recalls. Finally, un- 
intentional recall means all the items contained 
in the S’s image. 

Since the agreement between judges was less 
than perfect, two parallel analyses or sets of 
recall scores were computed. The first set is for 
the perfect agreement condition in which both 
judges of a pair agree upon the presence of the 
item. The second, the incomplete agreement 
condition, is based upon indication by one or 
the other judge that the item was pesent. 

Our first step in analyzing the data was to 


500 


determine whether or not the imagery obtained 
from the three groups was comparable irre- 
spective of its relation to stimulus pictures. A 
simple analysis of variance was run on the un- 
intentional recall scores for the Ss in the three 
groups. The between groups F of .35 (p > .10, 
df 2 and 42) did not appraoch statistical sig- 
nificance. The comparability of the three 
groups can be seen in Table 1 where the mean 
unintentional recall score by group is given. 
This table also shows the mean intentional re- 
call scores for the two experimental groups. 

The comparability of the three groups on 
unintentional recall is fortunate since it facili- 
tates the comparison of the three groups on the 
number of imagined items that are conceptua- 
ally similar to the items in the stimulus pic- 
tures. For this analysis we computed the re- 
lated imagery score for each S in each of the 
three groups for Stimulus A and also for 
Stimulus B. These scores were evaluated by a 
modified three-way classification analysis of 
variance (groups, stimuli, and individuals). A 
separate analysis was performed for the perfect 
agreement and the incomplete agreement 
scores. In keeping with the concepts of pre- 
conscious perception and preconscious recall 
it was anticipated that Ss in Group I would 
have a greater amount of related imagery to 
Stimulus A than either Group II or the control 
group. Similarly, Group II should have a sig- 
nificantly greater related imagery score for 
Stimulus B than either of the other two groups. 
This effect should be revealed in a significant 
group-by-stimulus interaction in the analyses 
of variance. 

These expectations, however, were not con- 
firmed in either analysis. For the items showing 
perfect agreement among judges the interac- 
tion F was .49 (p > .10, df 2 and 42) while for 
the incomplete agreement scoring the F was 
1.76 (p > .10, df 2 and 42). The only significant 
effect in either analysis was the between stimu- 


TABLE 1 


MEAN NUMBER OF Items IN UNINTENTIONAL AND 
INTENTIONAL RECALLS 


Group Unintentional Intentional 
Experimental I 11.9 Se Das 
Experimental II 10.7 9.6 5.9 
Control 1.9 


Note.—Values in italics represent perfect agreement condition. 
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TABLE 2 
MEAN RELATED IMAGERY IN UNINTENTIONAL RECALL 
Group Stimulus A Stimulus B 
Experimental I 10.4 8.2 
5.0 3.5 
Experimental II 8.9 8.4 
4.3 3.6 
Control 10.8 10.9 
FE 4.1 


Note.—Values in italics represent perfect agreement condition, 


lus picture effect for the perfect agreement 
data(F = 9.83, p < .01, df 2 and 42). As can 
be seen in Table 2 this significant effect arises 
from the fact that all three groups have a 
smaller number of items in their images related 
to the items contained in Stimulus B. ; 

The data in Table 2 in conjunction with the — 
above analyses provide no evidence that the 
nature of the tachistoscopically presented pic- 
ture, or even no picture at all, has any effect 
upon the Ss’ related imagery scores. Not only 
does the control group show as much related 
imagery to the two stimuli as do the cor 
responding experimental Ss, but Experimental 
Group I has about the same amount of related 
imagery for Stimulus B as does Group II, and 
Group II has as much related imagery for 
Stimulus A as does Group I. In addition, it is 
to be noted that the magnitudes of the related 
imagery scores for all three groups are com- 
parable to the scores obtained by Shevrin and 
Luborsky (1958) although the present scores 
tend to be slightly lower. A somewhat lower 
related imagery score is to be anticipated in our 
data since the number of items in our stimulus 
pictures was somewhat less than in the picture 
used by Shevrin and Luborsky (21 and 29 
items in Pictures A and B as compared with 34 
in the Shevrin and Luborsky picture). 

Before we conclude that Shevrin and Lubor- 
sky’s Poetzl phenomenon was solely attributa- 
ble to an unassessed base rate effect, there 18 
one possibility that needs to be considered. 
While the two experimental groups do not 
differ from each other or from the control group 
on related imagery, it may be that the kind 0 
items appearing in the images differ among the 
three groups. 

We assessed this possibility in two ways 
First, we computed the preconscious rec 
scores for the experimental Ss by subtracting 
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their recalled again scores (items in the images 
that were mentioned in one or more of the at- 
tempts at intentional recall) from’ the related 
imagery scores. Since the control Ss had not 
received a pictorial presentation, no intentional 
recall score existed for them. However, this 
problem was solved in the following manner. 
Each of the 15 control Ss was randomly paired 
with one of the 15 Ss in Experimental Group I 
and the control S was then assigned the inten- 
tional recall items of his corresponding experi- 
mental mate. The preconscious recall score for 
a control S was computed by subtracting from 
his related imagery all those items mentioned 
by his matched experimental S during inten- 
tional recalls. The control Ss were then ran- 
domly paired with the experimental Ss in 
Group II and this procedure repeated. To 
check upon the randomization of this pairing 
a second randomization was performed for both 
groups. Also, parallel computations were again 
made on data obtained from the perfect agree- 
ment and the incomplete agreement conditions. 
With these measures for the three groups of Ss 
we are now in a position to test the possibility 
that the items comprising the related imagery 
of the experimental Ss are different from those 
of the control Ss. 

If the kind of related imagery items differs 
among the three groups, then subtracting an 
experimental S’s intentional recall items from 
a control S’s related imagery items should re- 
sult in a difference of the recalled again scores 
between the experimental and control groups. 
The control Ss should have lower recalled again 
scores and correspondingly higher preconscious 
recall scores. 

The mean recalled again and preconscious 
recall scores obtained in this manner are shown 
in Table 3. As is seen there is little or no differ- 
ence in the recalled again scores between Group 
Tand the control Ss or between Group I and 
the control Ss. This lack of difference is con- 
firmed by / tests run between Group I and the 
two randomizations of the control group. For 
the perfect agreement data the values of ¢ were 
‘37 for the first randomization and .56 for 
the second randomization. For the incomplete 
agreement data the corresponding ¢ values were 
46 and .18. Similarly comparisons of Group II 
with the two randomizations of the control 
group gave 7 values of .43 and .19 for perfect 
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TABLE 3 


MEAN NUMBER OF ITEMS PRECONSCIOUSLY RECALLED 
AND RECALLED AGAIN 


Perfect Agreement pee 
Pı Pi 
recon- | Recalled | Erecon- | Recalled 
geo, | "Kenta | glou | Again 
Stimulus A 
Experimental I 3.3 1.6 | 5.8 4.5 
Control 
First random 3.3 1.9 5.9 4.9 
matching 
Control 
Second random 3.3 1.9 6.4 4.4 
matching 
Stimulus B 
Experimental IL 2.2 1.3 5.3 3.1 
Control 
First random 3.0 1A 6.9 3.9 
matchi: 
Control 
Second random 2.9 1.2 6.7 4.1 
matching 


Heer ee E ee SEER EEE 


agreement and .86 and 1.02 for incomplete 
agreement? 

The failure to obtain significant or even ap- 
preciable differences in this comparison renders 
unlikely the possibility that the kinds of items 
contained in the related imagery of the control 
group and experimental groups differ. Since 
the operation of subtracting an experimental 
S’s intentional recall items from a control S’s 
related imagery leaves both Ss with similar 
recalled again and preconscious recall scores, 
the control S’s related imagery must consist in 
large part of the same items existing in an ex- 
perimental S’s related imagery. In the experi- 
mental S’s recall, however, they are labeled as 
recalled again, but the analysis casts considera- 
ble doubt as to whether these are items recalled 
again or just another manifestation of the base 


rate. 


2 Since the preconscious recall score is obtained by 
subtracting recalled again items from related imagery 
there was no point in testing the significance of the 
means for this measure. Since there was no difference 
between the groups on related imagery or recalled again 
scores there could have been no difference in precon- 
scious recall. bee 
3 Although only four dreams were obtained in each 
of the experimental groups and two in the control 
group, identical analyses were carried out on the 
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One further test of the question of differences 
in kind of unintentional recall was performed. 
We computed the number of times each of the 
21 items in Stimulus A was mentioned in the 
imagery of Experimental Group I Ss and the 
control Ss. Eight of the items occurred more 
frequently in the images of the experimental 
group but on only three items was the fre- 
quency difference 2 or greater (differences of 2, 
3, and 7). A similar comparison of the control 
Ss with Group II on the 29 items of Stimulus 
B showed that 6 of the items occurred more 
frequently in the images of the experimental 
Ss, but for only 3 of these items was the fre- 
quency difference 2 or greater (differences of 
2, 2, and 4). 


DISCUSSION 


The foregoing results offer no support to the 
view that unconscious or preconscious percep- 
tion occurs during tachistoscopic presentation 
and is subsequently reflected in such indirect 
measures as daydreams or images. We were 
unsuccessful in finding any differential influ- 
ence upon the images of our Ss as a function of 
differential prior stimulation. Not only do the 
control Ss, who have seen neither stimulus pic- 
ture, have related imagery scores closely simi- 
lar to the experimental groups’, but the amount 
of preconscious recall for these control Ss is also 
directly comparable (see Table 3). These find- 
ings hold true not only when we use the con- 
servative criterion of perfect agreement be- 
tween independent judges as to the presence or 
absence of a stimulus item in the $’s imagery, 
but also when we use the more liberal criterion 
of one or the other judge’s considering the item 
present in the image. 

It is possible that our failure to confirm 
previous interpretations of the Poetzl phe- 
nomenon may be due to subtle differences in 
the population of Ss used and unrecognizable 
differences in techniques and methodology. 
The fact that the magnitude of the related 
imagery and the preconscious recall obtained 

in our study is directly comparable to that of 
Shevrin and Luborsky (1958) would argue 
against such an interpretation. Since both 
studies used what would appear to be compara- 


dreams. The results were comparable to those reported 
above and there was similarly no indication of even a 
trend in the direction of preconscious recall. 


HAROLD JoHNSON AND CHARLES W. ERIKSEN 


ble stimulus material, the finding that any 
lated imagery or preconscious recall in ou 
study was accounted for solely in terms of th 
base rate would necessarily imply that 
Shevrin and Luborsky results were also a fun 
tion of the unassessed base rate. 

The demonstration of the high base rate op- 
erating in the use of a classification system like 
“conceptual similarity,” suggests that extreme 
care should be taken in the evaluation of- 
dreams, images, and other associational da 
Such a high rate of coincidental appearance: 
objects in fantasy may lead one to jump 
tenuous conclusions of causality in the inter 
pretation of dreams, projective tests, etc. when 
in fact, the relationships perceived are explain: 
able in terms of base rate phenomena due ti 
the remarkable ability of the human mind i 
see conceptual relations among almost any o 
jects. 

We had originally anticipated that the 
rate would not account for all of the relate 
imagery and preconscious recall effects and, 
was pointed out in the introduction of th 
paper, we had planned to institute a control 
for the effect of prior learned associat 
among objects. Since there was no effect re- 
maining after base rate had been elimina’ 
the application of this control would have bet 
rather pointless. Nevertheless, it is a contr 
that needs to be considered in evaluating oth 
studies on the Poetzl phenomenon and 
tainly needs to be taken into account in an 

future investigations inethis area. A 

It has been pointed out elsewhere (Erikse 
1960) that the evidence for unconscious pêl 
ception during a tachistoscopic presentation i 
at best highly controversial. Experimen' 
should therefore be particularly careful in pos 
iting unconscious processes as explanations fo 
their data when simpler explanations have no 
been ruled out. 

Even when artifacts such as those discus 
here have been excluded there are simpler ê 
planations available for an obtained Poet 
phenomenon that do not require usage of an 
biguous concepts such as preconscious percep- 
tion and primary process thinking. A Poetal- i 
like effect could be produced as a function o! 
an S’s operating characteristic (Goldiamond, 
1958). If, as seems reasonable, Ss tend to adopt 
a more rigorous criterion for report under the 
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intentional recall condition with its veridical 
orientation than under the daydream condi- 
tion, items of which S was too unsure to men- 
tion in deliberate recall might appear in the 
less reality oriented daydreams. 


SUMMARY ° 


The Poetzl phenomenon was experimentally 
analyzed using a method similar to that em- 
ployed by Shevrin and Luborsky (1958). The 
Ss in two experimental groups were shown 
tachistoscopically one of two experimental pic- 
tures and were asked to describe it. Daydreams 
obtained from these Ss on the following day 
were analyzed to determine the presence of 
items conceptually similar to those in the re- 
spective pictures. This experiment differed 
from Shevrin and Luborsky’s in using a control 
group who received no prior picture stimula- 
tion but whose images were examined to deter- 
mine the base rate for conceptually similar 
items. Also, stricter experimental control was 
maintained over the instructions to the Ss. 

Analysis of the results provided no support 
for an interpretation of the Poetzl phenomenon 
in terms of preconscious perception. Although 
the number of conceptually similar items ap- 
pearing in the images that had not been re- 
ported in intentional recall was comparable in 
magnitude to the number reported by Shevrin 
and Luborsky, it was found that this number 
did not differ significantly nor appreciably from 
the base rate obtained from the control group. 

2 
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Several sources of artifact that occur in ex- 
perimental designs such as are employed in 
studying the Poetzl effect were discussed. 
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STABILITY OF ACHIEVEMENT “AND RECOGNITION SEEKING © 
BEHAVIORS FROM EARLY CHILDHOOD i 
THROUGH ADULTHOOD! 


HOWARD A. MOSS anp JEROME KAGAN 
Fels Research Institute, Yellow Springs, Ohio 


HE supposition that selected adult re- 

sponse patterns are established at an 

early age is a primary assumption of de- 
velopmental theory. Although literary docu- 
ments and psychotherapy protocols have pro- 
vided anecdotal support for this hypothesis, 
more objective validation has been difficult to 
obtain. The present paper is a second report 
that has emerged from a larger project on the 
stability of childhood behavior. The first paper 
indicated that dependent behavior in girls 
showed moderately high stability from the 
early school years through young adulthood 
(Kagan & Moss, in press). The present report 
is concerned with the developmental consist- 
ency of two related behaviors: the tendency to 
strive for (a) mastery of selected skills (achieve- 
ment behavior), and (b) social recognition 
through acquisition of specific goals or be- 
haviors (recognition behavior). 

The achievement variable emphasizes mas- 
tery of intellectual, athletic, mechanical, and 
artistic skills as well as competence in special- 
ized crafts. Social recognition is obtained 
through acquisition of most of the above be- 
haviors. For intellectual competence, athletic 
ability, acquisition of money, and positions of 
power in social groups are the primary methods 
of obtaining social recognition in the cultural 
milieu of our middle class population. Thus, 
the overt behaviors involved in achievement 
and recognition strivings overlap to some de- 
gree. 

In an attempt to differentiate between these 
two variables, the investigators evaluated the 
degree to which the individual’s mastery be- 
havior was directed at satisfaction of an in- 
ternal standard of excellence in order to gain 
self-approval (achievement motivation), in 
contrast to seeking approval from the social 
environment (recognition motivation). This is 
a difficult differentiation to make. The data to 

1 This research was supported, in part, by Research 
Grant M-1260 from the National Institute of Mental 
Health, United States Public Health Service. 
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be presented reveal a high, positive correla 
between ratings of these two behavioral 

bles. This interdependence suggests* 
be impossible to measure the “d. 
prove at a skill” independent of the 
ual’s “desire for social recognition” fort 
provement. 


METHOD 


Subjects and General Procedure 


The subjects were 36 males and 35 females 
the Fels Research Institute’s longitudinal 
They were enrolled in the project at birth, durin 
years 1929-1939. At the time of a recent adult a 
ment (1957-1959) they were between 20 and 
of age. The subjects came from predominantly 
class backgrounds, over half of the group 
ried; 70% had college degrees or were enrol 
college, and the majority were living within a 
radius of the institute. The adult group incl 
Protestants, 15 Catholics, and 1 Jew. 

The heart of this study consists of correlatio 
tween the childhood information on these subje 
their adult behavior. The childhood data in 
longitudinal observations of the child’s behavior d 
the first 14 years of life in a variety of settings, (0) 
servations of the mother-child interaction di 
years, (c) TAT protocols obtained in adoles 
(d) annual Stanford-Binet intelligence test sco 
ing the ages 5-11. Although the data collected 
adulthood (age range 20-29) sampled a v 
techniques, this report utilizes only two soi 
adult information, 5 hours of interview, and a 
protocol. 


Longitudinal Observations: Birth to Age 


As a standard procedure of the Fels lo 
program, psychologists or psychologically tra 
sonnel summarized their observations of the ch 
the home, in the Fels nursery school and day 
and in the subject’s public school. The home 
were based on a visit to the home where mo 
child were observed for half-day sessions, These 
visits were generally made semiannually for 
years of life and annually from 6 to 12. Most 
mothers were interviewed each year for the 
years of the child’s life. The nursery school s 
were based on semiannual, free-play sessions 
2.5 to 5. The sessions usually consisted of 15 
tive half-day periods in groups of 10-12 chil 
camp typically consisted of an annual 2 week 
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of half-day periods during age 6-10 in which free and’ 
structured group activities were observed. Public 
school visits, made semiannually, consisted of a half- 
day observation of the child in his routine classroom 
activities. Finally, the subjects of age 6-14 were inter- 
viewed each year at the institute and a summary of 
the interview was prepared. All of the longitudinal 
reports for each subject were collated in ghronological 
order and placed in the subject’s individual file.” 

Scoring of longitudinal variables. A comprehensive 
list of rating scale variables (seven-point scale) was 
defined for the purpose of evaluating the narrative 
material füst outlined. The material for each subject 
was divided into four age periods: 0-3, 3-6, 6-10, and 
10-14. The senior author, who had no knowledge of 
the adult psychological status of the subjects, first 
read all the information for each subject for age 0-3 
and made those ratings for which he had adequate in- 
formation. Following a period of interpolated work, he 
studied each subject’s material for age 3-6 and again 
made his ratings. This procedure was repeated for 
ages 6-10 and 10-14. A period of approximately 6 
months intervened between the evaluation of the data 
for any one subject for each age period. This paper 
deals only with the stability of achievement and recog- 
nition behaviors and abridged definitions of these 
variables follow. 


Childhood Variables 


Achievement behavior: (Rated for ages 0-3, 3-6, and 
6-10). This variable assessed the degree to which the 
subject tended to persist with challenging tasks, games, 
and problems, and his involvement in activities in 
which a standard of excellence was applicable. For 0- 
3, emphasis was given to persistence with perceptual- 
motor activities, (e.g, making block towers, stringing 
beads, drawing, and coloring). For ages 3-6 and 6-10 
the greatest weight was given to interest in and persist- 
ence with intellectual, mechanical, athletic, and fine 
motor activities, 

For age 10-14 the general achievement variable de- 
fined above was differentiated into three variables 
dealing with different achievement ardas (intellectual, 
mechanical, and athletic). 

Intellectual achievement: (Rated for age 10-14). This 
variable assessed the degree to which the subject at- 
tempted to master language and numerical skills and 
showed involvement in the acquisition of knowledge. 

Mechanical achievement: (Rated for age 10-14). This 
variable assessed the degree to which the subject at- 
tempted to master mechanical skills and manifested 
involyement in activities such as carpentry, construc- 
Se model vehicles, engines and motors, and craft 
work. 

Athletic achievement: (Rated for age 10-14), This vari- 


* The staff of the institute during the 24 years of 
data collection included Alfred L. Baldwin, Thomas 
W. Richards, Horace Champney, Virginia L. Nelson, 
assisted by Leah Levinger, Helen Marshall, Mary 
Frances Hartson, Joan Kalhorn Lasko, Faye Breese, 
Margaret Slutz, Marjorie Powell, Frances Best, and a 
group of assistants to whom the authors are indebted. 
Their efforts made this project possible. 
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able assessed the degree to which the subject attempted 
to master and showed involvement in athletic activi- 
ties. These behaviors included swimming, hiking, base- 
ball, football, basketball, tennis, acrobatics, and track 
events. 

Recognition seeking behavior: (Rated for ages 6-10 and 
10-14). This variable assessed the subject’s striving to 
obtain goals that led to recognition from parents, 
teachers, and peers. The behaviors emphasized in the 
rating were (a) grades in school and school honors, (b) 
stated desire for status-laden vocations or ostentatious 
material goods, (c) striving for leadership in teams or 
clubs, (d) attempts to get recognition from farm ac- 
tivities (e.g., raise the best calf, the highest corn, etc.). 


Maternal Variables 

Maternal acceleration of developmental skills in child: 
(Rated for ages 0-3, 3-6, and 6-10). The home visits 
and maternal interviews yielded information on the 
mother’s behavior and attitudes toward her child. The 
maternal variable that is directly relevant to the sub- 
ject’s achievement behavior was called maternal ac- 
celeration. It was defined in terms of the degree to which 
the mother showed concern over the subject’s cognitive 
and motor development, and the degree to which she 
exhibited desires for precocious achievement in her 
child. The rating reflected the degree to which the 
mother “pushed” the subject’s development beyond 
his abilities and her concern with his general achieve- 
ment level. 


Adult Interview 

The junior author, who had no knowledge of the 
subjects childhood information, interviewed each sub- 
ject and rated him (seven-point scale) on a variety of 
variables. The definitions of the variables related to 
achievement and recognition seeking behaviors follow. 

Achievement behavior. This variable evaluated the 
subject’s behavioral attempts to master tasks for 
which “self-satisfaction” rather than social recogni- 
tion was the salient goal. In achievement behavior, 
the subject was striving to attain a self-imposed stand- 
ard of excellence. The rating was based on the sub- 
ject’s emphasis and concern with task mastery in his 
job and avocational pursuits, 

Recognition seeking behavior. This variable evaluated 
the subject’s behavioral attempt to obtain symbols of 
status and social recognition. The rating was based on 
evidences of strivings for (a) vocational recognition, (b) 
academic awards and honors, (c) positions of leadership 
or recognition in community or vocational groups, @ 
concern with conspicuous material display, (e) striving 
for upward mobility in social class position. 

Concern with intellectual competence. This variable 
assessed the value the subject placed upon intelligence, 
knowledge, academic achievement, and intellectual 
superiority regardless of whether the goal was to satisfy 
inner standards or to obtain social recognition. 


Reliability of Longitudinal and Adult Interview 


Ratings . 
A random sample of 32 tape recorded adult inter- 
views were independently studied and rated by a second 
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judge to assess the reliability of the juniot author’s 
adult ratings. The reliabilities of the longitudinal vari- 
ables were also assessed through independent ratings, 
by a second judge, of samples of 50-60 cases at each of 
the four age periods, The reliabilities of the adult and 
child ratings were determined by product-moment cor- 
relation coefficients.’ For the adult ratings of achieve- 
ment behavior, recognition behavior, and intellectual 
concern the reliability coefficients were .84, .99, and 
.98, respectively. With the exception of one child be- 
havior variable, the reliabilities of the longitudinal 
ratings ranged from .74 to .90 with a median coefficient 
of 81. The one low longitudinal reliability was for 
child’s achievement for age 0-3 (z = +.35; p < .01; 
two-tailed). 


TAT Achievement Fantasy: Adolescent and 
Adult Protocols 


Early adolescent (median age of 14-6) protocols were 
available for 67 of the 71 subjects, and all 71 subjects 
were administered TAT stimuli following the adult 
interview. The adolescent protocol was based on seven 
cards from the Murray (1943) series (Cards 1, 5, 14, 
17BM, 3BM, 6BM, and 3GF). The male adult protocol 
was based on 13 cards (4, 8BM, 7BM, 6BM, 12M, 
17BM, 13MF, 14, 3BM, 5, 1, 3GF, and 18GF). The 
adult females were also administered 13 cards (4, 6GF, 
12F, 2, 8GF, 17BM, 13MF, 14, 3BM, 5, 1, 3GF, and 
18GF). For both the adolescent and adult protocols 
achievement themes were scored according to the 
scheme described by McClelland, Atkinson, Clark, and 
Lowell (1953). Since incidence of the subcategories of 
the McClelland scoring system were infrequent, only 
stories in which achievement behavior was the major 
aspect of the plot were considered. These are scored 
Ach Th in the McClelland scheme. For the adolescent 
protocol, there was a lack of comparability among the 
examiners with respect to the inquiry questions and 
only the spontaneous verbalization of the subject was 
scored. Agreement between two independent coders 
was 95%. The longitudinal and interview ratings of 
achievement and recognition behavior were made with- 
out knowledge of the subject’s adolescent or adult TAT 
stories. Thus, the behavior and interview ratings were 
independent of each other and of the TAT thematic 
scores, 


IQ Change 


Each child was given the Stanford-Binet, Forms L 
and M alternately, annually from ages 5 through 11 by 
the same psychologist.‘ The mean IQ for the entire Fels 
population is about 120 (SD of 15). For each subject, 
a smoothed plot of his IQ scores was obtained by aver- 
aging his three IQ scores around each age. For example, 
a child’s smoothed or average IQ at age 6 was the re- 
sult of averaging his IQ scores at ages 5, 6, and 7; his 
smoothed IQ at age 10 was the average of his IQs at 
ages 9, 10, and 11. This procedure tends to remove the 


3 All correlations were corrected for restricted range 
of scores using a procedure described by Peters and 
Van Voorhis (1940, pp. 395-398). 

4 Virginia L. Nelson administered all of the IQ tests. 
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‘chance variation associated with any one IQ score 
has been used in other studies (Kagan, Sontag, B 
& Nelson, 1958; Sontag, Baker, & Nelson, 1958), 
subject’s smoothed IQ at age 6 was then subtract 
from his smoothed IQ at age 10 and the resulting 
ference was used as a measure of IQ change. As” 
achievement themes, the child and adult achievem 
ratings were made without knowledge of the subje 
IQ or his IQ change score. 

In summary, four independent sources of data 

analyzed: child and maternal behaviors for the first 

years of life, adult behavior, adolescent and 
achievement themes, and childhood IQ change sco 


Statistical Analysis 


Relationships among the following variables 
evaluated: (a) childhood achievement and mat 
acceleration ratings with the adult interview rat 
(b) adolescent achievement themes with adult a 
ment themes, (c) adolescent and adult achieve: 
themes with the longitudinal and adult ratings, 
(@ IQ change scores with the childhood and 
ratings. Product-moment correlations were used ey 
when the TAT achievement score was involved. 
achievement themes were not normally distril 
contingency coefficients® were used for all tests of 
ciation using this variable. Mechanical achievemen 
age 10-14 was the only variable for which there 
significant sex difference; the boys having a higher 
rating than the girls (p < .05; two-tailed). 


RESULTS 


Stability of Achievement and Recognition 
haviors 

Table 1 presents the relationships bet 
the child and adult ratings of achievement 
recognition behavior, as well as the relai 
between maternal acceleration and the a 
achievement variables. There are several 
portant results in this table. The rating‘ 
achievement behavior for age 6-10 showed 
significant, positive association with all ri 
adult variables for both sexes. The rating 0 
achievement for age 3-6 was predictive | 
adult behavior for the females but not for 
males, a finding that suggests the earlier en 
gence of stable achievement strivings in g 
development than in boys. Of the 
achievement behaviors rated for age 10- 
only intellectual mastery was predictive 
adult achievement for both sexes. Involveme 
in mechanical activities was predictive of 2 
achievement for boys but not for girls. Athl 
achievement showed no relationship to # 

The contingency coefficients were based oo 


squares computed from Mood’s likelihood ratio t 
for a 3 X 2 distribution (Mood, 1950, p. 257). 
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TABLE 1 
RELATION BETWEEN LONGITUDINAL RATINGS OF CHILDHOOD ACHIEVEMENT AND EARLY MATERNAL 
ACCELERAFION WITH ADULT ACHIEVEMENT BEHAVIOR 
(product-moment correlations) 
Adult Variables 
Childhood Variables Age Achievement Recognition Intellectual Concerns 
Males Females Males Females Males Females 
Recognition 6-10 E iier .40** -42** -48*** .37** 559400 
10-14 25 +20 .36* 139%" 24 .40** 
Achievement 0-3 —.12 —.02 01 —.22 —.08 —.02 
3-6 —.03 F Sied —.11 Koea 13 -44** 
6-10 .46*** .38** 57ee* „514+ „69%++4 ,49ee* 
Achievement 
Intellectual 10-14 -40** .42** Ota ook: Mee uoan ADtet 
Mechanical 10-14 -20 -20 -46** 02 Aree 27 
Athletic 10-14 —.18 01 —.17 —.09 —.47"* 02 
Maternal Ac- 0-3 22 36* .44* .41* 09 -36* 
celeration 3-6 31 09 +24 12 42%* 12 
6-10 14 33* -16 +23 32" 43%" 
* p <05; one-tailed. 
** p < 02; one-tailed. 
$ < 01; one-tailed. 
“"** p < 001; one-tailed. 


rating of general adult achievement, and was 
ji negatively associated with intellectual concern 
for adult males (p < .02). 
Recognition seeking behavior for age 6-10 
was also predictive of adult achievement be- 
havior. A few of the child variables were mod- 
erately intercorrelated and the three adult 

variables were highly intercorrelated (Tables 
4, 5, and 6). This lack of independence makes 
Some of the stability correlations between 
childhood and adulthood somewhat redundant. 


Maternal Acceleration and Adult Behavior 


f Maternal concern with the child’s develop- 
_ Mental progress during the first 10 years of life 
showed low to moderate correlations with adult 
_ achievement behavior. The maternal rating for 
age 6-10 was not a better predictor of adult 
behavior than the maternal rating for the first 
3 years of life. Moreover, the age 0-3 rating 
Was associated with all three adult, achieve- 
= Ment variables for girls, while it predicted only 
_ Tecognition behavior for adult males. 
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Stability of TAT Achievement Fantasy 


Although different sets of TAT pictures were 

a Used in obtaining the adolescent and adult pro- 
tocols, the three pictures that usually elicited 
achievement stories were presented at both ad- 
munistrations. Cards 1, 14, and 17BM, which 


elicited 77% of all the achievement themes, 
were common to both protocols. The strong 
tendency for these particular cards to elicit 
achievement themes has been noted in another 
study (Kagan & Moss, 1959), A typical 
achievement theme to Card 1 concerned a boy 
who wanted to master the violin and/or be- 
come a famous violinist. A typical achievement 
story to Card 17BM involved a person who was 
in a rope climbing contest and wanted to do his 
best to win. A common achievement story to 
Card 14 concerned an artist or student who had 
been working hard and was looking forward to 
fame and success as a result of his accomplish- 
ments. 

The stability of the TAT achievement score 
between the adolescent and adult protocols was 
determined through the use of contingency co- 
efficients. The stability coefficients were .34, 
.36, and .31 for boys, girls, and total group 
(p < .10, < .05, < .02; one-tailed). Thus, 
achievement themes also showed some degree 
of stability over this 10-year period. These data 
extend the findings of an earlier investigation 
(Kagan & Moss, 1959), in which the authors 
reported a 3 year stability coefficient of .32 
(p < .01) for achievement themes obtained at 
median ages of 8-9 and 11-6. The stability 
coefficients between the adolescent and adult 
protocols are of the same magnitude as those 
found for the earlier age period. 
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TABLE 2 
RELATION BETWEEN TAT ACHIEVEMENT THEMES AND CHILD AND ADULT ACHIEVEMENT BEHAVIOR 
(contingency coefficients) 
Adolescent TAT Adult TAT 
Longitudinal Variable Age Ma ae] SAARE 
Males Females Males Females 
Recognition 6-10 .21 17 .26 22 
10-14 —.20 —.18 —.31 39 
Achievement 0-3 —.15 =—.25 —.20 16 
3-6 -42* 19 19 36 
6-10 24 AS 13 30 
Achievement 
Intellectual 10-14 -30 = 25 -26 -16 
Mechanical 10-14 31 —.62*** 63544% —. 5054 
Athletic 10-14 —.20 12 .12 17 
Maternal acceleration 0-3 „11 bed 25 —.08 
3-6 37 23 27 +28 
6-10 Lane 26 +24 .41* 
Adult interview variables 
(median age 25) 
Recognition .17 +25 -40** 5258s 
Achievement 19 4g .37* 525888 
Intellectual Concerns 440 25 31 Sorna 
* p < 05; one-tailed. 
** p < 02; one-tailed. 
*** p < 0l; one-tailed. 


#8" p < .001; one-tailed. 


Validity of Achievement Themes: Relations with 
Child and Adult Behavior 


Contingency coefficients were computed re- 
lating the occurrence of adolescent and adult 
achievement themes with the longitudinal and 
adult achievement ratings. These results are 
presented in Table 2. The highest and most 
consistent relations were between the adult 
achievement themes and adult interview rat- 
ings. The only significant relation between 
adult themes and the childhood ratings held for 
mechanical achievement (C = +.63; p < .001 
for boys, and —.50; p < .02 for girls). 

The adolescent TAT was also more predic- 
tive of adult behavior than it was of the child- 
hood ratings. Adolescent achievement themes 
predicted adult achievement behavior for 
women (C = +.44; p < .01) and intellectual 
concerns for men (C = +.44; p < .01). 
Adolescent achievement themes showed mini- 
mal association with the child’s achievement 
behavior. The only significant positive associa- 
tion was with age 3-6 achievement for boys. 
Once again the rating of mechanical achieve- 
ment for girls was negatively associated with 
achievement themes. This negative correlation 
may be due to the fact that this is the only 
variable for which markedly different behav- 


ioral referents were used in rating the two sexes. 
For boys, involvement in carpentry, engines, — 
motors, and model airplanes was emphasized 
in the rating. These activities are sex-typed 
and girls showed no interest in them. Participa- 
tion in craft work (making jewelry, leather 
articles) and sewing was also used as evidence 
of involvement in mechanical activities and 
girls tended to choose these behaviors. 

Maternal acceleration during the first 10 
years of life showed suggestive relationships 
with the adolescent achievement themes. For 
example, maternal acceleration for age 0-3 pre- 
dicted achievement themes at adolescence for 
girls (C = +.51; p < .02), but not for boys. 
Maternal acceleration for age 6-10 predicted 
adolescent achievement themes for boys (C = 
+.51; p < .01) and adult achievement themes 
for girls (C = +.41; p < .05). 

In summary, the adult and adolescent TAT 
stories showed moderate correlations Ww! 
adult achievement but minimal association 
with the childhood achievement: ratings. Ma- 
ternal acceleration was associated, to some de- 
gree, with adolescent achievement themes. 


IQ Increase and Achievement Behavior 


The difference between the child’s smoothed 
IQ at age 6 and 10 was used as a measure of 
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TABLE 3 


RELATION BETWEEN IQ CHANGE AND CHILDHOOD AND 
ADULT ACHIEVEMENT V: LES 


(product-moment correlations) 


Longitudinal Variables Age Males Females 
Recognition 6-10] .24 «| .21 
Recognition 10-14) .41** 09 
Achievement 0-3 13 04 
Achievement 3-6 | —.02 +24 
Achievement 6-10; .39*** K Titir 
Achievement-intellec- | 10-14) .37* .41** 

tual 
Achievement-mechan- | 10-14) „15 14 

ical 
Achievement-athletic | 10-14) —.16 PAGNE. 
Maternal acceleration | 0-3 | —.06 -20 
Maternal acceleration | 3-6 | —.03 —.12 
Maternal acceleration | 6-10) 10 “sane 
Adult interview vari- 

ables 

Recognition 48%** 25 

Achievement ‘38920 .38** 

Intellectual concern .49**+ K” bdel 


* p < 05; one-tailed. 
** p < .02; one-tailed. 
*** p < 01; one-tailed. 


change. Earlier studies (Kagan et al., 1958; 
Sontag et al., 1958) have demonstrated that 
the amount of increase in IQ correlated both 
with independent behavioral indices of achieve- 
ment strivings for age 6-10, and with early 
adolescent (age 10-14) achievement stories. 
These latter relations remain significant when 
the influence of the child’s IQ at age 6 is 
statistically controlled. The present data al- 
lowed for a partial validation of these results 
and an extension of the Sontag et al. findings 
for adolescence and adulthood.® 

Table 3 presents the correlations between 
changes in IQ during age 6-10 and the longi- 
tudinal and adult behaviors. The amount of 
IQ increase was a fairly sensitive predictor of 
both intellectual achievement for age 10-14 
(r = .37 and .41 for boys and girls; p < 01), 
and concern with intellectual competence in 
adulthood (r.= .49 and 42; p < .01). These 
Tesults support and extend the earlier studies 
and indicate that amount of IQ increase during 
the first 4 years of school is a moderately ac- 


€ In the present sample of 71 subjects, 50% of the 
males and 20% of the females overlapped with the 
group of 70 subjects studied by Sontag, Baker, and 
Nelson (1958). However, their data only dealt with the 
Period from 3 to 10 years of age. The present behav- 
loral material covered adolescence and early adulthood. 
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TABLE 4 
INTERCORRELATIONS AMONG VARIABLES RATED FOR 
AcE 6-10 
ition Achieve- te 
Recognition Acheve- Matera) 
Recognition = A TISSA fala 
Achievement 60*** — Es iaa 
Maternal accelera-  „39* 44r ~ 
tion 


Note,—Data for males are in upper right; for females, in lower 
left, 
* p < .05; two-tailed. 
*** p < .01; two-tailed. 


curate index of the subject’s motivation to 
master intellectual tasks during adolescence 
and early adulthood. It is important to note 
that IQ change showed no relation to mechani- 
cal or athletic strivings for boys, and was nega- 
tively associated with athletic achievement for 
girls (r = —.46; p < .02). Thus, IQ increase is 
not a general measure of achievement strivings 
for all areas of task mastery. The IQ change 
measure predicts all three adult achievement 
ratings because the three adult variables are 
heavily weighted with concern over intellectual 
competence. Finally, the maternal acceleration 
rating for age 6-10 showed a positive relation 
with IQ change for girls (r = +.54; p < .01) 
but not for boys. 


Intercorrelations among the Measures 


There were, as might be anticipated, positive 
correlations among the achievement and recog- 
nition ratings. Tables 4, 5, and 6 present the 
intercorrelations among the variables that were 
rated for ages 6-10, 10-14, and adulthood. The 
correlations for the males are above and to the 
right of the diagonal; the female data are to 
the left and below the diagonal. 

For all three age periods there were high, 
positive correlations among the achievement 
and recognition variables. For age 10-14, recog- 
nition behavior was highly correlated with 
achievement strivings in the intellectual area, 
but only minimally related to mechanical or 
athletic achievement. This finding suggests 
that, for this middle class sample, mastery of 
intellectual skills is the primary method chosen 
to obtain social recognition. Perhaps for lower 
class samples this generalization might be less 
valid. The high correlations between recogni- 
tion behavior and intellectual concern in adult- 
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TABLE 5 
INTERCORRELATIONS AMONG VARIABLES RATED FOR 
AcE 10-14 
Intellec- | Mechan- 
Recog- | “tual | teat | Atbetic 
nition | Achieve- | Achieve- ment 
ment ment 
Recognition = statt tii 123 -04 
Intellectual 60*** = .53%98] = 1 
achievement 
Mechanical au -32 = —.23 
achievement 
Athletic +24 12 07 - 
achievement 


Note.—Data for males are in upper right; for females, in lower 
left. 
*** p < .01; two-tailed. 


TABLE 6 
INTERCORRELATIONS AMONG ADULT INTERVIEW 
VARIABLES 

Achieve-  Recog- Intellec- 

ment nition tual 
Achievement — bee Pri riin 
Recognition Pike deg — ieee 
Intellectual TIOE, 844+ — 


Note.—Data for males are in upper right; for females, in lower 
left. 
*** p < .01; two-tailed. 


hood, together with the fact that maternal 
acceleration predicted both variables, suggests 
that it is difficult to separate “recognition seek- 
ing behavior” from “attempts to improve in- 
tellectual competence.” 


Discussion 


Stability of Achievement Strivings 


The results indicate that strivings for in- 
tellectual mastery are moderately stable from 
the school years through early adulthood. This 
behavioral disposition emerges as a stable phe- 
nomenon at ages 3-6 for girls and 6-10 for boys. 
The stability of the behavior ratings is paral- 
leled by the moderate stability of TAT achieve- 
ment stories over a shorter age span. Moreover, 
achievement stories in adolescence and adult- 
hood also predicted the adult behavior ratings. 
This consistent cluster of correlations adds con- 
struct validity to the TAT achievement varia- 
ble and support to the conclusion that this class 
of behaviors is stable over time. 

Involvement in athletics for age 10-14 
showed no strong, positive relation to either IQ 
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increase or adult achievement behavior and, 
in a few instances, negative relationships oc- 
curred. This vas not because the interviewer 
failed to assess adult involvement in this par- 
ticular activity. Rather, many of the adults 
who had been involved in athletics as early 
adolescents were not overly concerned with 
task mastery as adults and they tended to 
avoid intellectual activities. 

The majority of the sample regarded posi- 
tions of responsibility, intellectual challenge, 
and knowledge of the environment as highly 
desirable goals. If a subject had strong achieve- 
ment motives he tended to gratify them 
through intellectually oriented endeavors. It 
is suggested that the mass media and social 
environment differentially emphasize the im- 
portance of different skills in accordance with 
the sex and age role characteristics of the in- 
dividual. For adults, there tends to be an em- 
phasis on intellectual competence and a de-em- 
phasis on active mastery of athletic skills. 
Moreover, intellectual mastery is less involved 
in potential sex role conflict than mechanical or 
athletic behaviors. To excel at sports is one of 
the defining characteristics of masculinity. 
Some boys become involved in athletics in or- 
der to maintain their sex role identity and 
avoid peer rejection. An athletic girl will be 
subject to peer rejection for excessive partici- 
pation in athletics. Thus, athletic mastery 1s 
under the control of motives and conflicts re- 
lated to sex role identification in addition to 
needs for task mastery. 

This latter point raises the question of the 
appropriate definition of achievement behavior 
and motivation. It is suggested that the con- 
cept of a general achievement motive is too 
broad a term, and it may be useful to replace 
this construct with a series of variables that 
relate to more specific behaviors. It seems more 
reasonable to talk about “desire to improve m- 
tellectual skills,” or “desire to improve athletic 
skills” than to use the more global concept of 
need achievement. Individuals strive to perfect 
skills in different areas, and the motivations 
for these strivings are multiple. Prediction an 
comprehension of these phenomena might be 
facilitated if there was some differentiation 
among the behaviors and motives that are m- 
volved in task mastery. 

The lack of predictive power of age 0-3 mas- 


? 


tery behavior might have been due to the 
greater difficulty in rating this variable (the 
interrater reliability was .35). On the other 
_ hand, the behavioral referents for this rating 
‘differed from those used to assess mastery for 
the older age periods. Since 2-year-olds do not 
“initiate intellectual or athletic nfastery be- 
_ havior, persistence with simple, perceptual- 
" motor tasks (stringing beads, building towers) 
was the basis for this early rating. A high rat- 
‘ing for 0-3 reflected a high threshold for satia- 
tion with simple, sensorimotor activities. At the 
‘older ages, the achievement rating was based 
on involvement with problem solving behaviors 
that were more similar in form to adult achieve- 
ment behavior. The age 0-3 rating is dynami- 
cally different from the symbolic behaviors that 
characterize achievement during the preschool 
and school years.’ This statement is supported 
by the fact that achievement for age 0-3 was 
_ negatively correlated with achievement for age 
3-6 (r = —.20) and age 6-10 (r = —.03), and 
Showed no relationship to achievement themes 
or IQ change. Persistence with simple sensori- 
Motor tasks during the first 2 or 3 years of life 
‘is not an index of future intellectual, achieve- 
ment strivings. The 2-year-old who will sit for 
20 minutes trying to put a peg in a hole is not 
_ hecessarily the ambitious scholar of the fifth 
= grade. 
Maternal Acceleration and Achievement 


The ratings of maternal concern with the 
child’s developmental skills were heavily 
Weighted with encouragement of intellectual 
Progress. The most consistent correlates of ma- 
ternal acceleration were found with the ratings 
_ of adult concern with intellectual competence. 
Maternal acceleration for age 0-3 was slightly 
_ More predictive of adult behavior for girls than 
for boys. Similarly, maternal acceleration for 
age 6-10 was more predictive of IQ increase for 

daughters than for sons. The sex difference be- 
_ tween these latter two correlations was signifi- 
Cant at the .05 level. It is suggested that since 


< 


__, "The differences in the content of mastery behaviors 
for ages 0-3 and 3-6 are analogous to Piaget’s (1952) 
Wi description of intellectual development. Piaget suggests 
_ that during the first 2 years the child is in the sensori- 
_ Motor stage of intelligence in which simple perceptual- 
7 Motor activity is salient. During the subsequent pre- 
pd School years the child’s intellectual activity becomes 
be more symbolic and more comparable to adult problem 
solving behavior. 
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the girl was more likely than the boy to identify 
with the mother, maternal encouragement of 
intellectual mastery should have had a greater 
effect on the development of the girl than on 
the boy.8 


TAT Achievement Stories 


Achievement themes on the TAT were mod- 
erately stable and were correlated with adult 
achievement behavior. The fact that the cor- 
relations were as high for females as for males, 
although the three critical cards illustrated 
male heroes, raises some question concerning 
the validity of the hero hypothesis. Since Cards 
1, 14, and 17BM all picture a male in a poten- 
tial achievement situation, one might expect 
that achievement themes for women would not 
be highly correlated with their achievement 
behavior. The present results indicate that the 
production of achievement themes may be 
more influenced by the subject’s conception of 
what behaviors are appropriate for the hero, 
than by the degree of identification of story- 
teller with hero. Perhaps high achievement girls 
conceptualize the male role as being more asso- 
ciated with task mastery than do low achieve- 
ment girls. 

Atkinson (1957) has suggested that achieve- 
ment themes have differential validity depend- 
ing on whether or not the criterion task engages 
thesubject’s motivation. The achievement vari- 
ables used in this study (ratings of overt be- 
havior, IQ increase scores) measured ‘“‘real life” 
behaviors that would be expected to engage the 
subject’s motivation. The positive correlations 
obtained indicate that achievement themes are 
valid indices of intellective mastery when the 
conditions under which the behavioral samples 
are obtained are motive arousing. 

Limitations on Generalizability 

Although the stability correlations for 
achievement behavior are fairly high, the na- 
ture of this particular sample favored stability. 
The social milieu of these subjects remain con- 
stant throughout the first 17 years of their 
lives, and the parents and peers of these sub- 
jects retained their same values. The degree of 
stability obtained with this sample might not 


8 Research in progress at the institute, under the 
direction of Vaughn J. Crandall and Walter Katkovsky 
is assessing the role of both mother and father in the 
adoption of achievement motives and behaviors. 
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hold for populations that were more mobile, for 
different ethnic or social class groups, or for 
children subjected to major developmental 
traumata. 


Social Reinforcement and Stability 


The stability of achievement behavior is con- 
gruent with general reinforcement theory. 
Each time achievement strivings are rewarded 
through social approval or internal feelings of 
satisfaction, the strength of this behavioral 
tendency should be increased. If achievement 
strivings lead to failure, these behaviors should 
extinguish. The child who attains scholastic 
honors through effort is rewarded by the social 
environment, and this experience frequently 
leads to an expectancy of future success for 
similar behavior. This rewarding experience, 
coupled with the strong cultural approval for 
intellectual competence, increases the prob- 
ability that the child will continue to engage in 
intellectual tasks. On the other hand, persist- 
ent failures in intellectually challenging situa- 
tions are likely to lead to an expectancy of 
failure, and these expectancies can result in 
avoidance and/or withdrawal from involve- 
ment in intellectual behavior. 


SUMMARY 


This paper summarized results from a larger 
investigation on the stability of behavior from 
childhood through adulthood. This investiga- 
tion dealt specifically with the long term stabil- 
ity of achievement and recognition seeking be- 
haviors in subjects who were part of the Fels 
Research Institute’s longitudinal population. 

The subjects were 36 males and 35 females 
for whom extensive longitudinal information 
was available from birth through 14 years of 
age. One psychologist, who had no knowledge 
of the adult behavior of these subjects, studied 
narrative reports based on observations of the 
child in a variety of settings, and rated each 
child on achievement and recognition seeking 
behaviors for four age periods: 0-3, 3-6, 6-10, 
and 10-14. In addition, ratings were made for 
maternal acceleration of developmental skills 
for the first three age periods. A second psy- 
chologist, who had no knowledge of the child- 

hood information, interviewed each subject in 
adulthood (age range 20-29) and rated him on 
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three variables related to achievement and 
ognition seeking behavior in adulthood, 

In additior, the following information 
available for most subjects: (a) a 7 card T} 
protocol administered during early adolesce 
(median age 14-6) anda 13 card TAT pı 


Stanford-Binet IQ tests from ages 5 through 
which furnished an IQ change score over t 


The major results were as follows; 

1. Both achievement and recognition sti 
ing behaviors for age 6-10 showed signific 
positive correlations with similar behay 
during adulthood, the correlations ranged 
-38 to .68 and all were significant at the .05 
or better. 

2. Involvement in intellectual activi 
age 10-14 showed high positive correla 
with achievement behavior in adulthood 
involvement in athletics during these 3 
showed no positive relationship with 
achievement behavior. Involvement in 
chanical tasks for age 10-14 showed po 
correlations with adult achievement beh 
for boys but not for girls. 

3. Maternal acceleration of the child’ 
velopmental skills during the first 3 
life predicted adult achievement behai 
women but not for men. Maternal acce 
of developmental skills during age 6-10 
moderate correlations with adult concern 
intellectual competence in both sexes. 

4. Achievement stories told during a 
cence and adulthood showed high positive 
relations with adult achievement be 
Maternal acceleration of developmental 
showed suggestive correlations with the 0 
rence of achievement stories in both 
cence and adulthood. 

5. The amount of increase in IQ score di 
the years 6-10 showed high positive coi 
tions with the ratings of achievement beh 
during adulthood. In addition, increase 
showed positive correlations with age 
achievement behavior, and with striving 
intellectual competence during age | 
Amount of IQ increase showed negative 
lations with athletic achievement for age 

The results suggested that achii 
strivings during the first 4 years of school’ 
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moderately good index of future achievement 

behavior during adolescence and adulthood. 

There were high correlations between strivings 

for social recognition and intellectual achieve- 

ment and it was suggested that these variables 
are intimately related in a middle class popu- 

lation. + 

REFERENCES 

Arxinson, J. W. Motivational determinants of risk 
taking behavior. Psychol. Rev., 1957, 64, 359-372. 

Kacan, J., & Moss, H. A. The stability and validity 
of achievement fantasy. J. abnorm. soc. Psychol., 
1959, 58, 357-364. 

Kacan, J., & Moss, H. A. The stability of passive and 
dependent behavior from childhood through adult- 
hood, Child Develpm., in press. 

Kacan, J., Sonrac, L. W., BAKER, C. T., & NELSON, 


513 


VIRGINIA L, Personality and IQ change. J. abnorm. 
soc. Psychol., 1958, 26, 261-266. 

McCtetranp, D. C., ArKwson, J. W., CLARK, R. A., 
& LoweLL, E. L. The achievement motive. New 
York: Appleton-Century-Crofts, 1953. 

Moon, A. M. Introduction to the theory of statistics. New 
York: McGraw-Hill, 1950. 

Murray, H. A. Thematic Apperception Test Manual. 
Cambridge: Harvard Univer. Press, 1943, 

Perters, C. C., & VAN Vooruis, W. R. Statistical pro- 
cedures and their mathematical bases. New York: 
McGraw-Hill, 1940. 

Pracer, J. The origins of intelligence in children, New 
York: International Univer. Press, 1952. 

Sontac, L. W., Barer, C. T., & NELSON, VRoma L. 
Mental growth and personality development. 
Monogr. Soc. Res. Child Develpm., 1958, 23, No. 68. 


(Received April 11, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol. 62, No. 3, 514-519 


A REINTERPRETATION OF SOME 


IN CONCEPTUAL BREADTH? 


LOREN J. CHAPMAN? 
University of Kentucky 


ANY clinicians and experimental in- 
M vestigators have reported that 
schizophrenics err in their assign- 
ment of objects to common conceptual cate- 
gories, but the various writers differ on the 
nature of the error. Some writers have de- 
scribed it as an inappropriate enlarging of the 
application of a conceptual category, while 
others have described it as an inappropriate 
restricting. Observers usually infer inappro- 
priate enlarging when subjects overinclude, 
i.e., include incorrect objects in a conceptual 
category, and they infer inappropriate re- 
stricting when subjects overexclude, i.e., ex- 
clude from a category objects that belong in it. 
Cameron (1939) emphasizes the first of these, 
but it should be pointed out that he uses the 
term “‘overinclusion” to refer to a wider range 
of phenomena, Writers who regard schizo- 
phrenic thinking disorder as “concrete,” such 
as Goldstein and Scheerer (1941), put primary 
emphasis on overexclusion. Using more spe- 
cialized tests, Chapman and Taylor (1957) and 
Epstein (1953) found only overinclusion, not 
overexclusion. Zaslow (1950) claimed to find 
both, but as Kugelmass and Foundeur (1955) 
pointed out, the results are doubtful because 
of a lack of reliability of Zaslow’s one item 
test. The present investigation is an attempt 
to reconcile the apparently conflicting observa- 
tions of these writers. 
Schizophrenics often appear to deal with 
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PATHOLOGICAL DISTURBANCES 


test materidls in ways which they prefer rather 


than the ways in which they are instruc 


not a tendency toward overexclusion or o 
inclusion, but a tendency to use concepts of 


priateness. Thus, if a schizophrenic is in- 
structed to sort by a more narrow concept than 
he prefers, he will be more overinclusive; 
if he is asked to sort by a broader concept tha 
he prefers, he will be more overexclusive. 
other words, what may in part underlie 
frequently observed inappropriate enlarging 
restricting of the application of common coi 
cepts is in fact a preference for concepts of 
particular breadth. 

The present study is an extension of 
method partially developed earlier (Chapma 
& Taylor, 1957) when schizophrenics were 
compared with normal subjects on a 
sorting test designed to measure inappropriat 
enlargement or restriction of the applicatio 
of common conceptual categories. Each sub 
ject was given a pile of cards, each of which” 
had printed on it the name of an object, and he 
was asked to sort all those that fell into a 
named conceptual category into one pile (e.g 
all the fruit) and everything else into a secon 
pile. The finding was that the schizophreni 
committed more errors of overinclusion t 
the normals. Moreover, items which were more 
often incorrectly included were ones whi 
shared with the correct objects another comz 
mon conceptual category which subsumed the 
classification specified in the instructions. 
These results suggested that the subject’s over 
inclusion errors resulted from his sorting by 
broader concepts than he was instructed to use: 
However, the schizophrenics did not commit 
more errors of overexclusion than the nor: 
subjects. 

In the present study the breadth of the con- 
ceptual category is varied. The hypothesis E 
that as the conceptual category specified in 
the instructions becomes broader, schi20- 
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phrenics increase their errors of overexclusion 
but decrease their errors of overinclusion. 

Since brain damaged patients have also been 
reported to show a disorder in application of 
conceptual categories, it seemed desirable to 
test a group of such patients as well as a schizo- 
phrenic and normal group in order to compare 
the two patient populations on overinclusive 
and overexclusive behavior. No specific pre- 
diction was made concerning the effect of vary- 
ing the breadth of conceptual categories on 
overinclusion and overexclusion by brain 
damaged patients. 


METHOD 


Two sets of tasks were used, one designed primarily 
to elicit errors of overinclusion, the other to elicit errors 
of overexclusion. 

Tasks primarily eliciting overinclusion. The subject 
was given a pile of 30 cards, each of which named one 
object, the objects falling into three categories, e.g., 10 
fruit, 10 vegetables, and 10 items of sports equipment. 
The experimenter presented an instruction card nam- 
ing one of the categories. This card remained visible 
throughout the sorting. The subject was asked to sort 
the cards into two boxes: e.g., fruit into one box and 
everything else into another. The measure of over- 
inclusion was the number of incorrect cards included 
with the fruit. Errors of overinclusion tend to be elicited 
on this task because such errors are often the result of 
sorting by a broader concept than the one specified in 
the instructions. In the example, the broader concept 
of edibles would include vegetables as well as fruit. In 
addition to this measure of overinclusion, the task 
provided a measure of the extent to which overinclusion 
occurs on the basis of the subject’s using excessively 
broad concepts. If the subject sorted by a broad con- 
cept of a similar kind that subsumed the one specified 
by the instructions, he more often included items from 
the incorrect but similar category (e.g., vegetables) 
than items from the dissimilar category (sports equip- 
ment). However, errors of other kinds—those due to 
carelessness, cards sticking together, inability or un- 
willingness to take the test, or sorting by another uni- 
dentified concept—would result in placements in the 
two boxes equally often. This does not imply, of course, 
that the broader concept offered in the task is the only 
broader concept toward which the patient is inclined. 
Overinclusive tendencies on the basis of similarity 
would be expected to lead toward many kinds of re- 
Sponses other than the ones measured here. Thus, a 
Score of number of vegetables placed with the fruit, 
minus the number of sports equipment that are so 
Placed, measured the tendency to sort by inappropri- 
ately broad concepts, independent of errors of other 
types. In order that differences in sorting the three 
Categories of objects would not be attributable to differ- 
ences in their familiarity, the three sets of words were 
equated on frequency of occurrence in the language as 
indicated by Thorndike and Lorge’s (1944) average of 
three word counts. 
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The “fruit” concept was used as a comparatively 
narrow concept, and the 30 cards used for it are called 
here Task IA-Incl. (Level I). A second task with a 
comparatively narrow concept was Task IB-Incl. for 
which the instructed concept was “birds.” In addition 
to the names of 10 birds, the subject was given 10 in- 
sect names (incorrect similar) and 10 words referring to 
articles of clothing (incorrect dissimilar). 

For sorting by broader concepts (called here Level 
II concepts), the materials from Level I tasks were 
combined with additional cards as described below. 
For Task I[A-Incl., the 30 cards of IA-Incl. (fruits, 
vegetables, and sports equipment) were combined with 
10 cards naming trees, 10 naming flowers, and 10 nam- 
ing musical instruments, with the instructed concept 
being “Things that a person can eat.” The 20 fruits 
and vegetables were scored as correct, the 20 trees and 
flowers as incorrect similar, and the 20 musical instru- 
ments as incorrect dissimilar. For Task I1B-Incl., the 
30 cards from IB-Incl. (birds, insects, and clothing) 
were combined with 10 cards naming wild animals, 10 
domesticated animals, and 10 tools. The subject was 
instructed to sort out the “Creatures that fly.” The 20 
birds and insects were scored as correct, the 20 animals 
as incorrect similar, and the 20 tools and clothing as 
incorrect dissimilar. 

Tt is seen that on Task ITA-Incl. as well as ITB-Incl, 
the instructed concept was by definition broader than 
the corresponding one at Level I, because in each case 
the Level II concept subsumed Level I as a subconcept. 
The hypothesis was that on Level II there would be 
fewer errors of overinclusion than on Level I. 

Tasks eliciting overexclusion. In order to elicit errors 
of overexclusion, the same combinations of cards were 
used as in the tasks testing for errors of overinclusion, 
but subject was instructed to sort by different concepts. 
For example, for Task IA-Excl., the concept was 
“Things that a person can eat,” and for Task IB-Excl., 
it was “Creatures that fly.” The concept for ITB-Excl. 
was “Things that can move without outside help,” and 
for ITA-Excl., “Things that grow.” These concepts 
subsume more of the available items and hence provide 
more opportunity for overexclusion errors on that 
particular pile of cards than do the instructions designed 
to elicit overinclusion. Also, the expectation was that 
if a subject were disposed toward errors of overexclusion, 
he would have a tendency to sort by more narrow con- 
cepts than specified in the instructions. Because the 
conceptual category the subject was asked to sort by 
always included as subconcepts two common and 
readily identifiable categories of objects, these tasks 
were expected to elicit overexclusion errors. In addition, 
the concepts used for both Level I and IT on these 
tasks are one level of abstraction higher than those in 
the tasks eliciting overinclusion because the instructed 
concepts subsume as a subclass the concepts used with 
the same response cards in eliciting overinclusion. 

Additional tasks. In addition to these tasks, four 
others were given, which, although they do not fit into 
the scheme of levels of abstraction, give additional data 
on tendencies toward overinclusion and overexclusion. 
Task IC-Incl. consisted of 10 domesticated animals, 
10 wild animals, and 10 tools, with the instruction 
“Animals that are kept on the home or on the farm.” 
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Task IC-Excl. used the same cards with the instruction 
“Animals,”” 

Tasks ID-Incl. consisted of 10 trees, 10 flowers, 
and 10 musical instruments, with the instruction 
“Names of trees.” Task ID-Excl. used these cards 
with the instruction “Things that grow.” 

There were 12 separate sortings in all. Every subject 
received all 12 in a prearranged order such that the 
subject was never asked to do consecutive sortings by 
the same conceptual category, and the overinclusion 
and oyerexclusion tasks as well as tasks from the two 
levels were intermixed. This order was established by 
a circular plan, with consecutive subjects from a given 
experimental group beginning the experiment with a 
sorting which was one removed from that of the pre- 
vious subject. This design was intended to minimize 
the possibility that differences between groups of sub- 
jects as to their performance on the various sortings 
were due to practice or fatigue effects. 

‘Subjects. Subjects were white males who showed no 
indication of mental deficiency and who had completed 
at least the sixth grade. They included 52 schizophren- 
ics, 39 brain damaged patients, and 33 normal subjects. 

The schizophrenic subjects were chronic patients of 
whom 40 were on a “regressed” ward at Chicago State 
Hospital. Seven were from other Chicago State Hospital 
wards, and 5 were from Downey Veterans Administra- 
tion Hospital. Also, none of the patients had received 
recent somatic therapy, and none was being treated 
with psychopharmacological agents. 

Although no attempt was made to select patients by 
diagnostic subtype, the schizophrenic group was found 
to include 6 diagnosed as paranoid, 16 hebephrenic, 6 

catatonic, 2 simple, and 22 without classic subtype. 
No patient was included for whom there was any sus- 
picion of brain damage. All patients had spent at least 
one year in the institution prior to testing, and most 
of them had spent a large part of their lives in the 
hospital. |; 

‘The brain damaged patients were obtained at Hines 
Veterans Administration Hospital. Only those patients 
were included who were diagnosed as suffering from 
cerebral damage. Three subjects bore diagnoses of brain 
tumor, 3 of head injuries, 5 of multiple sclerosis, 1 of 
Huntington’s chorea, 1 of chronic brain syndrome, 24 
of cerebral vascular disorders, and 2 others. No brain 
damaged patient was included who appeared on the 
basis of medical history or interview to be suffering 
from severe emotional disorder. The brain damaged 
patients as a group had been hospitalized for a shorter 
term than the schizophrenics. About one-third of them 
had entered during the previous 3 months, and none 
had been hospitalized longer than 5 years. Since these 
patients were being tested in order to study the nature 
of cognitive disturbance resulting from brain injury, 
only those patients were included who appeared by 
clinical interyiew to have some intellectual loss or dis- 
turbance. This decision was usually made on the basis 
of the patient’s difficulty in communication, serious 
disturbance of memory, or severe lack of alertness. 

The normal subjects were ambulatory, psychi- 
atrically normal medical and surgical patients at the 
United States Public Health Service hospital and the 
Ilinois Central Hospital in Chicago. The mean age of 
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the three groups was schizophrenic 44, brain damaged 
44, and normals 45. Mean education was 10.2, 10.6, 
and 10.1, respectively. The three groups did not differ 
statistically with respect to mean age or education, 
However, the variances of both age and education were 
not homogeneous, the values being significantly greater 
for the brain damaged patients than for either of the 
other two groups (p = .05 or less). The variances did 
not differ significantly between the schizophrenic and 
normal groups. 


RESULTS 


Although the overinclusion tasks were de- 
signed to elicit overinclusion errors, some over- 
exclusion errors could occur. Similarly, overin- 
clusion errors could occur on the overexclusion 
tasks, Table 1 lists mean total overinclusion 
and overexclusion error scores at each level for 
both sets of tasks as well as probability state- 
ments concerning differences in error scores 
between levels. Since there were only half as 
many cards and therefore half as many oppor- 
tunities for both kinds of error on Level I as on 
Level II, the scores on Level I were doubled to 
make them comparable. Because the distribu- 
tions were extremely skewed and the groups 
small, all differences within groups were tested 
for significance by a two-tailed Wilcoxon test, 
and all differences between groups by a two- 
tailed Mann-Whitney test. 

As seen in Table 1, the schizophrenic pa- 
tients, as predicted, showed changes in number 
of errors between Level I and Level II that 
were in opposite directions for the two kinds 
of errors. Overexclusion errors increased from 
Level I to Level II, and overinclusion errors 
decreased. Both differences were significant. 
The brain damaged patients also showed a 
significant rise in overexclusion errors between 
task levels, although the drop in overinclusion 
errors was significant only on the oyerinclusion 
tasks, not on the combination of both sets of 
tasks (p = .18). For the normal subjects, the 
differences are in the same direction but not 
significant. 

Let us now consider the question of whether 
the overinclusion errors resulted from use 0: 
excessively broad concepts. Table 2 breaks 
down the overinclusion errors on the overinclu- 
sion tasks into those that occurred on items 
similar to the conceptual category specified for 
sorting as against those dissimilar. As in the 
Chapman and Taylor study (1957), the m- 
correct inclusions were more often from the 
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similar category than from the dissimilar; 
whereas random errors, or errors other than 
those resulting from sorting by am excessively 
broad concept, fell equally often in similar and 
dissimilar errors. For example, when instructed 
to sort by “fruit,” subjects included vege- 
tables more often than sports equiprient. This 
indicates that the errors of overinclusion re- 
sulted from sorting by an excessively broad 
concept, since the incorrect similar items to- 
gether with the correct items formed a slightly 
broader conceptual category than the one the 
subject was asked to sort by. The data from 
the two additional tasks yielded comparable 
results. 

Of more interest to the present study, how- 
ever, is the finding that the difference in over- 


TABLE 1 


Mean TOTAL OVERINCLUSION AND OVEREXCLUSION 
Error Scores on Each Set or Tasks 


Overinclusion Errors | Overexclusion Errors 
Level g g s s 

So OA- p p 

$99 EFS) Total FEE REE Total 

Sos |e os| S5 

5° 5" 5° |6 
Schizophrenic 
Te 10.57 | 1,61 | 12.18 | 1.39 | 2.61 | 4.00 
I 6.85%*| 2.08 | 8.93**| 2.40*| 8.06%" | 10.46** 
Brain damaged 
Ja 3.23 | .15| 3.38 |1.49|1.08 | 2.57 
Ir .36* | .33 | .69 | 1.00 | 5.08" | 6.08* 
Normal 
r 2.12 | .12| 2.24 | .61 | 1.52 2.13 
u -42 | .33| -75 | .94 | 2.94 3.88 


A The Level I scores are doubled to make them comparable to 
Level II scores, 

* The difference between levels is significant at the 5% level. 

** The difference between levels is significant at the 1% level. 


TABLE 2 


MEAN OVERINCLUSION ERROR SCORES ON THE OVER- 
INCLUSION TASK FOR ITEMS WHICH WERE 
SIMILAR AND DISSIMILAR TO THE 
INSTRUCTED CONCEPT 


Schizophrenic g Brin, Normal 
Level T° 
Similar 9.39 3,23 2.06 
Dissimilar 1.18 0 -06 
Level If 
Similar 5,56 10 33 
Dissimilar 1.29 -26 -09 


^ The Level I scores are doubled to make them comparable 
to Level II scores. 
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inclusion behavior between Levels I and II is 
primarily attributable to a difference in the 
incorrect similar category. A score of similar 
errors significantly distinguishes the two levels 
at the .01 level for both the schizophrenics and 
the brain damaged, while, as expected, the 
score of dissimilar errors does not distinguish 
the levels for either group. These findings were 
investigated more directly by computing for 
each subject a score of similar errors minus 
dissimilar errors. As discussed above, this dif- 
ference score is a relatively pure measure of 
sorting by excessively broad concepts with 
other kinds of errors controlled. On this score, 
the two levels were distinguished for both 
pathological groups (p < .001). None of these 
differences was significant for the normal 
group. We may conclude, therefore, that for 
the patients there is a difference between the 
two levels not only in overinclusion errors, but 
also in overinclusion errors that are attribut- 
able to sorting by a broader concept than the 
instructed one. 

A similar question is whether the overexclu- 
sion errors resulted from the use of excessively 
narrow concepts. If this were the case, any 
subject who excludes would be expected to sort 
by only one of the subconcepts that the experi- 
menter built into the concept specified for 
sorting, However, there is no basis on which to 
predict which of the several such subconcepts 
the subject would use for sorting; and there is 
no conventional statistical technique to test the 
relevant contention. A qualitative inspection 
of the data indicated that the prediction was 
probably not confirmed, since for most in- 
dividual subjects there did not appear to be 
among the various subcategories any differen- 
tial excluding. Therefore, no extended attempt 
was made to obtain a suitable statistical test. 

Comparison of schizophrenic and brain dam- 
aged subjects. As the foregoing indicates, all of 
the effects which were preditted and found for 
the schizophrenics were also found for the most 
part among the brain damaged patients. How- 
ever, it is of interest to compare the schizo- 
phrenic and brain damaged on their relative 
overall tendencies to overinclude or overex- 
clude. In order to measure this, all of the 
available data were used, including the “addi- 
tional tasks.” Altogether, there were 240 cards 
which could be incorrectly included and 240 
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Ratio of Inclusion Errors to Total Errors 


Fic. 1, Comparison of schizophrenic and brain 
damaged subjects on preference for overinclusion or 
overexclusion errors. 


which could be incorrectly excluded. The score 
used was the proportion of total errors which 
were inclusion errors. Figure 1 shows the dis- 
tributions of scores of schizophrenic and brain 
damaged patients on this index. In computing 
the data for Figure 1, those subjects were 
omitted who made fewer than five errors in all 
because a proportion score is highly unreliable 
for very low error scores. These omissions left 
a total of 29 brain damaged and 42 schizo- 
phrenic patients. 

As indicated in Figure 1, 70% of the schizo- 
phrenics made more overinclusion errors than 
overexclusion errors, whereas 66% of the brain 
damaged showed the opposite preference, i.e., 
made more overexclusion errors than overin- 
clusion. Note, however, the apparent bimo- 
dality of the schizophrenics’ distribution. Al- 
though the schizophrenics, as a group, were 
significantly more overinclusive than the brain 
damaged (p < .01), 17% (seven of them) had 
a high proportion of errors on the overexclusion 
side. 

An attempt was made to compare these 
seven patients on other variables with the main 
body of the schizophrenic group. They were 
not found to differ on age, education, chronic- 
ity, diagnostic subtype, or variety of symp- 
toms. There was some suggestion that the 
seven overexcluders tended to be more effec- 
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tive hospital workers than the main body 
schizophrenics. 


Discussion 


The main hypothesis was confirmed in that 
the schizophrenics decreased their errors of 
overinclusion and increased their errors of 
overexclusion as the concepts specified by the 
instructions became broader. The same effects 
were also found for the most part with the 
brain damaged patients. Thus, for both kinds 
of patients there is a tendency to sort by con- 


cepts of a certain preferred breadth which ac- 


counts, in part, for both overinclusion errors 
and overexclusion errors. However, the tasks 
may differ from one another in ways other than 
conceptual breadth. 

It is interesting to note, nevertheless, that 
the two groups of patients differed in relative 
tendencies toward overinclusion and overex- 
clusion errors. This may reflect a difference be- 
tween the groups on breadth of preferred sort- 
ing, or it may instead reflect tendencies toward 
overinclusion or overexclusion in addition to 


that arising from a preferred breadth. The first | 


interpretation is consistent with McGaughran 
and Moran’s (1957) finding that, when allowed 
to choose their own concepts on an object 
sorting test, brain damaged patients use nar- 
rower concepts than schizophrenics. 

The normal subjects also showed differences 
between task levels of the same sort as the pa- 
tients, which, although not significant, suggest 
the possibility that the effects observed for the 
pathological groups represent an increase of a 
normal error tendency. This is consistent with 
evidence pointed out by Chapman (1958) that 
many features of schizophrenic thinking dis- 
order are accentuations of normal error pat- 
terns, 


SUMMARY 


The present study was designed to investi- 
gate the possibility that both overinclusion an 
overexclusion errors on conceptual sorting 
tasks by schizophrenics are attributable, at 
least in part, to a tendency to use concepts © 
a specific preferred breadth regardless of their 
appropriateness. This suggestion was pie 
gated by means of a task in which names A 
objects were sorted on the basis of conceptu 
categories of varying breadth. The hypothesis 


ai 
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was that as the instructed conceptual category 
becomes broader, schizophrenics increase their 
errors of overexclusion but decreasé their errors 
of overinclusion. 

The hypothesis was confirmed. In addition, 
a group of brain damaged patients to a large 


“extent showed the same effect, whereas a nor- 


mal group showed only qualitative nonsignifi- 
cant trends in this direction. However, the 
schizophrenic group on all tasks combined 
made more overinclusion errors than overex- 
clusion, and the brain damaged showed the 
opposite error preference. 
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Adult Intelligence Scale (WAIS), some 
“1700 subjects, both sexes, aged 16-64 
plus an additional 475 subjects of both sexes, 
ages 60-75 and over” were used. In selecting 
the 1700 subjects, “occupation and education, 
as well as sex and age,...also geography, 
urban vs. rural and color” were simultaneously 
considered, and it was concluded that “the 
norms as they stand . . . seem to be reasonably 
representative of the country as a whole...” 
(Wechsler, 1958, pp. 86-88, 91). The additional 
475 subjects aged 60-75 and over referred to 
here formed the basis for an extension of the 
WAIS norms beyond age 64, but caution was 
expressed about using tables derived from 
these data because they were “not based on 
cases tested throughout the country according 
to census specifications but came from a special 
old age study” (Wechsler, 1955, p. 25). How- 
ever, there was an overlap of groups aged 60- 
64 in both the national standardization and in 
the special old-age study, which was conducted 
in Kansas City. The overlapping groups 
showed high comparability of performance 
with only one subtest, Picture Completion, 
indicating a reliable difference between groups 
at the .05 level (Doppelt & Wallace, 1955). 
The generality of the Kansas City results for 
the United States’ population over 65 has 
needed testing, and the opportunity to make 
comparisons with a population in another part 
of the country arose in the course of an investi- 
gation conducted at Duke University in 
Durham, North Carolina, in connection with a 
longitudinal study of the aged. In a previous 
paper (Eisdorfer, Busse, & Cohen, 1959) some 
question was raised as to the generality of the 
Kansas City results in view of strikingly con- 
sistent discrepancies between the Verbal and 
Performance IQs in the Duke Project popula- 
1 This investigation was supported in part by Re- 
search Grants M900C, H3582, M2109 of the National 
Institutes of Health, United States Public Health Sery- 
ice, The help of E. W. Busse is gratefully acknowledged. 


T establishing age norms for the Wechsler 
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tion when the Kansas City data were use 
comparison. Using these latter criteria, 
would have been expected that the 1 
ship between Verbal and Performan 
the Duke data would be random, nei 
nor the other being consistently superior: 
successive comparisons based on age, sex, 
socioeconomic status, high or low IQ § 
and community vs. hospitalized status i 
Duke Project data showed consis 
reliable differences favoring the Verl 
the Performance IQs. p 
In consequence, it seemed advisable 
pare systematically the data obtained ! 
final population of subjects available in t 
Duke study with those from Kansas City. 
present report is based upon this comp: 


METHOD 


Sample 


The Kansas City study set out to test 
viduals, although some 659 were listed to be 
on the basis of a “two stage, stratified, syst 
ple” in which every block in the area of four c 
embracing Kansas City was geographically 
and weighted by size, and a systematic pri 
of every nth block selected. Every nth dwelling’ 
nth block was selected further, and each person 
in these dwellings was selected for the study. A 
the WAIS was administered to 475 subjects, 0 
352 completed the study. Only data from © 
tests were used for the IQ tables—a total of 
jects (Wechsler, 1955, p. 23). 

The Duke sample of 239 subjects who c 
the full scale WAIS was drawn from volun! 
community, originally recruited through req 
radio and television as well as through Go 
Club meetings. The census characteristics of 
for age, sex, race, and occupation served as CI 
the recruitment of additional subjects to rep! 
various segments of the population in Durham 
county. The sample was built up over a 4; 

Actually 336 subjects were examined. Two 
tionalized groups in Durham (15 in a retirem\ 
and 64 in a state hospital) are not included i 
ent analysis. There were 18 subjects who did 
plete the full scale WAIS. These too are not 
Our report is therefore based on 239 subjects. 
Certain biases entered into the selection of 
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TABLE 1 


COMPARISONS OF NUMBERS OF SUBJECTS AND AGE, RACE, AND SEX CHARACTERISTICS OF THE 
Dure Project anp Kansas Crry Sroptes on tHe WAIS 


Duke Project 
Age White Negro Total Sample Ka otal Sampie 
Male Female Malè Female Male Female Total Male Female Total 
E ee ky aie. 
Tee 38 ose E A dling A e 
Subtotal g a 32 42 8 105 239 10 192 392 
Total 165 74 239 
a Wechsler (1955, Table 15, p. 23). 
Project sample. Since all subjects had to come to the Procedure 


Medical Center for a 2-day examination, only ambula- 
tory, reasonably well functioning individuals partici- 
pated in the study. Bedridden, immobile, and socially 
isolated individuals are probably not represented in re- 
lation to their incidence in the Durham population. 
Since the sample was built up over time, many volun- 
teers were not included in the study because they did 
not fit into the “cells” of the design that needed to be 
filled. Thus, toward the latter part of the study, “up- 
per” socioeconomic Negro male volunteers were sought 
for and accepted, while “lower” socioeconomic white 
female volunteers were not. It may well be that some 
systematic bias was incurred by the inclusion of sub- 
jects who volunteered early, since those individuals may 
have been more socially alert and more assertive than 
others of their age in the Durham population. 

In Table 1 may be seen the distribution of the sub- 
jects included in the Duke Study according to age, sex, 
and race characteristics. This distribution is compared 
with the Kansas City population on the same criteria. 
The total number of subjects in the Duke sample for the 
three age intervals 65-69, 70-74, and 75 plus is 67, 62, 
63; for the Kansas City sample, N = 86, 80, and 85. 
The Duke and Kansas City samples for the three older 
groups seem close enough to each other in number to 
warrant comparison. Both studies report & slightly 
higher number of female subjects in the sample. Al- 
though the Durham population characteristics show 
57% females over age 60, our Duke sample shows 52% 
female. The white/Negro proportions are not mentioned 
in the Kansas City study, but in the Duke sample, 31% 
are Negro, which conforms to 29% Negro in the Dur- 
ham population. Some of the sample characteristics of 
the Duke Project are attributable to placing sufficient 
numbers of subjects in each cell so that individual varia- 
tions would not distort the values of any cell. 

No data on social class are reported for the Kansas 
City sample, but subjects were dichotomized on this 
variable in the Duke Project on the basis of a modifi- 
cation of Warner’s Index of Status Characteristics 
(Warner, Meeker, & Eells, 1957) described in detail 
earlier (Eisdorfer et al., 1959), The “higher” status 
group included 103 subjects, whereas the “lower” socio- 
economic group included 149 subjects. 


The WAIS examinations were administered by one 
of three experienced examiners according to the stand- 
ard procedure outlined in the WAIS manual (Wechsler, 
1955). The results for each subject were treated as pre- 
scribed in the test manual, Scaled scores, Verbal, Per- 
formance, and Full Scale IQs were computed. 

In order to assess the relationship between the per- 
formance of the Duke and the Kansas City samples, 
comparisons were made by utilizing the data on Kansas 
City subjects reported by Doppelt and Wallace (1955). 


RESULTS 


Table 2 presents the Means and SDs of the 
Scaled scores for each of the WAIS subtests as 
well as for Verbal, Performance, and Full Scale 
scores and IQs. To assess the relationship be- 
tween these scores and the data reported for 
Kansas City, ¢ tests were performed to compare 
the Full Scale, Verbal, and Performance scores 
reported by Doppelt and Wallace (1955), and 
the data obtained in the present study. Table 
3 gives the results of this comparison. 

There is evidently a larger variance in the 
Duke sample, but no reliable difference be- 
tween the Full Scale scores was obtained from 
the two groups. The import of the slight differ- 
ences between the Full Scale scores of the two 
groups becomes clear on study of Sections B 
and C of Table 3. In Section B, subjects in age 
groups 70-74 and 75 plus in the Duke sample 
show significantly higher Verbal scores than 
do the subjects in the Kansas City sample. In 
Section C, we note that the Performance score 
of the 65-69 year old group of the Duke sample 
is significantly lower than the corresponding 
score in the Kansas City sample. It is interest- 
ing to observe also that in age group 60-64, 
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the Duke sample has a somewhat higher Verbal 
score and a somewhat lower Performance score. 
All of these comparisons confirm the previous 
report of Eisdorfer et al. (1959) of the con- 
sistent superiority of Verbal over Performance 
scores in an earlier Duke sample as compared 
with the Kansas City sample. 

A more intensive analysis of the Verbal 
scale subtest scores is possible since Doppelt 
and Wallace (1955) present the Means and 


TABLE 2 
Means AND STANDARD Deviations or ALL WAIS Suprest SCALED SCORES AND FULL SCALE, 
VERBAL, AND PERFORMANCE IQs yor DUKE Proyect SAMPLES 
Age Group Age Group Ase Group Age Group | 
wet) ne ws a) we 6) 
u sD u sD M sD M 7 
Information 9.76 4.02 9.19 3.46 10.16 3.82 9.12 
11.63 4.71 10.74 3.62 10.75 3.99 9.74 
Arithmetic 9.76 4.31 8.67 3.33 9.08 3.45 8.28 
Similarities 8.72 4.32 7.17 3.75 7.62 4.15 6.65 
Digit 9.34 3.40 7.91 2.73 8.11 2.76 7.41 
Vocal 10.65 4.24 9.85 3.89 10.56 4.38 9.90 
Digit Symbol 5.14 3.54 4.43 2.92 4.37 3.22 2.73 
Picture Completion 6.97 2.00 6.05 3.11 5.91 3.16 5.52 
Block Design 6.42 3.20 5.71 2.61 6.04 2,89 5.14 
Picture ageet 5.93 2.58 5.31 2.46 5.25 2.70 4.38 
Object Assem! 6.97 3.18 6.05 2.56 6.16 2.75 5.20 
Verbal score 59.89 22.78 53.70 18.12 56.14 20.00 50.96 
Performance score 31.46 = 12,95 27.61 11.62 27.75 12.43 23.00 
Full Scale score 91.23 34.68 81.31 28.89 83.90 31.24 73.74 
Verbal IQ 104.00 22.56 100.01 17.95 108.20 19.99 106.77 
Performance IQ 92.68 16.92 91.26 = 15.51 97.72 16.58 97.69 
Full Scale IQ 99.08 20.68 96.04 17.35 104,00 18.86 102,88 
Means, Staxparp nae Pe 
Deviations, AND ¢ Tests or FULL Vv: AND PERFORMANCE 
re BETWEEN DUKE Project AND ti ee S 
Duke Project Sample Kansas City Sample 
‘ 
N u sD N u sD 
A oti 47 91.23 
. 34.66 101 90.21 24.15 18 
65-9 67 81.31 28,89 86 88.13 22.92 1.57 
70-74 62 83.9 31.27 80 77.19 21.42 1.43 
75+ 63 73.74 33.36 85 68.71 21.56 1.03 
p woe 47 59.9 22.78 
p A 101 55,24 14.51 1.27 
65-09 67 5.70 18.12 86 53.73 14.51 01 
70-74 62 56.14 20.00 80 47.66 13.73 2.83 
75+ 63 30.96 21.27 85 42.02 14.16 2.23 
Ge O-A 47 31.46 12.95 101 59 
y . 34.97 10.94 1. 4 
65-09 67 27.61 11.62 86 34.40 10.05 3.77 
70-74 62 27.75 12.43 80 29.53 9.45 92 
75+ 63 23.00 13.23 85 24.68 9.54 4 


* As reported by Doppelt and Wallace (1955, Table 15, p. 328), 
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SDs of all their Verbal scale subtests. A com: 
parison of the Verbal subtest performa: 

the Duke sample with that of the Kansas 
sample is found in Table 4, which pre 
these data as well as the ¢ tests for diffe 
between the two samples. : 

Of special note is the Comprehension 

of the Duke sample, which is signific 
higher (¢ < .01) at every age level. Arith 
and Vocabulary subtest scales also 
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TABLE 4 


MEANS, STANDARD Deviations, AND ¢ Tests oF 
Duxe Pro: AND Kansas Crry SAMrirs 
AIS Vernar Sunvesrs* 


ON 


hot -320 
65-09 m Mi 
10-14 » 21.” 
1+ ss Lm 
Comprehension 
tA 108 swn 
65-09 ao 3a” 
10-74 LIES tonn 
15+ ss) 72 ieee 
Arithmetic 
BO 101 | 9.51 39 
-0 sé | 9.30 1.0% 
10-74 |) 7.6 nue 
1+ sinu Liur 
Similarities 
w- nor | 8.77 0 
6-0 6) 64S aw 
10-14 s |a am 
s+ 85 | 6.39 ss 
Digit Span 
4 108 | 6.82 m 
so » a 
wn » Gaal 
154 s 09 
Vocabulary 
0-64 ot 17 
- “ as 
vn » ase 
154 5 2.68" 


^ Doppelt and Wallace (1985, Table 9, p. 321). 
p< 05. 
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higher in the Duke sample, but the differences 
are significant only for the two oldest groups— 
70-74, and 75 plus, 

In 19 of the 24 comparisons of the Verbal 
subtest scores, the Duke sample is higher, 9 of 
the 19 being significant at the .0S level or 
better. The two older groups in the Duke 

In only 


opposite direction at a significant level. 

While Doppelt and Wallace (1955) presented 
the Means and SDs of all Verbal scale subtests 
for all subjects according to age, their presenta- 
tion of the Performance scale subtests was not 
organized in the same manner. They list Per- 
formance scale data for the four age groups 
separately for men and women. Thus, it was 
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E at dics oom tha pent study 
similarly. Table 5 contains the Performance 


ve 
level of p < .05, The Kansas City females in 
65-69 are significantly higher on 


6 reveal a consistently poorer Performance 
score on the part of the Duke sample; of the 


EF 
i 
E 
A 
g 
3 


gzz 
BH 
33 
H 
i 
H 


j 
+1 


ae 


appears to be similarity among the age groups 
as the correlation pattern emerges. Thus, these 
findings tend to support the contention of Dop- 
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TABLE 5 


DIFFERENCES BETWEEN MEAN SCALED SCORE OF MEN IN THE DUKE AND Kansas City 
SAMPLES ON EACH PERFORMANCE TEsT* 


Duke Sample Kansas City Sample 
Test Age 
N Mu SD N “M SD 
Digit Symbol 60-64 20 5.90 3.09 50 4.68 2.80 
65-69 30 3.56 2.66 44 5.07 1.83 
70-74 32 4.31 3.03 44 3.00 2.50 
75+ 32 3.03 2.55 40 2.40 2.40 
Picture Completion 60-64 20 8.05 2.45 50 7.34 2.48 
65-69 30 6.16 3.14 47 747 2.48 
70-74 32 6.31 2,91 47 6.87 2.65 
75+ 32 6.18 3.59 50 5.32 2.81 
Block Design 60-64 20 6.30 3.51 46 SR 3:13 
65-69 30 4.80 2.59 43 7.42 2.39 
70-74 32 5.90 2.96 44 6.02 2.62 
75+ 32 4.93 3.30 42 4.69 2.41 
Picture Arrangement 60-64 20 6.00 2.95 45 7.29 2.46 
65-69 30 5.00 2.44 43 6.86 2.51 
70-74 32 5.34 2.78 41 6:37 2.51 
15+ 32 4.59 3.09 40 5.05 2.37 
Object Assembly 60-64 20 6.65 3.01 46 1.33 2.77 
65-69 30 5.73 2.30 43 7.98 2.76 
10-74 32 5.87 2.56 40 6:25 . 2.95 
75+ 32 5.50 2.67 41 5.12 2.66 


* Doppelt and Wallace (1955, Table 7, p. 319). 
* Significant at .05, 
** Significant at .01. 


TABLE 6 
DIFFERENCES BETWEEN MEAN SCALE SCORES OF FEMALES IN THE DUKE AND KANSAS 
Crry SAMPLES on EACH PERFORMANCE TEST* 


Duke Sample Kansas City Sample 


Test Age 4 tests of f 
N u sD N M sD ieren 
Digit Symbol 60-64 AA AA E A 0 T5722, 83 1.33 
65-69 37 «5.13 2.98 CAD fiy «4.532.590 .97 
70-74 SO 4537) S150) 460) 3.41. 2.02 1.33 
75+ Gingiai2:41) 2077 EOL gs) 3102. 2,29 1.02 
Picture Completion 60-64 27 6.17 2.40 60 6.93 2.17 1.38 
65-69 375.97 3.13 SA 6:54. 2.40 -90 
70-74 3015.48. 3.47 51 5:35. | 2.48 a 
75+ 31 48 3.01 5.25 2.07 .67 
Block Design 60-64 27° 6.50 2.96 59 7.83 2.69 1.98 
65-69 376.45 2.42 497.61 2..69 2.05* 
70-74 30 6.20 2.90 49 6.55 2.53 +52 
75+ ES SS I0 S 5.54, 2.81 -26 
Picture Arrangement 60-64 275.89 «2,28 = 5964 T2354 1.35 
65-69 37 5.56 2.47 46 6.30 2.27 1.37 
70-74 30 «5.13 2.69 48 «5.48 2.09 -57 
15+ 31 4.16 2.59 50 4.98 2.32 1.41 
Object Assembly 60-64 IT O 2E VAIDA STVA T: 96t 13.13 -88 
65-69 SOA a: S46... 8.00 2.91 2.64* 
70-74 30. 6.37, «2.98 457,24 2.85 1,21 
75+ 314.90 2.79 55 5.51 2.93 -93 


2 Doppelt and Wallace (1955, Table 7, p. 319). 
* Significant at .05, 
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TABLE 7 
INTERCORRELATIONS OF THE Tests: AcE 60-64 DUKE SAMPLE* 
W = 47) 
5 6 7 8 9 10 11 12 13 14 

1, Information 
2, Comprehension 
3. Arithmetic 
4, Similarities 
5, Digit Span 
6. Vocabulary -123 
7. Digit Symbol -547 |.805 
8, Picture Completion +457 |.717 |.746 
9. Block Design 3 -508 |.698 |.655 |.620 
10. Picture Arrangement |. 502 |.655 |.653 |.662 |.613 
11, Object Assembly 492 |.645 |.652 |.571 |.728 |.731 
12. Verbal score? -782 |.961 |.842 |.758 |.744 |.699 |.692 
13, Performance score? -589 |.827 |.877 |.831 |.853 |.843 |.866 |.876 
14. Full Scale score? 1733 |.939 |.878 |.806 |.805 |.775 |.779 |.983 |.947 
15. Verbal IQ -183 |.964 |.840 |. 757 |.742 |.700 |.695 
16, Performance IQ "588 |.830 |.875 |.822 |.847 |.838 |.871 |.872 
17. Full Scale IQ "732 |.941 |.879 |.802 |.803 |.770 |.784 |.981 |.947 


® Compare with Doppelt and Wallace (1955, Table 10, p. 324). 


b Correlation of tests with Verbal, Performance, and Full Scale scores before correction for contamination, 


TABLE 8 
INTERCORRELATIONS OF THE Tests: AGE 65-69 DUKE SAMPLE 
(N = 67) 
1 | 21S st Sele Sonera in 

k eenation 
. Comprehensio: 846 
3. Arithmetic 2 767 |.679 
4, Similarities .769 |.818 |.686 
5. Digit Span 1503 |.576 |.493 |.396 
6. Vocabulary 360 |837 |.700 |.725 |.551 
1. Digit Symbol 767 |.703 |.632 |.686 |.496 |.774 
8. Picture Completion 787 |.769 |.719 |.706 |.511 729 |.719 
9. Block Design “625 |.632 |.560 |-603 |.423 1.639 |-656 |-589 | n 
10, Picture Arrangement |.719 |.685 |.634 |.733 "370 |.729 |.652 |.643 627 DA 
11. Object Assembly 572 |586 |.557 |.628 |.324 |-548 |-594 |.107 |.674 I-53 | ezg 
12. Verbal scoreb “909 |1927 |.830 |.860 |.649 |.914 |.791 |.827 -685 |. alent 
13. Performance scoreb |1828 |,802 |.740 |.794 |.509 |-813 .866 |.876 |.834 oy 826 | 88" | 055 
14, Full Scale score> 916 |.904 |.824 |.859 |-612 |.901 845 |.871 |.766 D eat . 
fee C927 |.924 |840 |:867 |-639 | 2 | ep 828 |: Ss (B13 |822 |873 
» Lertor re 3 A al . . G B 3 
t. Fal Seale TQ © “510 |:807 Bar | 266 |509 |.894 |.847 |.867 |.761 | .809 |730 ‘976 |.953 


{Compare with Doppelt and Wallace (1955, Table 11, p. 325). 
Correlation of tests with Verbal, Performance, and Full 


Pelt and Wallace concerning the factorial com- 
Position of the WAIS for these age groups. 


DISCUSSION 


The differences which appear between the 
Duke Project sample and the standardization 
sample derived from Kansas City favor the 
1 uke sample on the Verbal scale and, to @ 
esser extent, the Kansas City sample on Per- 
ormance scale items. 

The Duke 65-69 age group appears to differ 


‘Scale scores before correction for contamination. 


from the pattern of the Duke sample. As 
Tanlo 3 aena the Full Scale, Verbal, and 
Performance comparisons, this group scored 
lower than the next pieren in ou ee 
ne might expect that it would perform 

tae s sdpoint between the 60-64 and 70-74 age 
groups. It can be seen from Table 3 that this 
tation holds for the Kansas City sample. 
With the one exception of this Duke age group, 
all the age groups in both samples show de- 
creasing levels of performance with increasing 
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TABLE 9 

INTERCORRELATIONS OF THE Tests: AGE 70-74 Duke SAMPLE* 
(N = 62) 
1 2 3 4 5 6 7 8 9 fo | 11 | 12 | 13 | 4 

1. Information 
2. Comprehension 854 > 
3. Arithmetic +793 |.738 
4, Similarities «762 |.794 |.573 
5. Digit Span 577 |.622 |.584 |.620 
6. Vocabulary 876 |.832 |.708 |.817 |.630 
7. Digit Symbol 772 |.760 |.670 |.670 |.563 |.699 
8. Picture Completion |.819 |.759 |.688 |.666 |.576 |.720 |.731 
9. Block Design -681 }.640 |.596 |.605 |.581 |.593 |.733 |.736 
10. Picture Arrangement |.631 |.642 |.605 |.499 |.519 |.706 |.630 |.604 |.501 
11. Object Assembly 569 |.549 |.403 |.530 |.475 |.482 |.577 |.686 |.699 |.431 
12. Verbal score’ +926 |.921 |.831 |.876 |.746 |.934 |.796 |.802 |.705 |.681 |.576 
13. Performance score |.830 |.800 |.708 |.709 |.646 |.763 |.880 |.898 |.873 |.746 |.801 |.850 
14, Full Scale score? 923 |.907 |.813 |.842 |.734 |.901 |.859 |.870 |.798 |.732 |.687 |.977 |.941 
15. Verbal IQ 926 |.914 |.826 |.875 |.741 |.939 |.788 |.809 |.708 |.683 |.579 
16. Performance IQ 827 |.789 |.705 |.704 |.640 |.770 |.864 |.902 |.867 |.751 |.796 |.846 
17. Full Scale IQ 921 |.897 |.809 |.839 |.730 |.906 |.846 |.877 |.798 |.736 |.685 |.973 |.940 


® Compare with Doppelt and Wallace (1955, Table 12, p. 326). 


b Correlation of tests with Verbal, Performance, and Full Scale scores before correction for contamination. 


TABLE 10 
INTERCORRELATIONS OF THE Tests: AGE 75+ DUKE SAMPLE* 


63) 


6 7 8 9 10 i 12 13 14 


N= 
1 2 3 4 5 

1. Information 
2. Comprehension 874 
3. Arithmetic «775 |.754 
4, Similarities -801 }.749 |.768 
5. Digit Span 572 |.559 |.568 |.541 
6. Vocabulary -849 |.857 |.757 |.837 |.655 
7. Digit Symbol 743 |.712 |.782 |.745 |.637 
8. Picture Completion |.807 |.767 |.770 |.798 |.597 
9. Block Design -625 |.544 |.625 |.639 |.525 
10. Picture Arrangement |.773 |.685 |.690 |.772 |.552 
11. Object Assembly +662 |.555 }.629 |.727 |.529 
12. Verbal score’ -912 |.896 |.869 |.900 |.698 
13. Performance score’ |.823 |.748 |.796 |.838 |.663 
14. Full Scale score? -904 |.860 |.865 |.905 |.696 
15. Verbal IQ -910 |.891 |.869 |.903 |.700 
16. Performance IQ +821 |.742 |.786 |.834 |.639 
17, Full Scale IQ -905 |.859 |.861 |.903 |.693 


+716 
-870 
-803 
721 
873 
-803 


883 
981 


875 
+980 


.953 


.954 


® Compare with Doppelt and Wallace (1955, Table 13, p. 327). 


b Correlation of tests with Verbal, Performance, and Full Scale scores before correction for contamination, 


age. This 65-69 group has the largest V and 
the lowest SDs of the Duke sample. 

In subtest comparisons, as noted on Tables 
4, 5, and 6, the 65-69 group is the most atypical 
of the Duke sample. It is significantly lower in 
one Verbal subtest in comparison with the 
Kansas City sample—the only Duke subgroup 
that was significantly lower on a Verbal sub- 
test—and in four of the six Verbal subtest 
comparisons, it was also lower. On the Per- 
formance subtests, it was not only lower than 


the comparable Kansas City group, but it was 
the only subgroup significantly lower than the 
comparison sample. ? 
If we assume that the Duke 65—69 group 15 
closer to the national standard and that the 
results for our other age groups are atypical, 
we are unable to explain the even lower Per- 
formance score of these subjects. At the very 
least, we must conclude that our subgroups 
are different from the Kansas City sample. 
Since the criteria for the selection of groups 
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were similar for all ages, we cannot account 
for the performance of this group in terms of 
any systematic sampling bias. Since it included 
recently retired subjects, it perhaps involved a 
number of individuals who were under special 
emotional stress as a function of adaptation to 
their new status. This does not appear to have 
influenced the Kansas City sample, however, 
unless there may be differences in retirement 
age practices in the different regions. 

It should be observed that this 65-69 group 
is not the principal contributor to the differ- 
ences between the two samples. In fact, with 
regard to the Verbal differences noted, this 
group goes contrary to the Duke sample’s 
higher achievement. 

Tt is, of course, possible that the Duke Proj- 
ect sample is a biased one since all subjects 
were volunteers. On the other hand, it seems 
quite clear that this is also true of the Kansas 
City sample. Of the group originally contacted 
in Kansas City, approximately one-half 
(54.2%) completed the examination, Of the 
649 subjects identified as the “Sample for 
testing,” only 352 subjects completed the 
examination and were used for establishing 
the tentative national norms. In the Eisdorfer 
et al. (1959) study, using a portion of the 
Duke sample, various population variables were 
investigated in an attempt to find a source of 
sample bias. It appeared that the relationship 
between Verbal and Performance scores was 
not a function of sex, race, intelligence, social 
class, age, or mental illness variables. The 
authors can think of no reasonable explanation 
for the differences found, apart from unidenti- 
fied regional differences. 

If regional differences in patterns of intel- 
lectual performance exist, it would seem logical 
to suppose that they would derive from re- 
gional influences and would be intensified dur- 
ing the life span, thus appearing more promi- 
nently in the aged. This presumption could be 
tested readily. Probably in order to offset such 
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(possible) regional influences, the collection of 
data from the WAIS norms for younger in- 
dividuals (16-60) was based upon a national 
sample (Wechsler, 1958). In view of the pres- 
ent findings, it is suggested that subjects to be 
used as the standardization sample for the aged 
(60 plus) should also be selected on a nation- 
wide basis. 


SUMMARY 


This study reports WAIS data on 239 
elderly subjects above age 60 from the Dur- 
ham, North Carolina, area. In view of the 
absence of national norms above age 65 for the 
WAIS and the suggested use of data from 352 
subjects from a Kansas City study as tentative 
norms, it seemed especially appropriate to 
compare these latter with the Duke data. 
Stable differences were found between the 
samples, with the Duke groups showing supe- 
riority in the Verbal subtest scores and lower 
achievement in the Performance subtests. No 
satisfactory explanation of these differences is 
offered, other than the possibility of regional 
influences. The need for establishing national 
norms on a national sample of the aged is 
demonstrated. 
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INTERPERSONAL PERCEPTIONS OF REPRESSORS AND SENSITIZE 
AND COMPONENT ANALYSIS OF ASSUMED ? 
DISSIMILARITY SCORES! 


JOHN ALTROCCHI 
Duke University 


NE recent trend in relating individual 
differences in behavior to relatively 
enduring patterns of personality 

functioning has been the delineation of a di- 
mension of repression-sensitization. Repres- 
sors, compared to sensitizers, have been shown, 
for instance, to have higher thresholds for ma- 
terial related to unacceptable needs (Eriksen, 
1951) and to recall more ted than in- 
completed tasks under threat (Eriksen, 1954). 
Selection criteria for repressors and sensitizers 
have usually included the Hysteria (Hy) 
and Psychasthenia (PA scales of the MMPI 
and many investigatorshave related the repres- 
sion-sensitization dimension to behavioral dif- 
ferences between patients using primarily re- 
pressive and denying defenses and those using 
primarily obsessive and intellectual defenses. 
From these studies the following definitions 
were derived (Altrocchi, Parsons, & Dickoff, 
1960) and are used throughout the present 
study: Repressors are defined as those who 
tend to use avoidance, denial, and repression 
of potential threat and conflict as a primary 
mode of adaptation; sensitizers are defined as 
those who tend to be alerted to potential threat 
and conflict, to respond more readily with 
manifest anxiety, and to use intellectual and 
obsessive defenses. 

In two experiments Gordon asked subjects 
to predict how their partners in the experi- 
ment would answer inventory questions. He 


+ This paper was presented at APA, September 1959, 
aAA A A bee sa ety ona 
partially supported t M- the Na- 
poy pil oe el 

to on 
and expecially to Lee J. Cronbach, David M. Shee: ned 
Edward E. Jones for advice and criticism, 
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teristics of the partner being judged. 
ent study is designed to clarify and ext 


personal perception (description of th 
person, as distinguished from prediction of 
the other person will answer items)? 

2. If so, is the phenomenon evid 


3. Does the phenomenon extend to per 
ing dissimilarity between two other p Pop! 
well as to perceiving dissimilarity be 
oneself and others? 

While these questions are meaningf 
principle, Cronbach (1958) has recently on 
cized the use of assumed similarity (AS 
more precisely, assumed dissimilarity @ 
scores which would usually be involved 
seeking answers to them. He argues that 
after the simpler main effects associated” 
the perceiver or the object of perception! 
been given separate consideration” may 
scores be interpreted meaningfully as ini 
tions between two sets of data (c.g, dë 
tions of self and other) (Cronbach, 1958) 
Suggests that AD scores can be analyzed 
five simpler mathematical component 
as, YF, oy, and reo,/oe—the means and § 
ard deviations of the two descriptions 
regression of one on the other. 
fourth question is asked: 

4. Do differences in AD reflect a dy 
Phenomenon, such as dependency of dë 
tion of other on description of self, or cami 
be accounted for by differences in one or 
of the monadic components (e.g., mean 
on self-description)? 

Another problem noted by Cronbach if 
the usual form of the AD score combines 
or more personality variables into @ 
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presented in Table 1. The results of a Lind- 
quist (1953) Type IIT analysis of variance are 
presented in Table 2, The AD scores for sen- 
sitizers were significantly higher than the AD 
scores for repressors (p < .01). The AD scores 
were not significantly affected by whether the 
person described was actually similar or dis- 
similar to the subject. The significant interac- 
tion between pre-post and similar or different 
persons described (p < .05) suggests that 
subjects’ perceptions more closely reflected 
actual differences and similarities after train- 
ing than before. The AD scores also decreased 
significantly from pre to posttraining across all 
groups ($ < .01). 

Thus sensitizers do seem to assume more 
dissimilarity between self and other in inter- 
personal perception than repressors, even af 
extensive acquaintance and special clinical 
training. 

Mean AD scores for all six groups in the 
second design (involving descriptions of two 
others) are presented in Table 3, Results of a 
Lindquist Type III analysis of variance are 
presented in Table 4. The only significant 
effect is that based on whether the two persons 
described actually were similar or dissimilar 
(p < .05). There was a trend, which did not 
reach an acceptable level of statistical sig- 
nificance (p = .10), for sensitizers to assume 
more ilarity between two others, 


Component Analysis 
To answer the fourth and fifth questions 


concerning component analysis, it was neces- 
sary to analyze the AD scores into the five 


components, (X, a, P, %, and teyr,/02), 


TABLE 3 
Mean AD Scones ror tHe Sxconn Deston 


Note. —W = 12 in each cell. 


Joux Atrrocert 


TABLE 4 


ÅNALYSIS OF VARIANCE OF AD Scores 
FOR THE SECOND Desicn 


Between 
itizers (B) 


Repressors-sensi 
Persons described (Similar-Dif- 
ferent) (C) 


noted by Cronbach (1958), for at least two 
independent dimensions of the ICL. Therefore 
Scores were calculated for all 46 repressors and 
46 sensitizers (from the first design) on the 
two moderately independent dimensions of the 
ICL—Dominance and Love (Leary, 1957)— 
for description of self and description of other 
separately. Data for description of others who 
Were repressors and others who were sensitizers 
were combined. Dominance and Love were 
used rather than octants because the octants 
are intercorrelated and use of the octant scores 
would thus violate one of Cronbach's (1958) 
specifications for component analysis. The 
original AD scores, however, were calculated 
using octant differences. Thus it was necessary 
to demonstrate that AD scores based on 
Dominance and Love would show differences 
between repressors and sensitizers parallel to 
the differences found using AD scores based on 
octants. Therefore AD scores for self-other on 
Dominance pretraining were calculated. The 
Scores for sensitizers were higher than those for 
repressors (t = 3,12, p < .01), suggesting that 
a component analysis of Dominance and, prë- 
sumably, Love scores would be likely to reveal 
contributions to the differences in AD scores 
using octants, 

Tables 5 and 6 present the results of the 
component analyses for pre and posttraining 
Dominance and Love scores. Before the train- 
ing repressors described themselves with less 
variability and with higher scores than sensi- 
tizers did on Dominance and, to a lesser extent, 


Descurrion or 
Descxtrrion or Orien SEPARATELY 


sensitizers | = 4.5, p < .0O1; for repressors t = 2.4, p < 05. 


on Love. On the other hand, Table 5 discloses 
no evidence that repressors and sensitizers 
differed significantly in their perceptions of 
others. Furthermore, the low correlations and, 
therefore, low regression components in Table 
5 suggest that neither in the repressor nor 
sensitizer groups did the description of others 
depend substantially on the description of self. 
Of the components of AD noted by Cronbach, 
regression of other description on self-descrip- 
tion is the only dyadic component and comes 
closest to reflecting assumed similarity as 
ordinarily conceptualized. The differences be- 
tween repressors and sensitizers in the correla- 
tions are in the direction consistent with the 
earlier AD analysis (in Tables 1 and 2) for Love 
before and after the training and for Domi- 
nance before the training. Thus there was a 
trend, although not statistically significant, for 
the repressors to assume more similarity than 
did sensitizers between self and others before 
the training, 

After the training (see Table 6) approxi- 
mately the same pattern was still apparent; 
higher and less variable scores 

Dominance. 
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S The distributions were split at the medians and a chi square 
test was significant at <.001 for Dominance but wot for Love 
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dence of a dyadic phenomenon—and no clear 
evidence that the groups tended to differ 
significantly in their perception of others. 
The component analyses raise questions as 
to whether the analysis of AD scores for de- 
scription of two others (see Tables 3 and 4) 
actually answered the original question con- 
cerning perceptions of difference between 
others. It has become apparent that AD scores 
cannot be interpreted meaningfully unless 
they are analyzed into components. Yet com- 
paring the mean and standard deviation com- 
ponents in this case would only indicate the 
adequacy of the selection of pairs of people to 
be described. Thus it seems that the closest 
one can come to comparing differences between 
descriptions of two others in the present design 
is to calculate the correlation component. The 
most uncontaminated measure of this com- 
ponent is covariance (2xy/n), which Bass and 
Fiedler (1959) used, but in this study the more 
familiar product-moment correlation coeffi- 
cient (Zxy/noxoy) is used because it makes 
comparison between groups simpler. The cor- 
relations between Dominance scores for de- 
cription of two others and Love scores for the 
same descriptions for all groups in the second 
design are presented in Table 7. Since only 2 
of 24 correlations were significant, these results 
must be viewed cautiously. Some trends are 
apparent, though. The correlations tended to 
be higher in the positive direction for repressors 
than midgroup subjects, while the correlations 


TABLE 7 
PRODUCT-MOMENT CORRELATIONS BETWEEN DE- 
SCRITIONS OF Two OTHERS ON DOMINANCE 


AND LovE 
Pretraining Posttraining 
Variable and Group Reson 
re people eaapeomle oiia, | Diter- 
to | from each p 
each other other 
Dominance 
Repressors .32 .39 |—.16 * .03 
Midgroup -10 | —.03 | .79***) — 54 
Sensitizers —.28 -04 |—.38 —.32 
Love 
Repressors 54 -30 | .70***| 47 
idgroup -50 +13 -37 10 
Sensitizers :26 | —.14 13 —.41 


Note.—N=12 for each correlation. 
s p < 01. 
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for sensitizers tended to be negative. The 
correlations tended to be higher when describ- 
ing two similar people than when describing 
two different people, higher before training, 
and higher for Love than for Dominance, 
Nevertheless, transforming all r’s into z scores 
and comparing 2’s disclosed only two statisti- 
cally significant comparisons; the correlation 
on Dominance, posttraining, describing two 
similar people for midgroup subjects was higher 
than that for repressors (p < .05) and sensi- 
tizers (p < .01). Thus these analyses did not 
provide clear support for the suggestion that 
sensitizers assume more dissimilarity between 
two others, but the trends in the results suggest _ 
some possible leads for future investigators, 


Discussion 


The present results clearly support Cron- 
bach’s (1958) conclusion that AS and AD 
scores can be greatly misleading. Whereas 
the AD analyses might have led to interpreta- 
tions regarding different interpersonal percep- 
tion processes for repressors and sensitizers, the 
component analyses demonstrated no percep- 
tual differences between the groups. Cronbach’s 
conclusion receives further corroboration from 
a study by Bassand Fiedler (1959) in which AD 
scores did not consistently relate to their co- 
variance components and self-concept differ- 
ences were found to be responsible for some 
of Fiedler’s earlier results using AS scores. Self- 
concept differences may also be responsible for 
the differences Gordon (1957, 1959) found be- 
tween repressors and sensitizers in interper+ 
sonal prediction and for some of the results 
in other studies using assumed similarity scores 
(e.g., Rosenbaum, 1959). It is noteworthy that 
Bronfenbrenner (1958) has reached a similar 
conclusion regarding the use of gross simi- 
larity scores in the measurement of identifica- 
tion. The potential usefulness of Cronbach’s 
component analysis as a substitute for use of 
AD scores seems demonstrated by the pres- 
ent study. Hopefully other methods such as 
multivariate r statistics and associated designs 
suggested by La Forge (1959) will also prove 
useful in studies of interpersonal perception. 

Since AD scores can be misleading, Cron- 
bach? concludes that statements of hypotheses 
in AD terms are valueless and tend to obscure 


? Personal communication. 


ANALYSIS OF ASSUMED DISSIMILARITY Scores 


dependable relationships. This conclusion can 
be further supported by noting the logical 
"difficulties in measuring a phenomenon such 
as assumed dissimilarity (or similarity). Such 
a phenomenon seems to imply measurement 
of the degree of agreement between two meas- 
ures, for which the intraclass correlation 
coefficient (r;) (Robinson, 1957) is a more 
appropriate statistic thanr. Evenr;, however, 
would need to be analyzed into components 
involving means, standard deviations, and 
covariance to allow precise interpretation 
and only if covariance were the primary com- 
ponent would use of the term assumed simi- 
larity be appropriate. This investigator is led 
to conclude that development and use of 
concepts at least as precise as our measuring 
techniques will allow would greatly accelerate 
meaningful research in interpersonal percep- 
tion. 

The present results demonstrate, more 
clearly than had been shown earlier with many 
of the same subjects (Altrocchi et al., 1960), 
that repressors have more positive self-con- 
cepts than sensitizers (in this case describing 
themselves as higher on Dominance and Love) 
and that they are a more homogeneous group 
in self-description. These results are not sur- 
prising in that there is growing evidence that 
subjects with negative self-concepts show 
greater instability of self-concept (cf. Al- 
trocchi, et al., 1960; Engel, 1959); and one 
would expect people who describe themselves 
negatively on one self-report test to do so on 
another. Many sensitizers evidently see them- 
selves as different from and less desirable than 
other people (cf. Table 5) and do not simply 
describe everyone negatively, but it is not 
clear whether sensitizers tend to describe 
others differently from the way repressors 
describe others. Considering the hetero- 
geneity of the sensitizer group, it might be 
useful for future investigators to define more 
homogenous sensitizer subgroups. 

The only results in this study which are not 
likely to have been directly affected by self- 
concept differences between repressors and 
sensitizers are those involving perceived dis- 
similarity between two others (see Tables 3, 
4, and 7). There was a slight tendency for 
sensitizers to perceive greater dissimilarity 
between two others as measured by correla- 
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tions between descriptions of two others. It 
might be profitable to explore this difference 
further, since it seems possible that Fiedler’s 
(1958) measure of “assumed similarity between 
opposites” (ASo) might be partly a function 
of such individual differences as those between 
repressors and sensitizers. Another reason for 
pursuing this problem is_that Holzman and 
Gardner (1959) have recently demonstrated 
that extreme repressors tend to be levelers on 
a neutral psychophysical task. It would be 
useful to know whether we can link defensive 

processes, such as repression or sensitization, 

not only to adaptive processes in perception 

of physical stimuli, as Holzman and Gardner 

have done, but also to modes of interpersonal 

perception. 

The present results are relevant for studies 
of changes in self-concept or perception of 
others as a function of special training. The 
clinical training of these nursing students did 
not result in increased differentiation (varia- 
bility) in perception of peers of the kind that 
Crow (1957) found in medical students’ 
perceptions of patients after training in 
accuracy of perception. The conditions under 
which clinical training results in changes in 
perceptions of self and others remain unclear. 
Finally, the present results suggested the 
need to take account, in future studies, of 
subjects’ possible tendencies to check fewer 
items true on the ICL in a second testing. 


SUMMARY 


From a population of 227 student nurses, 
MMPI criteria were used to select 56 re- 
pressors and 56 sensitizers. Each subject 
described herself and three of her peers on 
the Interpersonal Check List before and after 
intensive training in psychiatric nursing. 
Assumed dissimilarity (AD) scores of inde- 
pendent groups of repressors and sensitizers 
describing other repressors and sensitizers were 
calculated. The results, following two studies 
by Gordon, seemed to demonstrate that sensi- 
tizers assume more dissimilarity between self 
and other than repressors. Following sug- 
gestions by Cronbach, however, AD scores 
were analyzed into components, revealing that 
the differences in assumed dissimilarity between 
self and others were due primarily to stable 
differences in self-description and not to any 
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clear differences in perception of others nor to 
any substantial correlation between perception 
of others and perception of self (the dyadic 
component). The results demonstrate the 
usefulness of Cronbach’s suggestions and the 
necessity of analyzing gross AD scores into 
components in order to interpret them with 
precision. It is further concluded that use of 
such gross concepts as assumed dissimilarity 
tends to obscure relationships. 

Sensitizers were again shown to have more 
negative self-concepts than repressors and 
to be a more heterogeneous group, suggesting 
the need to examine possible sensitizer sub- 
groups. Other suggestions for further investi- 
gation stress the need to clarify, more than 
the present results did, whether sensitizers, 
more than repressors, tend to perceive dis- 
similarity between other people; whether there 
are, in fact, any differences between repressors 
and sensitizers in perception of others; and 
whether special clinical training does result 
in changes in perception of self or others. 
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HE first part of this study relates 
i different pairings of dominant and 

submissive males, interacting under 
various role assignments, with their joint 
achievement in a cooperative problem solving 
situation. The second part investigates the 
relationship of these different pairings to the 
perceived dominance of the self and the partner. 

The thesis-is advanced that different com- 
binations oe dominant and/or submissive 
individuals achieve more or less successfully 
according to the pair-combination as well as the 
conditions of assignment of dominant or 
submissive roles, and that it is possible to 
predict differential success among these permu- 
tations according to hypotheses derived from 
personality theory. 

The hypotheses are derived from Sullivan’s 
(1953) general assumption that a person’s 
modes of relating to others are functional in 
that they enable him to maintain anxiety at a 
minimum. Interpersonal situations that per- 
mit, or encourage, the use of a salient inter- 
personal technique, such as dominance or 
submission, give rise to less anxiety than 
situations that do not. The experiencing of 
anxiety disrupts cognitive functioning and 
leads to a less effective performance in a task. 
Hence, there should be greater achievement 
when persons are permitted or encouraged to 
assume habitual modes of relating. 

Tn spite of the explicitness and theoretical 
importance of this reciprocal aspect of inter- 
personal theory, few experimental studies have 
been concerned with it. Leary (1957), em- 
ploying dominance as one of his central vari- 
ables, posits a reciprocal theory of interaction. 
In Mann’s (1959) review of the studies of the 
relation of personality and performance in 


1 This paper is based on a dissertation submitted to 
the Department of Psychology, University of Cali- 
fornia, in partial fulfillment of the requirement of the 
PhD. The writer wishes to express his appreciation for 
their help to Read Tuddenham, Edwin Ghiselli, Har- 
rison Gough, Frederick Tyler, and Lyman Porter. 
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small groups, he did not report any studies 
considering the relationship between interact- 
ing individuals differing in dominance and the 
influence of this relation upon either the nature 
of group achievement or mutual perception. 
Tuma (1955) found that dissimilarity in domi- 
nance between therapist and client, regardless 
of the direction of the difference, correlated 
highly with rated improvement in a counseling 
situation. 

The following four hypotheses are based on 
a theoretical prediction of differences, among 
different pairings of male subjects under speci- 
fied conditions of role assignment, in achieve- 
ment in a cooperative task situation, Table 1 
describes the seven different groupings and the 
interaction and the role assignments, when 
such assignments were made. The predicted 
rank orders of achievement of the groups in- 
volved in each of the four hypotheses are 
presented in Table 2. 

Hypothesis 1 compares Groups A, B, and 
G. Following Sullivan (1953), the pairings in 
Group A perform the most productively, since 
each member of the pair has a salient inter- 
personal technique explicitly requested by the 
experimenter as well as “pulled for” by the 
behavior of the partner. The pairs of dominant 
and submissive subjects comprising Group B 
interact under somewhat less optimal condi- 
tions, since these pairs lack the congruent role 
assignments of the pairs in Group A. The pairs 
of dominant and submissive subjects in Group 
G interact with an assignment of roles con- 
trary to their preferred modes, which arouses 
anxiety and disrupts achievement. 

Hypothesis 2 compares Groups B, E, and F. 
Paired dominant subjects (Group E) do not 
interact under as optimal conditions as do the 
pairs in Group B, since each member of the 
paired dominant subjects interacts so as to in- 
duce submissive behavior in his partner, who is 
also a dominant subject. The paired subjects 
in Group E were predicted to perform more 
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successfully than the paired submissive sub- 
jects in Group F, since dominant subjects are 
hypothesized to behave in the pursuit of satis- 
faction rather than security, and thus to be 
more at ease with each other and themselves 
than the paired submissive subjects. Sullivan 
(1953) considered the power motive as one of 
the pursuits of satisfaction, rather than se- 
curity. Individuals lacking in dominance may 
have found it necessary to dissociate the ex- 
pression and/or awareness of this motive. 
Hypothesis 3 compares Groups C, D, E, and 
F. The prediction involves the differential 
achievement of two different pairings (two 
dominant subjects interacting and two sub- 
missive subjects interacting), each under two 
conditions (no role assignments and assign- 
ment of the dominant role to one subject and 
assignment of the submissive role to the part- 
ner). The predicted rank order of achievement, 
by groups, is C, D, E, and F, since it was hy- 
pothesized that pairings with role assignment 
perform more successfully than comparable 
pairings with no role assignment, and further, 
that the paired dominant subjects would per- 
form more successfully than paired submissive 
subjects under comparable conditions. For a 
dominant subject to relinquish a dominant role 
should produce less anxiety than for a sub- 
missive subject to assume a dominant role. 


TABLE 1 


PAIRINGS or DOMINANT AND SUBMISSIVE SUBJECTS 
AND CONDITIONS OF ROLE ASSIGNMENTS 


(10 pairs in each group) 

$ Description of Interaction Code 

A | Dominant subject assigned dominant role; | D:d-S:s 
submissive subject assigned submissive role. 

B | Dominant subject and a submissive subject | D:x-S:x 
interact with no role assignment. 

C | Onedominant subject assigned dominant role; | D:d-D:s 
other dominant subject assigned submissive 
role, 

D | One submissive subject assigned dominant | S:d-S:s 
role; other submissive subject assigned sub- 
missive role, 

E | Two dominant subjects interact with no role | D:x-D:x 
assignment. 

F | Two submissive subjects interact with no role | S:x-S:x 
assignment. 

G | Dominant subject assigned submissive role; | D:s-S:d 
submissive subject assigned dominant role. 
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TABLE 2 


HYPOTHESES OF RANK ORDER OF 
ACHIEVEMENT OF GROUPS 


Hypothesis Predicted Rank Order of Achievement 


Pode 


In Sullivan’s (1953) terms, it is less disruptive 
to modify a mode of attaining satisfaction 
(dominant role) than it is to modify a security 
operation (submissive role). 


Hypothesis 4 is not independent of the other — 


three hypotheses, but comparestachievement 
among all seven groups. The predicted rank 
order corresponds to the alphabetical ordering 
of the groups, i.e., Group A is predicted to be 
the most successful and Group G the least suc- 
cessful. Hypotheses 1 and 3 are independent, 
since their predictions do not involve any 
common groups. Hypothesis 2 has one group 
(Group B) in common with Hypothesis 1 and 
it has two groups in common with Hypothesis 
3 (Groups E and F). 


METHOD 


The subjects were all male volunteers enrolled in 
either Military Science or Air Science at the University 
of California, Berkeley. Since enrollment in the uni- 
versity armed forces training program is compulsory, 
this group afforded a broad sampling of university un- 
dergraduate men. The total number of subjects who 
volunteered was 748. Participation in the experiment 
was credited as an excuse from one military drill period. 

Before requesting volunteers, the students were ad- 
ministered a short form of Gough’s California Psycho- 
logical Inventory (CPI), which contained the Domi- 
nance (Do), Social Responsibility, and Self-Control 
scales (Gough, 1957). A vocabulary test (Thorndike & 
Gallup, 1944) was administered at the conclusion of the 
personality test. The subjects for the experiment were 
selected from the extremes of the distribution of the Do 
scores, The mean of this distribution was 28.5 (SD, 6.8) 
which is quite similar to Gough’s (1957) standardiza- 
tion group mean of 28.5 (SD, 6.0). A subject scoring 
or above was defined as dominant; a subject scoring 
or below as submissive. The raw scores of 34 and 
correspond to Gough’s T scores of 110 and 92, baie 
tively, on a scale with a mean of 100 and a standar 
deviation of 10. 


Experimental Situation 
The interaction of the paired subjects involved the 
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joint operation of two model railroad trains. The ap- 
paratus was that employed by Ghiselli and Lodahl 
(1958). The main track was 6 feet in diameter, and had 
two bypass sidings by wbich the two tfains could leave 
and enter the main track. The two subjects sat on high 
stools to operate their respective control panels, which 
were 3 feet high, on the same level with the tracks. The 
left half of each control panel contained switches that 
supplied power to different numbered sections of the 
track. The right half of each control panel contained 
several controls: direction and speed of the trains, 
switches to and from the sidings, and a master control 
switch. The subjects were thus able to control the speed, 
direction, and route of each train. Carelessness by either 
subject in operation of his switches could obstruct the 
efforts of the partner in operating his train. 


Procedure 


Before the instructions for operation of the trains, 
the subjects were introduced, both as a matter of social 
convention, and also to determine whether the subjects 
were acquainted with each other. Subjects previously 
acquainted were not included in the results. Subjects 
from Military Science and Air Science were not mixed 
in the pairings. The instructions for operation of the two 
trains were given during the first 20 minutes. Both 
trains were operated by the experimenter at this time, 
and both control panels were equally employed for 
demonstration. The subjects were permitted to ask 
questions during the demonstration. They were allowed 
3 minutes to discuss the operation of the trains before 
the first trial, 

The role assignments, when given, were given just 
prior to the first trial. In the following example, “X” is 
assigned the dominant role and “Y” is assigned the sub- 
missive role. 

Mr. X, your train will be the passenger train, and 

Mr. Y, your train will be the freight. Mr. X, you are 

the dispatcher and you will arrange and order the 

solutions to the problem, and Mr. Y, your task is to 
carry out, on your board, Mr. X’s directions. Re- 
member, Mr. X, you are to plan and organize the 
' Solutions, while both of you are to carry out the 
operations of the trains on your respective boards. 
Mr. Y, your function is to carry out as well as you 
can the directions of Mr. X. You are permitted to 
make suggestions, but the final decision rests with 

Mr. X. Do you have any questions? 


Task 


At the start of the first trial, the passenger train was 
on one siding, pointed so as to move counterclockwise 
around the track. The freight train was on the main 
track, pointed so as to move clockwise around the track. 
‘The prescribed route for the passenger train was differ- 
ent from the route for the freight, so that the subjects 
frequently operated the switches to the sidings. Each 
Subject was responsible for the operation of the as- 
Signed train. The subjects’ task was to complete as 
many mutually complete trips as possible around the 
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track in a 3-minute trial period. There were six 3- 
minute trials, with a 1-minute rest period between each 
trial. The trains were not run simultaneously. The sub- 
jects were free to communicate with each other during 
the entire procedure, but could not ask the experi- 
menter for directions once the first trial had begun. 

The achievement score for a given trial was the 
number of mulually complete trips around the track, 
€.g., if one subject made 8 trips and his partner made 5, 
the achievement score for that trial was 10. The maxi- 
mum possible score for a given trial was 32. After the 
last trial, each subject checked Leary’s (1957) Inter- 
personal Check List (ICL) twice: as he saw himself and 
as he saw his partner, This 128-item check list can be 
scored for dominance. The entire procedure lasted one 
hour. 


RESULTS 


Achievement 


Groups A through G were first compared on 
age, the Responsibility and Self-Control scales 
of the CPI, and verbal intelligence. No signifi- 
cant differences between any of the seven 
groups on any of these variables was found. 
The means and standard deviations of achieve- 
ment scores for Groups A through G on Trials 
1 through 6 and the sums of trials are presented 
in Table 3. 

Table 4 contains the results of testing Hy- 


TABLE 3 


MEANS AND STANDARD DEVIATIONS OF ACHIEVEMENT 
FOR TRIALS 1 THROUGH 6 AND SUMS OF TRIALS 


Trials 
Group Sums 
sa ae EA Pee ee Rn dV 
A m | 18.9} 23.9 | 26.8 | 29.8 | 30.3 | 30.8 | 160.4 
sp | 7.0] 5.3| 5.4] 21] 28] 2.0] 19.0 
B M_ | 14.1} 19.8 | 25.7 | 26.2 | 26.7 | 28.7 | 141.2 
sp | 67| 5.5| 4.0] 6.0] 55} 4.4] 20.1 
C M {15.4 | 23.2} 27.4 | 27.7 | 28.8 | 30.8 | 153.3 
sp | 75] 5.6] 4.3] 65] 54] 1.6] 18.7 
D m | 13.6} 20.4| 22.5 | 26.4 | 29.2 | 30.8 | 142.9 
sp | 7.7} 7.0] 81] 45] 3.0] 2.0] 22.8 
E m | 12.0} 20.3 | 20.7 | 26.9 | 26.6 | 29.2 | 142.0 
sp | 87| 68] 3.5] 42] 60] 3.3] 23.0 
F xo | 10.0] 18.1 | 21.2 | 24.1 | 27.3 | 29.5 | 130.2 
sp | 69} 67| 4.0] 7.8] 44] 25] 13.8 
G M 7.3 | 14.8 | 15.9 | 22.4 | 27.6 | 28.4 | 116.4 
SD | 5.7| 4.9] 7.0] 5.1] 24] 4.9] 13.5 
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pothesis 1. The analysis of variance employed 
was that recommended by Edwards (1950) 
when there are repeated measurements of the 
same subjects. Before any of the analyses of 
variance were performed, Bartlett’s test of 
homogeneity was performed, which proved 
to be insignificant in all cases. 

There is a highly significant difference in 
achievement between Groups A, B, and G. 
Inspection of Table 3 shows that Group A was 
more productive than Group B, which in turn 
was more productive than Group G. The sig- 
nificant mean square between trials indicates a 
marked increase in performance across trials 
for all three groups. The significant interaction 
of trials by groups reflects the initial higher 
achievement and greater rapid improvement 
in performance of Groups A and B over Group 
G. The analysis of variance confirms Hy- 
pothesis 1. 

The results of another simple analysis of 
variance to test Hypothesis 2 are presented in 
Table 5. The hypothesis was not confirmed, 
since the mean square between Groups B, E, 
and F was not significant, and the empirical 
ordering by achievement was E, B, then F, 
instead of B, E, then F as predicted. Thus 
paired dominant subjects (Group E) outper- 
formed paired dominant and submissive sub- 
jects (Group B), who in turn outperformed 
paired submissive subjects (Group F). The 
mean square between trials was again highly 
significant, a finding attributable to the in- 
creasing achievement with successive trials for 
all three groups. 

A complex analysis of variance was per- 


TABLE 4 
ANALYSIS OF VARIANCE FOR HYPOTHESIS 1 
Source df MS F 
Between Groups A, B, G 2 811.02 | 13.91*** 
Between Trials 1 5 | 1,081.78 | 47.85*** 
through 6 
Trials X Groups 10 56.56 | 2.50** 
Between subjectsinsame | 27 58.31 
group 
Pooled subjects X Trials | 135 22.61 
Total 179 
. p= 01. 


“> = 001, 
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TABLE 5 
ANALYSIS OF VARIANCE FOR HYPOTHESIS 2 
Source> df MS F 
Between Groups B, E, F 2 72.47 | 1.04 
Between Trials 1 through S | 1,175.57 |. 41,40%% 
6 

Trials X Groups 10 19.96 | .70 
Between subjectsin same | 27 69.72 


group 
Pooled subjects X Trials| 135 
Total 179 


e. p = 001. 


formed to test Hypothesis 3, since the two 
variables of pairings and role assignments were 
each varied in two ways. The results are pre- 
sented in Table 6. The empirical rank order of 
the four groups was C, D, E, then F, which 
corresponds to the predicted rank order. The 
mean squares between pairings and between 
role assingments were both significant. The 
results confirm Hypothesis 3. The mean squares 
between trials was again significant. 

Hypothesis 4 predicted the rank order in 
achievement of all seven groups. The predicted 
rank order was alphabetical, A through G, 
while the empirical rank order was A, C, D, 
E, B, F, and G. The rank order correlation 
between predicted and empirical rank orders 
was .78, significant at the .05 level (Table P 
in Siegel, 1956). This empirical rank order is 
based on the sums of trials. The rank order 
correlations, on the other hand, between pre- 
dicted and empirical rank order calculated 
separately for Trials 1 through 6 were: 96 
(p = .01), .64, .61, .75 (p = .05), 39, and 
.55, respectively. 

A frequency count of the subjects’ spontane- 
ous requests for the maximum possible score 
shows that 59 of the total N of 70 dominant 
subjects made such a request, while 12 of ‘the 
total N of 70 submissive subjects made this 
request (chi square is 55.6, p < 001). Also, 
15 additional dominant subjects operated the 
trains without a partner, as did 15 additional 
submissive subjects. An F test between the 
achievement of these two groups was sight 
cant at the .05 level, with the dominant sub- 
jects achieving more than the submissive 519- 
jects. In Mann’s (1959) review of the relation 
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TABLE 6 
ANALYSIS OF VARIANCE FOR: HYPOTHESIS 3 
Source af us F 
Between pairings 1 205.35 | 5.68* 
(Groups C, D vs. E, F) 
Between roles (Groups 1 240.00 | 6.50* 
C, E vs. D, F) 
Between Trials 1 through 5 | 1,570.95 | 43.48*** 
6 
Trials X Pairings 5 44.95 | 1.24 
Trials X Roles 5 8.88 .25 
Pairs X Roles 1 81 02 
Pairs X Roles X Trials 5 3.66 10 
Within Groups 216 36.13 |- 
Total 239 
* p= 05. 
se > = 001. 


of personality and performance in small groups, 
dominance is found to be positively related to 
the number of task contributions. 


Perception 


Since each subject checked the ICL as he 
perceived himself and as he perceived his part- 
ner, it was possible to determine the subjects’ 
own definition of the situation in terms of 
dominance of self and partner. The ICL domi- 
nance scores are presented in Table 7. 

The intragroup differences in perceived 
dominance are considered first. The test em- 
ployed was the Wilcoxon matched-pairs signed- 
tanks test (Siegel, 1956). Differences that at- 
tained statistical significance are presented in 
Table 8. Inspection of Table 7 shows that for 
every group which includes a dominant sub- 
ject, the dominant subjects’ self-check mean 
is always higher than the check of his partner 
mean. This difference attains statistical signifi- 
cance within Groups A and B. Thus in terms 
of a comparison of group means, the submissive 
subjects consistently perceived themselves as 
less dominant than they perceived their part- 
ners and the dominant subjects as more domi- 
Nant, regardless of the roles assigned and re- 
8ardless of whether the partner was dominant 
subject or a submissive subject. 

In those groups where a dominant subject 
and a submissive subject interact (Groups A, 
B, and G), the dominant subjects’ mean check 
of their partners was always higher than their 
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TABLE 7 


INTERPERSONAL CHECK List DOMIANCE SCALE MEANS 
AND STANDARD DEVIATIONS or PAIR COMBINATIONS 
FOR CHECK OF SELF AND CHECK OF PARTNER 


(N = 10 for each pair combination) 


Self Partner 

Gi Pair 
TOUP Combination y F X oh 
A D:d 64.4 3.8 55.8 4.3 
S:s 49.6 9.6 61.2 6.8 
B Dix 62.7 2.9 60.1 6.7 
Six 49.0 8.5 58.3 7.7 
Cc D:d 62.9) 351 60.8 4.7 
á Dis 61.7 5.3 61.5 5.9 
D S:d 53.1 8.2 54.3 8.5 
Sis 51.3 T 54.1 8.7 
E D:x, 59.0. 3.8 58.9 8.6 
D:x: 62.6 3.7 57.7 3.8 
F Six 49.8 9.9 57.4 5.9 
Six 49.2 8.1 53.9 8.8 
G D:s 58.0 3.9 56.7 3.5 
S:d 52.1 6.9 57.5 7.0 


partners’ mean self-checks. This difference at- 
tains statistical significance in Group B. In 
Groups A, B, and G, the submissive subjects’ 
mean self-check was consistently lower than 
the dominant subjects’ mean self-check. This 
difference is statistically significant in Groups 
A and B. 

Several intergroup comparisons were made, 
employing, for the most part, the data as pre- 
sented in Table 7. The standard deviations of 
the submissive subjects’ self-checks are sig- 
nificantly greater than the standard deviations 
of the dominant subjects’ self-checks (p < 
.01), employing the Wilcoxon matched-pairs 
signed-ranks test (Siegel, 1956). Thus in their 
self-checks, dominant subjects are a more 
homogeneous group than are the submissive 
subjects. The standard deviations of the domi- 
nant subjects’ self-checks are significantly 
greater (p < .05) than the standard deviations 
of the dominant subjects’ checks of their part- 
ners. Dominant subjects are less uniform in 
their perceptions of their partners than they 
are of themselves. This relationship is not true 
of the submissive subjects. 

A comparison of the self-check of the domi- 
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TABLE 8 


STATISTICALLY SIGNIFICANT INTRAGROUP DIFFERENCES 
IN INTERPERSONAL CHECK List DOMINANCE 
(N = 10 for each pair combination) 


Pair Combination Pair Combination 
Group with higher with lower 
Dominance® Dominance® 
A D:d (s) Sis (s) .006 
B D:x (s) S:x (s) -006 
A D:d (s) D:d (p) -006 
E D:m (s) D:x: (p) .05 
A ; P s (s, .01 
B S:x (p) S:x (s) .02 
B D:x (p) S:x (s) 01 


® (s) denotes self-check; (p) denotes check of partner. 
* 


nant subjects interacting with submissive sub- 
jects under three different conditions of role 
assignment (Groups A, B, and G) shows that 
the dominant subjects assigned a dominant 
role (Group A) perceived themselves as more 
dominant than the dominant subjects assigned 
no role (Group B), who in turn perceived them- 
selves as more dominant than the dominant 
subjects assigned a submissive role (Group 
G). These three groups are significantly dif- 
ferent (p < .02), employing the Friedman two- 
way analysis of variance (Siegel, 1956). A 
similar (statistically insignificant) rank order 
holds for the submissive subjects’ check of their 
respective dominant partners in Groups A, B, 
and G, i.e., the submissive subjects in Group 
A rated their own dominant partners as more 
dominant than did the submissive subjects in 
Group B rate their own partners. The sub- 
missive subjects in Group B in turn rated their 
dominant partners as more dominant than the 
submissive subjects in Group G rated their 
own dominant partners. 


Discussion 


The general pattern of the achievement of 
the seven groups tends to validate the hypoth- 
eses, both in statistical significance of the dif- 
ferences between groups and in the direction 
and ranking of the predicted differences. The 
failure of the results to support Hypothesis 2 
is due largely to the unpredicted high achieve- 
ment of Group E. Group C also achieved higher 
scores than predicted in Hypothesis 4. Both 
of these groups have paired dominant subjects. 
The greater achievement of these two groups 
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is in part due to the higher aspiration level of — 
the dominant subjects, as previously noted, 

With the exception of Group G, the assign- 
ment of dominant and submissive roles to any — 
of the pairings resulted in an increase in 
achievement. The greatest absolute increase in 
achievement as a result of assignment of roles 
was between the dominant subject with a sub- 
missive subject pairings (Group A compared 
with Group B). The assignment of roles to 
paired dominant subjects or paired submissive 
subjects resulted in virtually the same absolute 
increase in achievement, i.e., the difference 
between Group C and Group E is similar to 
the difference between Group D and Group F. 
The assignment of roles gave a structure within 
which the subjects were able to function more 
effectively than the comparable pairings with- 
out role assignment. 

There was a rapid rise in achievement across 
trials for all groups, so that by Trial 6 all groups 
were performing at virtually the same level. 
This similarity of level of performance by Trial 
6 is in part an artifact of the task, since the top 
possible score for a trial was 32. Group G made 
the greatest absolute gain from Trial 1 through 
Trial 6. However, this is due in part to the low 
initial performance on Trial 1. If there had 
been, e.g., only three trials instead of six, the 
differences in achievement would have been 
greater and in the same direction. The major 
differences in achievement between groups ap- 
peared in the initial trials. The effect of the 
relative dominance of the partners as well as 
the assignment of roles is greatest here. 

The ICL results indicate that the subjects’ 
definition of the situation in terms of domi- 
nance closely reflected both the selection of the 
subjects high and low in personal dominance ~ 
as well as the assignment of dominant and sub- 
missive roles. The degree of dominance as; 
cribed to the partner, relative to the subjects 
own dominance, is a function of the subjects 
personal dominance. These findings are in ac- 
cord with those of Leary (1957) and Nabo 
(1953). The degree of dominance ascribed to the 
self and the partner varied in the direction 0 
the assigned role. ot 

These relationships between the subjects 
own dominance and the degree of dominance 
he ascribes to others are in contrast to a study 
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DOMINANCE IN COOPERATIVE PROBLEM SOLVING 


(Altrocchi, 1959) employing the same pool of 
subjects as the present study, where the domi- 
nant subjects did not ascribe less dominance to 
others than the submissive subjects, One criti- 
cal difference between these studies is that 
Altrocchi’s subjects observed movies of social 
objects before rating them, whereas the sub- 
jects in the present study interacted before 
rating each other, thus both communicating 
as well as evoking cues (Bruner & Tagiuri, 
1954) of dominance or submissiveness. 

Both the dominant and submissive subjects 
attributed considerable dominance to their 
partners, regardless of the partner’s personal 
dominance. This finding may be attributed in 
part to the cooperative nature of the task as 
well as the fact that dominance is an approved 
trait for males, so that attributing dominance 
to another is a way of indicating social desira- 
bility (Edwards, 1957). 

Generalizations from the achievement and 
perceptual findings are restricted in that the 
subjects were males chosen from a rather re- 
stricted socioeconomic-educational stratum of 
society as well as from the extremes of the dis- 
tribution of dominance. However, the task is 
representative (Brunswik, 1947) of many life 
experiences, where two individuals work to- 
gether in the pursuit of some objectively 
mutual goal under circumstances requiring a 
good deal of communication and cooperation. 


SUMMARY 


This study related different pairings of 
dominant and submissive males, who inter- 
acted under various role assignment conditions, 
to their joint achievement in a cooperative 
problem solving situation. Predictions of dif- 
ferential achievement were derived from the 
thesis that the productivity of a pair depends 
upon the degree to which conditions permit 
each member to utilize his habitual patterns 
of interpersonal behavior. 

A personality inventory was administered to 
748 male university undergraduates. Volun- 
teer subjects scoring very high or very low on 
a dominance scale were selected for participa- 
tion in the cooperative task situation. The task 
was the operation of two model railroad trains 
for a series of six 3-minute trials. All groups 
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were homogeneous with respect to age, and 
Score on social responsibility, self-control, and 
vocabulary tests. 

Of four predictions concerning the relative 
achievement of the seven interaction groups, 
three were confirmed. The most productive 
group was composed of pairs in which the 
dominant subject was assigned the dominant 
role and the submissive partner the submissive 
role. The least productive group was composed 
of these pairings with the roles reversed. Paired 
dominant subjects were more successful than 
paired submissive subjects, and both of these 
Pairings achieved more when assigned roles, 
All groups showed significant increases in per- 
formance across trials. 

The degree of dominance attributed to others 
was a function of the subjects’ personal domi- 
nance. Dominant subjects perceived their 
partners as less dominant than themselves and 
submissive subjects perceived their partners as 
more dominant than themselves. The degree 
of dominance ascribed to the self and the 
partner varied in the direction of the assigned 
role. 

It was concluded that congruence of role and 
habitual pattern within the subject and com- 
plementarity of patterns as between subjects 
were major determining variables in coopera- 
tive achievement. Sullivan’s (1953) personality 
theory was employed in theoretical considera- 
tion of these results. 
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DELAY OF GRATIFICATION, 
AND ACQUIESCENCE 
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NEED FOR ACHIEVEMENT, 
IN ANOTHER CULTURE 
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Rosine between overt behavior 


sought foundations in personality research. 
This paper explores the relationship between 
a person's choice preferences for immediate, 
smaller (ImR) as opposed to delayed, larger 
(DelR) reinforcement and his motive system. 
It thus secks to extend our knowledge of 
preference for delayed gratification, some 
antecedents and correlates of which have 
already been investigated (Mischel, 1958, 1959, 
1961), Here choice behavior with respect to 
preference for ImR or DelR is related to the 
need for achievement (n Achievement) as 
scored from fantasy material in the manner 
described by McClelland and his associates 
=” Atkinson, Clark, & Lowell, 
‘The central aim is not primarily to elaborate 
the network of correlates of n Achievement, 
but rather, to clarify some of the variables 
relevant to preferences for ImR or DelR 
relating such choices to a motive that 
logically relevant and whose measurement and 
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mentally with respect to in 
of gratification (Mischel, 1999), 

Rewarch on the need F ae ee 
quires no new summary (eg, Atkinson, N 
It is consistent with past theorising te 


DelR as to ImR. The esence of the 
definition of n Achievement is “competition 
with a of excellence” 


ability 1 delay and postpone the relatively 
trivial bot immediately available for the sake 
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of later but more important outcomes. Achieve- 
ment fantasies may be thought of in part as 
reflecting as well as sustaining and mediating 
the individual’s strivings for future rewards 
and attainments of excellence. Indeed, im- 
bedded in the operations for scoring n Achieve- 
ment is the criterion of “long-term involve- 
ment.’ Clearly there can be little realization 
(and thus little maintenance) of the motive to 
compete with a standard of excellence unless 
the person is able to delay immediate but 
smaller gratifications and to choose instead 
larger future rewards and goals. 

A direct and positive relationship between 
preference for DelR and n Achievement, then, 
is to be expected for the following reasons. It 
has been argued (Metzner, 1960) that one of 
the crucial conditions that facilitates the 
development of the ability to delay gratifica- 
tion is the acquired reward value of working 
itself. Learning to delay starts when behaviors 
based only on the pleasure principle are either 
frustrated or punished. Then, fantasy becomes 
an acquired primary process substitute and 
work a secondary process detour, both of 
which are rewarded by eventual (larger) grati- 
fications, thus ultimately becoming rewarding 
activities in their own right. Liking to work for 
its own sake is generally assumed to be a 
basic ingredient of the high n Achievement 
pattern. Presumably, persons high in n 
Achievement have learned to like work, and 
they have learned this in part as a response to 
demands to forego immediate gratification in 
favor of more long-term goals. That is, they 
have learned to tolerate waiting periods and 
in the course of such socialization have built 
up a general readiness to delay when it is 
demanded by conditions or people. Conse- 
quently, when persons high in n Achievement 
are given choices between immediate, smaller 
rewards and delayed, larger ones, the balance 
between the negative valence of waiting and 
the positive valence of the delayed, larger 


2 Since “long-term involvement” is a criterion used 
in scoring for the presence of Achievement Imagery, 
and may have direct overlap with the ability to delay 
reinforcement, it is important to note that on only 
seven of the protocols did this criterion serve as the 
basis for Achievement Imagery scoring, and three of 
these protocols were given by subjects not consistently 
preferring delayed, larger reinforcement on the choice 
measures. 
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reward is weighted in favor of the latter be- 
cause the former is relatively minor for them, 
This is in contrast to the state of affairs with — 
persons who have not learned work as a 
tesponse to the delay situation, or who have 
not acquired intrinsic interest in achievement 
(ie., persons low in n Achievement). The 
anticipated positive relationship between 
preference for DelR and n Achievement does 
not depend on an absolute preference by ` 
individuals high in n Achievement for the 
delayed alternative, but, rather, applies 
equally to relatively less nonavoidance of the — 
delayed alternative on their part. 

In addition to the central aim of testing the 
relationship between the need for achievement 
and patterns of preference for reinforcement 
with respect to delay of gratification, the 
paper also presents data relevant to the 
measurement and correlates of n Achievement 
itself, and to further correlates of preference 
for ImR or DelR. Especially for use in cross- 
cultural research, in which verbalizations in 
response to Thematic Apperception Test 
(TAT) pictures appropriate for the usual 
scoring of n Achievement often are difficult 
or impossible to obtain, alternative n Achieve- 
ment measures are much needed. Graphic 
materials (“doodles”) scored for n Achieve- 
ment (Aronson, 1958) have been used with — 
young children. The present study offers 
another alternative to measuring n Achieve- 
ment via the TAT, far simpler than the Test 
of Insight (French, 1958), and possibly ap- 
propriate for use in many cultures. 

Further, an attempt was made to use @ 
measure—suitable for research in another 
culture—of simple acquiescence or “agreeing 
tendency.” Much recent research with United s 
States samples (e.g., Bass, 1956; Messick & 
Jackson, 1957) has focused on the response set 
to agree, and an intensive investigation of the 
problem (Couch & Keniston, 1960) has related 
“yeasaying” as opposed to “naysaying” to two 
personality syndromes. These syndromes ate 
conceptually germane to the present distinc- 
tion concerning preference for ImR-Delts. 
“Yeasayers were shown to be individuals with 
weak ego controls, who accept impulses without 
reservation, and... easily respond to stint 
exerted on them. The naysayer inhibits ar 
suppresses his impulses...” (p. 173)- The 
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former “can freely indulge in impulse gratifica- 
tion” whereas “the egos of naysayers take over 
the controlling functions of the parents, and the 
suppression of impulses is subsequently self- 
maintained and self-rewarded” (p. 173). A 
cluster of scales including impulsivity and 
dependency characterized the positive end of 
the agreeing tendency; the disagreeing tend- 
ency was defined by scales including ego 
strength, responsibility, and trust. Further, in 
interviews, yeasayers (in contrast to naysayers) 
described themselves as “... unreflective, 
quick to act, easily influenced, and unable to 
tolerate delays in gratification” (p. 170). 

Clearly, these two personality syndromes 
are directly relevant to preference for ImR- 
DelR. Indeed, I have reasoned previously 
(Mischel, 1958), with some direct experimental 
support (Mahrer, 1956), that “trust,” or the 
expectancy that the promised delayed reward 
is forthcoming, is a basic factor in preference 
for DelR, and empirically have related prefer- 
ence for DelR to social responsibility (Mischel, 
1961). It is to be expected that DelR preference 
should be related to naysaying and ImR 
preference to yeasaying and to the personality 
syndromes associated with each. Similarly, 
in the realm of n Achievement research, 
numerous attempts, mostly with mixed or 
dubious success (e.g., Samelson, 1958) have 
been made to relate n Achievement inversely to 
conforming behavior. Acquiescence may be 
thought of as a special case of conforming in 
an ambiguous situation, and again in the 
Present research the often expected but rarely 
attained relationship between n Achievement 
and nonacquiescence was anticipated. 

Lastly, data were also collected on social 
responsibility, as measured by an inde- 
pendently validated scale (Harris, 1957), 
substantially correlated with other measures of 
adjustment and maturity. Replication was thus 
Sought for a previously established positive 
telationship between measures of preference 
for DelR (as opposed to ImR) and social 
Tesponsibility (Mischel, 1961) and to de- 
termine the extent of interrelationship be- 
tween social responsibility and the other 
variables measured in this study. 

The over-all methodological aim is to aid 
the development of relatively simple but 
quantifiable techniques for testing psychologi- 
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cal hypotheses in cultures much different 
from our own in which measures developed in 
the United States are inappropriate or ex- 
cessively cumbersome. Such techniques are 
needed if the generality of psychological 
principles established on United States 
samples is to be extended to grossly different 
populations. 
METHOD 

Subjects 

A total of 112 Trinidadian Negro children (68 boys 
and 44 girls), all in the age group 11-14, with a modal 
age of 13, were tested in a government school located 
near the capital city. The school contained children 


from the urban environs, primarily from the lower 
middle and lower socioeconomic strata of the society. 


Measures of Preference for ImR or DelR 


Three preference measures (a behavioral choice and 
two questionnaire items) were used to elicit preference 
for ImR or DelR. These were selected on the basis of 
earlier research with comparable Trinidadian samples, 
The general procedures used in the pretesting and 
selection have been described in detail (Mischel, 1959, 
1961). Briefly, the behavioral measure consisted of a 
choice between ImR or DelR in the form of a small 
candy bar (costing approximately 10 cents) available 
immediately or a much larger candy bar (costing ap- 
proximately 25 cents) for which the subject must wait 
one week,? The reward choice was administered as the 
last item in the total procedure and was structured as 
a way of “thanking” the subjects for their cooperation, 
The two verbal items, one inserted near the beginning, 
the other near the middle of the procedure, were: 

1. I would rather get ten dollars right now than 
have to wait a whole month and get thirty dollars then, 

2. I would rather wait to get a much larger gift 
much later rather than get a smaller one now. 

Previous research (Mischel, 1961) has shown that 
the subject’s responses over three such measures are 
significantly interrelated and that the three measures 
in combination tend to provide a more useful classifica- 
tion of subjects with respect to ImR or DelR prefer- 
ences than does a single measure, Consequently, follow- 
ing the 1961 procedures, subjects were divided for 
purposes of data analysis into “Consistent ImR” (ImR 
preference over the three measures), “Consistent DelR” 
(consistent DelR preferences over the three measures), 
and “Inconsistent R” (one ImR and two DelR re- 
sponses, or the reverse), Although the predictions center 
on differences between the Consistent ImR and Con- 
sistent DelR groups, results are also presented in terms 
of the number of DelR choices out of the total three 


3The size of the immediately available reinforce- 
ment used here was deliberately greater than that used 
in past work (Mischel, 1961) in order to increase the 
probability of a more equal dichotomy in the immediate, 
smaller versus delayed, larger choice distribution. 
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possible made by the subject, i.e., 3 DelR, 2 DelR, 1 
DelR, or 0 DelR. 


Measures of n Achievement 


Two measures of n Achievement were used. The 
first was the standard group-administered procedure 
developed by McClelland and his associates (1953), 
using five TAT-type cards.‘ The second measure was 
an innovation, in the form of an open-ended aspiration 
question. The latter was introduced in the context of 
“let’s pretend there is a magic man”: “Now let's pre- 
tend that the magic man who came along could change 
you into anything that you wanted to be, what would 
you want to be?” Subjects were instructed to answer 
in one word. 


Measure of Acquiescence 


A simple measure of acquiescence or conformity 
suitable for large scale research in other cultures— 
simple to answer, simple and rapid in administration, 
and meaningful in other cultural contexts—was sought. 
The measure adopted consisted of the following in- 
structions, administered orally by the experimenter: 

I have something in mind. I am closing my eyes 
and concentrating and thinking of it ... it is some- 
thing that you might agree with or that you might 
disagree with ... you might disagree with what I’m 
thinking of or you might agree with it ... if you 
agree put down a Y for Yes; if you disagree put 
down a N for No. (This was repeated twice, reversing 
the position of Y and N in the sequence of instruc- 
tions.) Now I’m concentrating on it (closing eyes); 
go ahead. 


Measure of Social Responsibility 


Social responsibility was determined by response to 
a quite carefully constructed and independently vali- 
dated Social Responsibility Scale (Harris, 1957). The 
Social Responsibility Scale (SRS) used has been found 
to correlate substantially with other measures of per- 
sonal and social adjustment and was designed to “dis- 
criminate children who have, with their peers, a reputa- 
tion for responsibility as contrasted with children who 
have little reputation for responsibility” (p. 326). All 
items from the 50-item SRS (Harris, 1957, called “Social 
Attitudes Scale,” p. 324, Table 1), with the exception 
of Item 50, were administered in the modified form 
described earlier (Mischel, 1961). 


Procedure 


The experimenter was introduced as an American 
from a college in the United States interested in gather- 


4The five TAT-type pictures used were two men 
(“inventors”) in a shop working at a machine; a boy 
in a checked shirt at a desk, an open book in front of 
him; a man at his desk in a dark office; a man looking 
at photos; a family at dinner. They were administered 
in printed booklets with the standard four questions 
printed for each picture on a separate page. The only 
modification made in the reproduction of the cards was 
a darkening of the pigmentation of the people in the 
pictures. 
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ing information on the children in the various schools 
of the island. To help with this the subjects were asked 
to answer a number of questions. The details of the 
instructions are similar to those previously reported 
(Mischel, 1961). Subjects were seated sufficiently far 
apart from each other to give reasonable assurance 
that their choices were made independently. All testing 
was done in group settings in large classrooms, with two 
sessions. All measures except the TAT and the candy 
choice were given in the first session, 


RESULTS AND DISCUSSION 


Examination of both sex and age in relation 
to preference for DelR or ImR revealed no 
differences approaching statistical significance 
and consequently the male and female data 
and all age groups were treated collectively in 
subsequent analyses. 


n Achievement and Preference for Delayed 
Reinforcement 

Inspection of the responses to the five 
TAT-type cards indicated that the protocols 
were, on the whole, impoverished and short, 
with relatively little elaboration, a finding not 
unexpected for this culture. Simplified scoring 
for n Achievement was therefore decided upon. 
Protocols were scored only for presence Or 
absence of achievement imagery (AI or UI, 
scored +1 or —1), or doubtful achievement 
imagery (TI, scored 0), in the manner de- 
scribed by McClelland et al. (1953), without 
continuing additional scoring of AI cards, 
This simplified procedure avoided the neces- 
sity for making subtler and potentially less 
reliable scoring decisions on relatively frag- 
mentary responses that would be especially 
difficult to interpret, as normative scoring 
samples are not available for the culture. 
Since the range of scores obtained with the 
shortened scoring method for five C 
extended from +5 to —5, and since any lack 
of discrimination through simplified scoring 1$ 
likely to result in inability to reject the null 
hypothesis, the procedure seemed justifiable. 

Comparison of subjects with respect tO 
delay of reinforcement patterns as they related 
to n Achievement was done as follows. Subjects 
were first divided into Consistent ImR, Con- 
sistent DelR, and Inconsistent R, to categorize 
their choices with respect to the three measures 
of preference for ImR or DelR, the choice 
preferences with respect to DelR also being 
shown in terms of the number of DelR choices 
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TABLE 1 
REINFORCEMENT PREFERENCE WITH RESPECT TO DELAY IN RELATION TO 
n ACHIEVEMENT SCORES, ASPIRATIONS, AND ACQUIESCENCE 
+ 
n Achievement p oe ee tribution Aspirations® Beis 
Reintorcaaeai ese 
5 -s5 | -2 +1 | +3 Achieve- 
N u’ esp Occupa- Personal 
Bpo |B] Bs] tont | RSE | Trait | Yes) No 
Consistently Delayed: fone 
3 37 92 2.35 1/12)6] 7{ 11 18 4 14 25 | 12 
Inconsistent: 
18 1.11 0| 4/4] 6| 4 8 2 7 12| 6 
0 2.32 
1 27 —.74 8| 4/6) 8| 1 4 5 17 21) 6 
Consistently Immedi- 
ate: 
0 30| —.70 2.62 | 11] 6/3] 5| 5 9 0 17. | 27 | 3 
a ee OT zue) K — = 
Totals: 112 20 | 26 |19 | 26 | 21 39 11 55 85 | 27 


® N of delayed reward choices made by subject, 


b Subject’s total score for the five TAT cards: N of subjects in each category of distribution. 
© N of subjects giving Occupational aspirations, Achievement Trait aspirations, and Personal Trait aspirations; seven subjects failed 


to state any aspirations. 


* t of 3 DelR versus 0 DelR on mean n Achievement = 2.61, p <.01; tof 3 DelR + 2 DelR versus 1 DelR + 0 DelR on mean n Achieve- 


ment = 3.78, p <.001. 


made by the subject out of the three total 
possibilities. Thus the Consistent ImR group 
contains all subjects with 0 DelR choices, the 
Inconsistent group is further subdivided into 
subjects with one as opposed to subjects with 
2 DelR choices, and the Consistent DelR group 
contains all subjects with 3 DelR choices. 
Table 1 presents the means and standard 
deviations of n Achievement scores for subjects 
in each of these categories with respect to 
reinforcement preference. As anticipated, the 
mean n Achievement score of Consistent 
DelR subjects as compared to Consistent ImR 
subjects is significantly higher (¢ = 2.61; p 
< .01). (One-tailed tests of significance are 
used throughout this paper.) The hypothesized 
relationship between preference for ImR as 
opposed to DelR and n Achievement is thus 
strongly supported. Similarly, if subjects are 
dichotomized into “High DelR” (2 or 3 DelR 
responses, M = .98; SD = 2.22), and com- 
Pared on mean n Achievement with “Low 
DelR” (0 or 1 DelR response, M = —.41; SD 
= 2.46), the resulting ¢ is 3.78 (df = 110; p 
< 001). Clearly, subjects showing greater 
Preference for DelR have significantly higher 
n Achievement scores than subjects with 
lesser preference for DelR. Of secondary in- 
terest is the fact that the mean n Achievement 
Scores of subjects in the Inconsistent R cate- 


gory are between those of the Consistent DelR 
and Consistent ImR groups. The difference in 
means between the Consistent DelR group 
and the Inconsistent R group reaches signifi- 
cance (p < .05), but the difference in means 
between the latter and the Consistent ImR 
group does not reach an acceptable level. If 
the data are analyzed in correlational terms, 
a point biserial correlation of .31 (p < .01) is 
obtained from the correlation of n Achievement 
scores with Consistent DelR or Consistent 
ImR choice. A Pearson correlation between the 
number of DelR choices (0-3) and n Achieve- 
ment scores yields an r of .27 (p < .005). 
Responses to the open-ended aspiration 
question used to measure n Achievement were 
classified into three categories that emerged 
from inspection of the data: “Occupational” 
responses and “Trait” responses, the latter 
being further subdivided into “Achievement 
Traits” and “Personal Traits.” The first 
category contained all responses mentioning 
an occupation or profession (e.g., policeman, 
pilot, doctor, priest); the second contained all 
responses mentioning personal traits that 
appeared to be directly achievement-related 
(e.g., important, big shot, bright, successful) ; 
the third contained all other responses, and 
consisted of traits that were not explicitly 
achievement-related (e.g., nice, same, a baby, 
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small, adult, brother, honest, kind, brown). 
Judgments of whether a response belonged in 
the second or third category were made by 
two raters with knowledge of the culture; 
ratings showed more than 90% agreement. 
The Occupational category was of course 
extended to include all possible occupational 
aspirations within the culture. Thus, for 
example, king, prince, queen were included 
in the first category since Trinidad is within 
the British Commonwealth and such aspira- 
tions are analogous to “president” in a 
United States sample. The major conceptual 
distinction between the aspirations in the 
first as opposed to the second and third cate- 
gories is that the former are clearly long-term 
aspirations of a career type, whereas the latter 
are not necessarily long-term goals, and are 
not explicitly career goals, but rather personal 
qualities.® 

Numerous other distinctions can of course 
be made in scoring such responses. The major 
reason for using the present categorization is 
that when Occupational and Achievement 
Trait categories were combined it classified re- 
sponses into a dichotomy which is as close to 
a 50-50 division as it was possible to obtain and 
which also appears to draw, albeit grossly, the 
desired conceptual distinction. The alternative 
possible distinction between occupational 
responses and all others is firmer since it is not 
dependent on nearly as subtle a judgment as 
that entailed in discriminating between kinds 
of traits. Statistical comparisons were made 
in which the first category was opposed to the 
other two combined, as well as in which the 
first two categories were combined and op- 
posed to the third. 

To determine the usefulness of this open- 
ended aspiration question for measuring n 
Achievement, responses in each category were 
related to the mean of n Achievement scores 
as measured in response to the more standard 
TAT-type procedure. Table 2 presents the 
mean and standard deviations of n Achieve- 
ment scores for subjects giving aspiration 
responses in each of the three categories and 
in combined categories, as well as the number 
of subjects in various categorizations of the 


5 The. crudeness of the present scheme should be 
clear, For example, “policeman,” although an occupa- 
tional aspiration, may be less related to striving for 
achievement than is the aspiration to be “honest.” 


WALTER MISCHEL 


TABLE 2 


n ACHIEVEMENT IN RELATION TO AsPIRATIONS 
AND ACQUIESCENCE 
a 


N of subjects giving each| 
ind jig pl Ei Acquiescence 
— 
n Achievement e Bey 
scores a S 
3 #3 HA | Yes | No 
g | 6 | ba 
6 |< À 
Positive 
+3 to +5 13 2 6 15 6 
+1 to +2 12 5 10 7 9 
0 6 1 9 14 5 
Negative 
=i to -2 7 3 “4 21 5 
=3 to -5 1 0 16 18 2 
Totals: 39 il 55 85 27 
n Achievement M: 1.44 1.09 —.73 04 19 
m 
1.36 —.42 
n Achievement SD; 1.89 2.15 2.56 2.53 2.25 
SS 
1.96 2.59 


Note.—t of Occupational aspirations versus all other aspira- 
tions on mean n Achievement = 3.88; p <.001. ¢ of Occupational 
aspirations + Achievement Trait aspirations versus Personal 
Trait aspirations on mean n Achievement = 4.54; p <.001, 

^ Total score for the five TAT cards. 


n Achievement distribution giving each kind 
of aspiration. The difference between the n 
Achievement means of subjects giving Oc- 
cupational responses as compared to all others 


is significant (f = 3.88; p < .001), the mean y 


of n Achievement scores of subjects in t 
group giving occupational aspirations being 
significantly higher. Similarly, comparison of 
the mean n Achievement of subjects giving 
occupational responses or Achievement Traits 
versus all other responses results in @ t of 
4.54 (p < .001), the former having signifi- 
cantly higher n Achievement scores than the 
latter. A Pearson correlation between Occupa- 
tional responses, Achievement Trait responses, 
and Personal Trait responses (assigned values 
from 2 to 0) and n Achievement scores yields 
an r of 41 (p < .001). 

This short-cut method may serve as 4 
useful alternative to the usual much longet 
procedure of scoring fantasy material for P 
Achievement, especially when the latter 4 
difficult to obtain, as it is in many kinds 
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field work. Some very recent and independently 
formulated attempts to relate n Achievement 
to realism in vocational aspiratians (Mahone, 
1960) should be noted, and provide further 
encouragement for exploring relationships be- 
tween motive variables and aspirations. 

The data for the relationship between 
aspirations and reinforcement preference pat- 
terns with respect to delay are presented in 
Table 1. A chi square comparison of subjects 
giving 3, 2, 1, and 0 DelR choices with the 
number of responses given by these groups in 
each of the three aspiration categories cannot 
be made meaningfully since this results in an 
excess of expected frequencies that are less 
than 5. Combining categories, subjects giving 
2 or 3 DelR choices (“High DelR”) were 
compared with subjects giving 0 or 1 DelR 
choices (“Low DelR’’) in regard to the num- 
ber of responses in each of the three categories 
of aspirations. This comparison results in a 
chi square of 7.49 (df = 2; p < .05). Subjects 
with “High” DelR preference tend to give 
more Occupational and fewer Personal Trait 
Tesponses than subjects with “Low” DelR 
preference. Chi square analyses of High-Low 
DelR in relation to number of Occupational 
aspiration responses versus number of all 
other aspiration responses, or in relation to a 
combination of the Occupational category and 
the Achievement Trait category as opposed to 
the Personal Trait category, both show a 
significant relationship of the same nature 
(p < .02, chi squares corrected for continuity). 


Acquiescence, Reinforcement Preference, and n 
Achievement 

The acquiescence data for yeasaying or 
naysaying in an ambiguous situation are 
related to the reinforcement preference pat- 
terns in Table 1. Consistent DelR subjects 
gave 25 yes and 12 no responses; consistent 
TmR subjects gave 27 yes and 3 no responses. 
Application of the chi square test to these 
data results in an uncorrected chi square of 
4.80 (df = 1; p <.05). Similarly, a dichot- 
omization into “High DelR” as opposed to 
“Low DelR” to compare number of yes or no 
Tesponses, results in an uncorrected chi square 
of 4.38 (df = 1; p < .05).6 As anticipated, 

° If extreme caution is used and the decision is made 


to apply the Yates correction for continuity on all 
2 X 2 contingency tables, although all expected fre- 
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subjects high in preference for DelR gave 
relatively more no and fewer yes responses 
than did subjects low in preference for DelR. 

In spite of its relative weakness, this associ- 
ation is striking since only one item was used 
to measure acquiescent yeasaying tendencies. 
The relationship between yeasaying and 
naysaying tendencies and reinforcement prefer- 
ence patterns would seem well worth further 
study with more reliable multiple measures to 
obtain yeasaying data. The present measure of 
acquiescence may have advantages in studying 
various aspects of acquiescent behavior by 
permitting easy manipulation of the reference 
group, e.g., merely changing the public 
identification of the examiner. The method 
may also be used readily to study conforming 
tendencies cross-culturally. The preponderance 
of yes responses in the present results may be 
taken as a caution against excessive generaliza- 
tions about strong conforming tendencies in 
our own culture without reference to com- 
parative data. The primary relevance of these 
data, however, is that they serve as an empiri- 
cal link between the patterns isolated by 
Couch and Keniston (1960) to distinguish 
yeasayers and naysayers and the present 
distinction between subjects consistently 
preferring DelR or ImR. Clearly the patterns 
have strong conceptual overlap: the present 
findings provide support for their empirical 
overlap. 

The n Achievement scores obtained both 
from the standard n Achievement measure and 
the aspiration measure were related to yea- 
saying as opposed to naysaying. Theoretical 
considerations about the personality correlates 
of high n Achievement lead to the anticipa- 
tion that persons high in n Achievement 
should show less acquiescence or yeasaying, 
However, although the mean n Achievement 
scores (on the standard measure) of non- 
acquiescent subjects is higher (V = 85; M 
= 04; SD = 2.53) the difference clearly falls 
short of acceptable significance levels and 
can only be interpreted as a trend (t = 1.35; 


quencies in each cell do exceed 5, the comparison of 
Consistent DelR with Consistent ImR subjects on 
acquiescence results in a chi square of 3.59; the com- 
parison of the “High” versus “Low” preference for 
DelR dichotomy with acquiescence results in a chi 

of 3.51. A chi square of 3.84 is required for 
p <.05 and of 2.71 for p <.10. 
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10 > p > .05, one-tailed test). Table 2 shows 
the number of yes and no responses given to 
the acquiescence measure by subjects in 
various categorizations of the total distribu- 
tion of n Achievement scores for the five TAT 
stories. Comparison of subjects with positive 
scores as opposed to subjects with negative 
scores on the number of yes versus no re- 
sponses results in an uncorrected chi square 
of 3.59 which approaches but does not reach 
an acceptable significance level (p < .10; chi 
square corrected for continuity = 2.72, p < 
.10). If subjects with zero and negative n 
Achievement scores are combined and com- 
pared with subjects having positive n Achieve- 
ment scores, this represents the closest 
median split possible, using all subjects on 
the yes-no dichotomy. An uncorrected chi 
square of 2.73 with p < .10 results. In sum, a 
trend of such dubious statistical significance 
appears to characterize the relationship be- 
tween our measure of acquiescence and the 
TAT n Achievement measure that we cannot 
safely reject the null hypothesis. As such, the 
present findings share the uncertain conclu- 
sions about the relationship between acquies- 
cence and n Achievement obtained by other 
investigators (e.g., Samelson, 1958). 

The data also permit a test of the relation- 
ship between the aspiration measure of n 
Achievement and acquiescence (Table 3). If 
subjects giving Occupational aspirations are 
combined with those giving Achievement 
Trait aspirations and compared to those 
giving Personal Trait aspirations with respect 
to the frequency of the yes or no responses, the 
resulting chi square does not approach statisti- 
cal significance. However, comparison of 
subjects giving Occupational aspirations with 
those giving all other aspirations on acquies- 
cence results in a chi square of 4.27 (corrected 

TABLE 3 


ACQUIESCENCE IN RELATION To EACH 
CATEGORY OF ASPIRATIONS 


Aspirations 
Acquiescence 5 
Occupational eee amini 
Yes 24 10 44 
No 15 1 il 


Note.—Chi square comparison of Occupational aspiration; with 
all others on Yes versus No: x? = 4.27 (corrected for continuity); 
p <.05. 
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for continuity), p < .05. Subjects gi 
pational aspirations, when compared 
giving all other aspirations, significan 
relatively less yeasaying and more naj 
It is of interest that the aspiration m 
Achievement is thus significantly ri 

acquiescence whereas results from th 
standard fantasy measure of n Achieve 
fall short of statistical significance. Even m 
clear-cut relationships may perhaps b 
pected when data that lend themselves 
detailed and subtle scoring are emplo 
when multiple acquiescence measures are 


Social Responsibility in Relation to the Ob 
Measures 

As regards the SRS, the relationship 
preference for DelR and social respons 
reported earlier (Mischel, 1961) is replic: 
Pearson correlation of .28 (p < .005, 
tained between number of DelR choices. 
and SRS scores. Using ¿ tests, 
differences between means (p < 
obtained when “High DelR” subjects 
2 DelR choices) are compared with 
DelR” subjects (1 and 0 DelR choi 
means and SDs of the former are 31.2 
4.28, and of the latter 29.05 and 3.7 
secondary finding of considerable ini 
(Table 4) is a strong relationship between 
and n Achievement (r = .40, p < .001 
TAT measure; r = .33, p < .001 with aspi 
tion measure). This appears to be one of t 
very few instances in which a significant pO: 
tive correlation is obtained between n A 
ment and a questionnaire measure. TI 
cally, although not specifically predi 
seems extremely plausible that so 
sponsibility and achievement moti 
should be interrelated. Operationally, 1 
encouraging that at least in this cult 
setting, questionnaire data do appear 
lated with scores based on fantasy matt 


An Overview 


Summarizing the results, Table 4 pre 
the intercorrelations of preference for 1 
the two n Achievement measures, and 
Preference for DelR, n Achievement m 
by two different but intercorrelated mé 
(TAT and open-ended aspirations), and 
scores are all significantly and ë 
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interrelated. The acquiescence measure is not 
significantly associated with social responsibil- 
ity scores (r = .06, p < .20), Although, as 
discussed above, low significant relationships 
do seem to hold between naysaying and 
preference for DelR and between paysaying 
and Occupational (as opposed to all other) 
aspirations. Finally, partial correlations were 
computed between preference for DelR and 
n Achievement, holding SRS constant, and 
between preference for DelR and SRS, holding 
n Achievement constant. In both cases, low 
but positive and still significant correlations 
result (r = .18 and .19, respectively, p < .05). 
This finding suggests that the major measures 
used to delineate the cluster of correlates 
associated with preference for DelR as opposed 
to ImR, while themselves significantly inter- 
correlated, still appear to tap different aspects 
of the constellation. 

The main import of the over-all findings is 
that they relate reinforcement preference to 
other more thoroughly explored variables. 
When the present findings are taken collec- 
tively with those obtained on other previously 
reported samples, a network of meaningful 
correlates becomes apparent. These are inter- 
preted as providing considerable evidence 
that an obviously crucial but relatively un- 
studied dimension of behavior, the ability to 
delay the relatively trivial but immediately 
available for the sake of larger but later 
outcomes, can be explored usefully with 
telatively simple direct measures of choice 
preference. The correlates of these choice 
preferences are becoming increasingly clear, 
although controlled studies of antecedents are 
still lacking. 

The simple preference patterns with respect 
to delay elicited thus far provide only an early 
model and a prelude to more complex and 
subtler batteries. In such batteries, delay 
preference patterns systematically sampling a 
variety of value and need areas could be tapped, 
and the valences of particular intervening 
work and fantasy activities (in addition to the 
simple waiting period over a time interval used 
at present) could be elicited. The ultimate aim 
would be the construction of predictive indi- 
Vidual topographies for complex choices on 
the dimension of delay, and the measurement 
of individual time lines. 

The findings also support the fruitfulness 
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TABLE 4 
INTERCORRELATIONS OF PREFERENCE FOR DELAYED, 
LARGER REINFORCEMENT (DelR), n ACHIEVEMENT, 
AND SocIaL Responsisitity (SRS) 


(N = 112) 
n Achievement 
SRS 
TAT Aspirations* 
Preference for DelR r= 27° | C= 26° r= ,2g°° 
n Achievement 
TAT rm Aer? |y m .40eer 
Aspirations* r= 33 
*N = 105, 
* p <.05 (x? = 7.49, df = 2), 
** p <,005. 
"> <,001. 


of devising simple quantifiable measures for 
eliciting data on potentially complex dimen- 
sions in field research situations in other 
cultures. Although such field research is often 
thought to present serious methodological 
problems when quantifiable data on psychologi- 
cal dimensions are sought, the present 
results indicated, on the contrary, that ex- 
ceedingly simple measures may serve to yield 
useful data that might only be elicited from 
more sophisticated subjects by much more 
complex methods. In the present situation, 
the subjects’ lack of psychological sophistica- 
tion appears to have been a strong asset in 
circumventing sources of error that so often 
bedevil comparable researches. 


SUMMARY 


This study, using 112 Trinidadian subjects 
aged 11-14, tested the relationship between 
preference for immediate, smaller (ImR) or 
delayed, larger (DelR) reinforcement in choice 
situations and (a) n Achievement, (b) acquies- 
cence (yeasaying or naysaying in an ambiguous 
situation). The expected significant positive 
relationships between preference for DelR and 
n Achievement were found. The inverse 
relationship between preference for DelR and 
acquiescence was, as predicted, also obtained. 
n Achievement was inversely related to 
acquiescence but only as a statistically non- 
significant trend. Responses to a simple open- 
ended aspiration measure, appropriate for use 
in other cultures, were found to significantly 
differentiate high and low n Achievement 
subjects as measured by the standard pro- 
cedure, more occupational aspirations being 
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given by high n Achievement subjects. The 
possible usefulness of this aspiration measure 
as an alternative index of n Achievement in 
field work was suggested. Data on the inter- 
relationships between a measure of social 
responsibility and the other variables were 
also presented. The over-all findings support 
the usefulness of relatively simple choice 
situations for eliciting preference for ImR as 
opposed to DelR in the attempt to elaborate 
the variables relevant to the ability to delay 
immediate gratification for the sake of later, 
larger outcomes, within the context of another 
culture. The present findings serve to elabo- 
rate the network of correlates of choice pref- 
erences for delayed, larger gratifications when 
these are opposed to smaller but immediately 
available rewards. 
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EDNICK (1958) and Chapman (1958) 
have suggested that the aberrant 
language and conceptual behavior 

found in schizophrenia are, in part at least, 
due to the competition and intrusion of inap- 
propriate associative responses. These in- 
vestigators differ, however, in the assumptions 
underlying this assertion. Mednick, using 
Hullian learning theory (Hull, 1943) as his 
frame of reference, assumes that schizophrenia 
is characterized by heightened anxiety (D), 
which leads to an increase in the habit strength 
of remote associations within the response 
hierarchy. Thus, in simple tasks, i.e., those 
with few competing responses, schizophrenic 
performance should be superior to that of 
normals, whereas in more complex situations 
the reverse should hold. Mednick and DeVito 
(1958) support this contention. They required 
schizophrenics and normals to learn a list of 
paired-associates of low complexity (e.g., table- 
chair, dark-light, etc.) and one of high com- 
plexity (e.g., table-gas, comfort-chair, etc.). 
Complexity, therefore, was equated with the 
extent to which an S-R pair was in keeping 
with the cultural frequency of association 
between them (Russell & Jenkins, 1954). 
Schizophrenics learned the list of low com- 
plexity more quickly than did normals, 
whereas the rank order of groups was sig- 
mney reversed with the high complexity 
ist. 

For Chapman (1958), the interference hy- 
pothesis appears to have arisen chiefly from 
clinical observation (Arieti, 1955; Cameron, 
1944) and, as such, is the result of empirical 
generalization rather than theoretical deduc- 
tion. Chapman used a conceptual card-sorting 


1 The authors wish to express their gratitude to the 
staff of the Veterans Administration Hospital, Lexing- 
ton, for their assistance in permitting the study to be 
performed at that facility and for their assistance in 
recruiting subjects. 


task in which the subject was required to 
place an object-name (e.g., hat) in one of three 
categories: a member of the same class (cap), 
an associatively linked object (head), or an 
irrelevant object (street). Schizophrenics were 
found to commit more errors of the associated 
object type than were normals, especially with 
an increase in the difficulty of concept (Chap- 
man, 1958). Ignoring the differences in ration- 
ale, the results of the Mednick and Chapman 
studies are quite compatible. 

The generality of the interference hypothesis 
is open to question, however. The Chapman 
and Mednick studies have certain features in 
common: The S-R associations were learned 
outside of the experimental situation (probably 
prior to the onset of psychosis), and the find- 
ings therefore represent some degree of con- 
founding of acquisition and retention. Also, 
the associations were meaningful, a form of 
association which many authors interpret as 
being dependent upon the mediation of im- 
plicit responses (Osgood, Suci, & Tannen- 
baum, 1957). 

In an effort to clarify the range of valid 
application of the interference hypothesis, the 
present investigation explored the suscepti- 
bility of schizophrenics to interference effects 
in both the acquisition and retention of a set 
of rote verbal habits. In a serial learning 
situation, amount of interference in acquisition 
was manipulated by means of intralist simi- 
larity, while the analogous function in reten- 
tion was served by varying the interlist 
similarity according to the retroaction para- 
digm. This approach has the further advantage 
of employing an experimental technique about 
which much is known. 


METHOD 
Subjects 
Seventy-two subjects were used in the. experiment, 
36 normals and 36 schizophrenics. The normals were 
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TABLE 1 TABLE 2 
EXPERIMENTAL Lists ANALysIS OF TRIAL, Errors/TRIAL, AND RATE 
MEASURES IN LEARNING Tasks 
NI M NIL 
Trials Errors/Trial | Seconds/Trial 
XUB MAN YOM Gera d (ed rn Soni 
FOJ DOOR VEP 
QIY END ZAW Groups (G) |» 1| 9.99** | 14.84** 1.17 
KEJ YEAR JEG Lists (L) 1 | 31.66** | 110.39** 28.71** 
YUN GLASS ZUX GXL 1| 0.21 4.90* 0.05 
SIH PLAY KUH Within cells | 68 | (9.58) (0.30) (235.62) 
Total 71 
selected from students enrolled in laboratory sections 
of introductory psychology; the schizophrenics were nS Ni 


drawn from a population of hospitalized veterans. No 
subject had previously participated in a verbal learning 
study. The schizophrenic group had a mean of 11.9 
years of schooling? and a mean age of 31.8 years. All 
patients were receiving some form of tranquilizing 
medication at the time of testing, and all but 10 were 
from so-called chronic wards. The dosage for all but 11 
of the patients was less than 100 milligrams per day. 
No attempt was made to differentiate among diagnostic 
subtypes, but case history material was examined to 
remove those who had ever carried a diagnosis other 
than schizophrenia. 


Experimental Materials 

As learning was expected to proceed at the slowest 
rate in the condition in which schizophrenics were re- 
quired to learn nonsense syllables, an attempt was 
made to select materials for this condition first. A list 
of 13 nonsense syllables of 0-20% association value 
(Glaze, 1928) was composed and presented to six pa- 
tients at a 3-second intersyllable rate. No subject could 
learn the list to a criterion of one perfect repetition. 
The original list was then progressively shortened until 
it had decreased to only 6 syllables (five anticipations) 
in length. Even with this drastic reduction, schizo- 
phrenic subjects found the task exceedingly difficult. 
Accordingly, a new strategy was adopted. In place of 
a machine-paced rate of presentation, subjects were 
given a switch which permitted them to expose the 
syllables at a self-determined rate. Under these condi- 
tions, a list of 6 syllables was learned in a reasonable 
number of trials, 

Three sets of syllables were then constructed—a 
nonsense list (NI), a meaningful list (M), and a second 
nonsense list (NII) to be interpolated between the 
learning and relearning of NI. The meaningful words 
were monosyllabic and occurred with a frequency of at 
least 100 per million (Thorndike & Lorge, 1944). The 
items contained in each of the three lists are presented 
in Table 1. 


Procedure 


The schizophrenic and normal samples were each 
randomly divided into two groups, one of 24 subjects 


2With a sample of 315 schizophrenic patients se- 
lected from the same hospital, Richard M. Griffith has 
estimated the correlation between W-B IQ and years 
of schooling to be .62 (p < .001). 


and one of 12 subjects. The groups of 24 subjects 
learned NI; the groups of 12 subjects learned M. This 
provided for a comparison of the relative performance 
of schizophrenics and normals as a function of intralist 
similarity. The groups which had learned NI were then 
further split into two subgroups of 12 subjects each. One 
of the subgroups received NII as the interpolated task, 
while the other subgroup received M. All who had 
learned NI were then required to relearn it. In every 
case, the acquisition criterion was one perfect repetition, 
and the interlist rest interval was 2 minutes. The recall 
and relearning data permitted a comparison between 
groups with respect to the retroactive interference 
effects of highly similar (NII) and highly dissimilar 
(M) interpolated material. Standard rote serial learn- 
ing instructions were employed. The experimental 
session rarely exceeded 30 minutes, thus, diminishing 
the possibility of intrasession behavioral changes in 
the schizophrenics. Time per trial as well as errors and 
trials to criterion were recorded. 


RESULTS? 


Learning Tasks 

Schizophrenics required significantly more 
trials than normals to attain the criterion of 
one perfect repetition (see Table 2, Trials 
column). However, contrary to what would be 
expected from the interference hypothesis, 
this difference was not more pronounced with 
NI than with M (G X L insignificant). For 
both lists, one-tailed ¢ tests were significant 
between groups; ¢ (46 df) = 2.64, p < -01 and 
t (22 df) = 1.87, p < .05 for NI and M, 
respectively. By the trials variable, therefore, 
group differences remained relatively con- 
stant in spite of a marked difference m list 
difficulty. ) 

As the difference between groups m 
error score was spuriously inflated due to 

3 The authors are indebted to the University of 


Kentucky Computing Center for making their facilities 
available. 


total 


INTERFERENCE EFFECTS 


TABLE 3 


Mean TRIAL, ERRORS/TRIAL, AND SEcoNDs/TRIAL 
MEASURES IN LEARNING, INTERPOLATION, AND 
RELEARNING 4 


A In Relearning 
Learning | “ation NI 
Groups Measures 
NI | u |NI| u | Nuja 
Trials 8.8 | 41| 5.9] 2.6] so] 1.8 
Schizo. | E¥Fors/ 2.6 | 1.4] 2.3] 0.9] 1.9] 0.3 
areal Trial 
Phrenic | Seconds/ | ss | 31 |47 |32 | 46 | 34 
Trial 
Trials 6.2] 2.2] 5.1] 18] 3.5] 1.4 
Errors/ 2.31] 0.5] 1.9] 0.8] 1.8] 0.3 
Normal ‘Trial 
Seconds/ | 54 | 27 | 43 |29 | 38 | 31 
Trial 


unequal opportunities to err resulting from an 
inequality in trials to criterion, the derived 
measure of errors per trial was used. An analy- 
sis of this measure revealed a higher rate of 
error production in the schizophrenic group 
(see Table 2, Errors/Trial column). Again, 
the schizophrenics achieved significantly higher 
scores on both NI, ¢ (46 df) = 2.00, p < .05 
and M, ¢ (22 df) = 3.91, p < .01. An exami- 
nation of the type of error as a function of 
groups resulted in an interesting finding. 
While no qualitative differences were detected 
in the errors of commission (intra- and extra- 
list intrusions), errors of omission (failures 
to respond) were much more frequent among 
schizophrenics. In fact, when this latter type 
of error was removed from the total error 
Score, all error sources of variance fell to in- 
significance with the exception of lists, F 
(1, 68 df) = 29.73, p < .001. Thus, in the 
learning task, it would appear that group 
differences arose from a general decrement in 
the response tendencies of the schizophrenic 
group. 

It will be recalled that with a machine- 
paced rate of syllable presentation, schizo- 
Phrenics found serial learning to be an ex- 
tremely difficult task. Accordingly, it could 
be argued that the schizophrenic has a slower 
self-presentation rate than the normal and that 
this reduced rate may compensate for the 
deleterious effects of response competition. 
This was not the case. The sole significant 
Tate effect was associated with the differences 
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TABLE 4 


ANALYSIS OF TRIAL, Errors/TRIAL, AND RATE 
MEASURES IN INTERPOLATION TASKS 


Sues ay | Trials |Brrorg/Tall Seconds/Trial 
Groups (G) DAS 1.70 0.40 
Lists (L) 1 | 41.03** | 39.00** 9.33** 
GXL 1| 0.00 | 0.44 0.02 
Within cells | 44 | (3.25) | (0.50) | (255.18) 

Total 47 

"p< .01, 


between lists (see Table 2, Seconds/Trial 
column). 

In order to assess the possible effects of 
tranquilizing medication upon performance, 
all of the above criterion measures were re- 
analyzed with levels of drug as a variable. In 
no instance was medication or any of its in- 
teractions a significant source of variance. 


Interpolated Tasks 


With NII and M as the interpolated tasks, 
the performance of schizophrenics and normals 
was statistically indistinguishable (see Table 4) 
by all measures (see Table 3, Interpolation 
column). Meaningful and nonsense lists both 
gave rise to significant group differences in 
the initial learning task. The additional insig- 
nificance of group differences in errors of 
omission during interpolation tends to sub- 
stantiate the interpretation that failures to 
respond accounted for the group differences 
during original learning. 

An analysis of qualitative error differences 
(intrusions of NI into NII) and drug levels 
led to no statistically significant conclusions. 


Retention Task 

As interference effects are primarily con- 
fined to the initial trials of relearning (Under- 
wood, 1945), the number of syllables incor- 
rectly recalled on the first relearning trial 
provides an appropriate index for a test of the 
interference hypothesis as applied to retention. 
By this measure, NII interfered with the recall 
of NI to a significantly greater degree than 
did M (see Table 5, Errors column). Thus, the 
necessary precondition for the test was estab- 
lished. However, when the data were examined 
for group differences, none was found. The 
number of syllables correctly recalled was 
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TABLE 5 


ANALysIS OF ERROR AND TIME MEASURES IN RECALL 
AND TRIALS MEASURE IN RELEARNING 


Recall bee 
Source Of Sage Ones eae i 
Errors _|Seconds/Trial F 
F F 
Groups (G) 1 -66 69 4,83* 
Interpolated 1 | 75.61** 5.40* | 40.83** 
Lists (L) 
GXL 1 .12 +82 1.95 
Within cells 44 | (1.55) | (536.93) | (2.09) 
Total 47 
"p< .05. 
>< .01, 


essentially the same for both groups (2.6 for 
schizophrenics and 2.9 for normals); more 
importantly, the difference between groups 
was invariant over lists (G X L insignificant). 
An analysis of the rate of syllable presentation 
during the recall trial also revealed no group 
differences (see Table 5, Seconds/Trial col- 
umn). The average duration of the recall trial 
was 49 seconds for schizophrenics and 43 
seconds for normals. In short, neither the 
acquisition nor the recall data furnish support 
for the interference hypothesis. 

When all the relearning data were appraised, 
the interpolation of NII was found to produce 
an increase in the number of errors per trial, 
F (1, 44 df) = 183.06, p < .01, and a decrease 
in the rate of syllable presentation, F (1, 44 
df) = 5.90, p < .05, as compared to per- 
formance after the interpolation of M. Again, 
there were no group differences (see Table 3, 
Relearning column). An analysis of the trials 
data resulted in a somewhat different finding. 
In addition to the unequal relearning scores 
following the interpolation of NII and M, 
schizophrenics required significantly more 
trials to relearn NI than did normals (see 
Table 5, Trials column). The schizophrenic 
decrement was primarily due to performance 
after the interpolation of NII as shown by £ 
tests between groups on each of the lists 
separately. Normals relearned NI following 
NII in fewer trials than schizophrenics, £ 
(22 df) = 2.08, p < .025, whereas there was 
no significant difference between groups fol- 
lowing the interpolation of M, ¢ (22 df) = 
0.80, p > .05. A comparison of performance 
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between lists NIL and NI provides supple- 
mentary information. While normals relearn 
NI in fewer érials than they learned NII, ¢ 
(11 df) = 3.29, p < .01, schizophrenics re- 
quired the same number of trials to relearn 
NI as to learn NIT, ¢ (11 df) = 1.29, p > .05, 
As there was no evidence of a group difference 
in recall, the relearning differences reflect a 
general performance decrement in the schizo- 
phrenics unrelated to interference effects. This 
decrement is most pronounced in the NI-NII- 
NI sequence, the condition in which the level _ 
of difficulty remained high throughout the 
experiment. 

No significant group differences in errors of 
commission (intrusions of NII into NI) or 
omission were found in either recall or re- 
learning. As was the case in the learning and 
interpolation tasks, no relationships were ob- 
tained between drug levels and any of the ` 
experimental variables. 


Discussion 


The interference hypothesis, as separately 
advanced by Mednick and Chapman, states 
that the behavior of schizophrenics is ad- 
versely affected to a disproportionate degree 
by increased response competition. The results 
reported here do not support this position. 
Although increased intralist and interlist simi- 
larity markedly interfered with acquisition and 
recall, respectively, the schizophrenic and 
normal groups were equally affected. The 
primary factor accounting for these dis- 
crepant findings appears to be the source of |} 
the response competition. Whereas Chapman ~ 
and Mednick selected distracting material 
from response hierarchies formed through pre- 
existing differential reinforcement, the stimulus 
material (nonsense syllables) employed here 
had not been previously discriminated. Com- 
petition, therefore, was not a product of 
meaningful similarity, but was largely en- 
gendered by primary stimulus similarity among 
the syllables. 

Other evidence tends to support the con- 
clusion that the interference hypothesis, 1s 
tenable only with respect to competition 
arising from meaningful similarity. Abana 
(1955) studied the acquisition of a s 
of adjectives by schizophrenics and normals 
For half of each group, distracting nonsens 
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symbols were placed within a rectangular 
border which surrounded the items of the 
list. Acquisition was retarded hy the distrac- 
tors, but both groups displayed an equal 
decrement. 

The assumption that interference phenom- 
ena, where they do occur in schizophrenia, 
result from heightened anxiety is also of 
questionable validity. O’Connor and Rawns- 
ley (1959) found schizophrenics and normals 
to be no different in the acquisition of the 
conditioned eyeblink, whereas normals of high 
anxiety have been shown to condition more 
rapidly (Spence & Taylor, 1951). The same 
authors reported, however, that verbal condi- 
tioning was inferior in schizophrenics. In work 
concerned with primary stimulus generaliza- 
tion along the pitch continuum, Garmezy 
(1952) found no difference between schizo- 
phrenics and normals under nonstress condi- 
tions. 

In the present experiment, those group 
differences which did emerge were apparently 
the conseqeunce of factors other than a dif- 
ferential susceptibility to interference effects. 
In the initial learning task, the performance of 
schizophrenics was inferior to that of normals 
with both the meaningful and nonsense ma- 
terials. A closer examination of the data re- 
vealed that the difference was largely due to 
a general failure of the schizophrenics to 
respond. In the relearning task, the schizo- 
phrenics were again inferior to the normals. 
This finding was not a function of differences 
in recall, however, as the errors committed 
on the first trial of relearning failed to dis- 
criminate between the groups. In this respect, 
the data are similar to those reported in an 
early study by Hull (1917). The relearning 
differences were primarily confined to the 
experimental sequence of greatest difficulty and 
are perhaps analogous to fatigue effects. This 
supposition is strengthened by the work of 
Bleke (1955), who reported greater reminis- 
cence effects in schizophrenics under conditions 
in which wrong responses led to verbal punish- 
ment. Shakow (1946) has also found reminis- 
cence effects in schizophrenics on the pursuit 
rotor. It should be noted that both of these 
factors, the decreased readiness to respond and 
the brevity of performance, make for difficul- 
ties in attempting to determine whether 


schizophrenia is characterized by a basic con- 
ceptual deficit. 

Another variable influencing the behavior 
of the schizophrenic appears to be the manner 
in which the experimental material is pre- 
sented. Although not studied systematically, 
preliminary work strongly suggests that per- 
formance deteriorates inordinately if the 
schizophrenic is not allowed to program the 
stimuli himself. A psychophysical study of 
brightness discrimination by Wohl (1958) 
leads to the same conclusion. Using the method 
of average error, the difference limen was the 
same for both schizophrenics and normals. 
With the method of constant stimuli, the 
difference limen for the schizophrenics was 
larger. The experimental findings probably 
reflect the waxing and waning of attention so 
frequently reported by clinical observers as a 
predominant feature of schizophrenia. 


SUMMARY 


Two groups of subjects, 36 normals and 36 
schizophrenics, were studied in the acquisition 
and retention of serial lists of varying intralist 
and interlist similarity. An examination of the 
data led to the following conclusions: 

1. The heightened sensitivity of schizo- 
phrenics to interference and distraction is 
confined to response competition based upon 
meaningful similarity. 

2. Prolonged warm-up and brevity of actual 
performance make for grave difficulties in 
achieving a definitive statement concerning a 
basic conceptual deficit in schizophrenia. 

3. Experimental situations in which the 
subject controls the time of presentation of the 
critical stimuli are of particular benefit to 
schizophrenics. 
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DISAGREEMENT WITH OTHERS, THEIR CREDIBILITY, 
AND EXPERIENCED STRESS 


* HAROLD B. 
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o the extent that a person (eo) is not 
able to account for some judgmental dis- 
| agreement with another (alter), he ex- 
periences stress. A large body of literature has 
examined various methods by which ego may 
reduce this stress through resolving the dis- 
agreement with alter (e.g., Back, 1951; Fest- 
inger 1950). Among these methods is for ego 
to employ information he may have regarding 
his ability relative to alter. A number of stud- 
ies have specifically manipulated this factor 
(Festinger, Gerard, Hymovitch, Kelley, & Ra- 
ven, 1952; Festinger & Thibaut, 1951; Gerard, 
1953) and have shown a tendency on the part 
of ego to reject extreme opinions when he be- 
lieves that his ability might possibly differ from 
another holding an opinion very discrepant 
from his own. Gross trends toward group sub- 
division were discovered in this earlier research 
where ego presumed the existence of an ability 
differential. The present study shifts the focus 
by inquiring into processes occurring at the 
individual level that may account for this 
tendency observed in groups. 

When ego is confronted by disagreement, 
the correctness of his judgment is brought into 
question and he must decide how much credi- 
bility to attach to his judgment and to alter’s. 
If he has evidence indicating that his ability 
is superior to that of alter, he should attach 
little credibility to alter’s judgment; the 
impact of the disagreement is thus reduced for 
him. Similarly if ego believes himself to be 
inferior to alter, a basis for resolving the disa- 
greement also exists; that is, he may attach 
little credibility to his own judgment. If, on 
the other hand, he were provided with a basis 
for attributing equal credibility to his own 
and to alter’s judgment he should find it 
relatively difficult to resolve the discrepancy. 

When confronted by disagreement we have 
said that ego should tend to compare his 
ability with that of alter. He should also 
expect alter to be making a similar comparison. 
Given a basis for an ability differential, the 
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evaluation ego has of himself, the evaluation 
he has of alter, and the evaluation he assumes 
alter to have of him should tend to become 
internally consistent. The criterion of con- 
sistency here is the balance criterion of Heider 
(1958) and Cartwright and Harary (1956). 
We have, however, made our performance 
evaluation psychologically interactive by 
endowing them with behavioral implications. 
This dynamic involves an assumption for which 
there is considerable evidence: that ego’s 
disposition toward alter conditions his ac- 
ceptance or rejection of alter’s opinions and 
evaluations, including the evaluation he 
assumes that alter has of him. Inconsistency 
would obtain, for example, if ego evaluated 
both himself and alter positively and he as- 
sumed that alter evaluated him negatively. 
Under these circumstances his evaluation of 
himself would tend to be equivocal. Being 
positively disposed toward alter would, ac- 
cording to our assumption, induce ego to accept 
the evaluation he believes that alter has of him, 
one that conflicts with his own high self- 
evaluation. A strain toward consistency or 
balance should exist that should tend to 
produce a change in one or more of the evalua- 
tions. After having been provided with a basis 
for explaining the discrepancy we would expect 
ego to develop and maintain an unequivocal 
self-evaluation, which, according to our argu- 
ment, should be manifested in internally 
consistent self-other evaluations. 

A previous experiment (Gerard & Rotter, 
1961) found that internally consistent evalua- 
tions were held by subjects who yielded a great 
deal and by those who maintained their own 
judgment in the face of disagreement. No at- 
tempt was made to vary the subject’s relative 
ability. The present suggestion, however, is 
that accepting or rejecting the others’ judg- 
ments reflects a situation in which the subject 
attributes an ability difference between himself 
and the others whereas a lack of behavioral 
resolution, reflected by yielding only part of the 
time, reflects a situation in which the subject 
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believes himself to be about equal to the others 
in ability. 

Previous research (Festinger et al., 1952; 
Festinger & Thibaut, 1951; Gerard, 1953) has 
shown that as judgmental discrepancy in- 
creases so do tendencies toward resolving the 
discrepancy. Resolution should therefore be 
more rapid, and be accompanied both by a 
reduction of stress and an increase in evalua- 
tion consistency. 

The foregoing discussion has advanced the 
following hypotheses: 

1. When confronted by judgmental disagree- 
ment with alter, ego should experience less 
stress where an ability differential exists as 
compared to a condition where alter is pre- 
sumed to be equal in ability. 

2. When confronted by disagreement with 
alter, ego should develop and maintain greater 
self-other evaluation consistency where an 
ability differential exists as compared to a 
condition where alter is presumed to be equal 
in ability. 

3. The greater the discrepancy confronting 
ego, the more rapidly should the tendencies 
described in Hypotheses 1 and 2 become 
manifest. 


METHOD 


Subjects 


Eighty paid volunteers from Drew University 
served as subjects, Each experimental group was either 
all male or all female, the sex factor being balanced 
across the design. 


Procedure 


The stimuli developed by Asch (1951) were used. 
Each consists of a single line presented at the left of 
the subject’s visual field and three comparison lines of 
differing length presented at the right, one of which is 
equal in length to the single line. The subject’s task is to 
determine which of the comparison lines is equal in 
length to the single line. The differences among the 
comparison lines are rather gross, making the task easy. 
Two series of nine stimuli each were projected succes- 
sively on a screen 20 feet in front of the subject. For 
each stimulus the subject depressed one of three keys de- 
pending upon which comparison line he chose. His 
judgments and what he presumed to be the judgments 
of three other subjects registered on a panel of lamps 
in front of him. In actuality the others’ judgments were 
preprogramed by the experimenter such that on Trials 
1, 2, and 5 their judgments were correct whereas on the 
other six trials in each series they were unanimous in 
their choice of one of the two incorrect comparison 
lines. 
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The four subjects in each group were seated in ad 
joining cubicles and each was privately informed 
he would be identified as subject Number 3. He did no 
know the mal identity of the others whom he be 
Hend had been given the identification Numbers 1,2 
and 4. 

The sequence of events was as follows: A slide was 
projected. After 8 seconds, the subject was asked to 
make his ctioice and to keep his choice key depressed, 
The display was then turned on, The “others? judg- 
ments” were identically displayed in all four cubicles. 
After 4 seconds the display was turned off and the sub- 
ject was asked to release the key and await the next 
slide. This procedure served to confront each subject 
with the fact that he was a deviate in a group of four. 
The simultaneous presentation of the judgments pre- 
cluded the possibility of yielding. ; 

After each series of judgments the subject answered _ 
a questionnaire designed to determine the degree of 
consistency of his self-other evaluations. 


Experimental Variations 

In order to induce an ability differential the subject 
was given another judgmental task prior to the line 
judgments. The stimuli for this series were visual mazes — 
developed by Jackson and Salzstein (1958). All the 
mazes consisted of identical circular figures in which 
the same four curved lines, originating from a common 
point on the circumference, proceeded across the in- 
terior of the circle to four other points on the circum- 
ference. The subject’s task was to choose the shortest 
of the four possible paths through the interior of the 
maze from the start—the common beginning of all four 
paths—to an arrowhead variously located on the cir- 
cumference, Part of the circumference, between the 
end of the interior path and the arrowhead, was to be 
included in the estimate of length. The only difference 
among stimuli was in the placement of the arrowhead, 
We pretested 96 of these mazes on 20 subjects not in- 
cluded in our experimental sample and selected 10 that 
were judged to be difficult by most subjects. After 
judging these 10 mazes the subject was scored as to 
how many correct choices he had made, He was told 
that most people like to know how they compare with 
others in this ability. We were therefore returning his 
answer sheet with his score and the scores of the three 
others. Ten points were given for each correct choice 80 
that a perfect score was 100. In order to manipulate 
differential ability, false scores were given to the subject. 
In all cases he himself received a score of 70 but the 
scores of the others were varied in order to create the 
impression of differential ability. His own score was 
encircled in red. No indication was given as to a 
the name or identification number of those making F 
other scores. In one treatment the others scored 30, 40, 
and 50 (Others-less), in a second treatment they were 
given as 60, 70, and 80 (Others-equal), and in the is 
they were 90, 100, and 100 (Others-greater). eet. 
ject was told that the same ability is involved in 
ing the judgments that were to follow. 

In eae te examine the hypothesized effects ie 
increase in resolution tendencies, we varied the ext 
to which the others disagreed with reality on the ct 
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TABLE 1 
PERFORMANCE EVALUATIONS FOR BOTH SERIES OF JUDGMENTS 
Evaluatioa of Self Evaluation of Others Other’s Evaluation of Self 
oF _ Small _ Larg Small Large Small L 
Ability of Ores Discrepancy | Discrepancy | Discrepancy | Discrepancy Diserepancy Discrepancy 
ist | and | ist | 2nd | 1 tnd | 1 
Series Series Series Series tf Series Series sat, series a series Seis 
Less 2.53 | 2.33 | 2.20 | 2.28 | 2.47 | 2.53 | 3.20 | 3.00 | 3.07 | 3.53 | 3.64 | 3.58 
Equal 2.67 | 2.67 | 2.54 | 2.90 | 2.50 | 2.50 | 2.62 | 2.40 | 3.17 | 3.42 | 3.31 | 3.60 
Greater 3.92 | 3.75 | 3.13 | 2.80 | 2.31 | 1.92 | 2.00 | 2.47 | 3.85 | 3.62 | 3.78 | 3.78 


trials. In the Large Discrepancy treatment the others 
appeared to choose the more incorrect of the two incor- 
rect comparison lines whereas in the Small Discrepancy 
treatment they chose the one which was closer in 
length to the correct line. 


Measurement of Stress 


With the aid of a switching arrangement we recorded 
the base level skin resistance of our four subjects on a 
single recorder, the C. H. Stoelting psychogalvano- 
graph, by rotating the one instrument to sample each 
subject periodically. The subject was attached across 
the input with Scherping type finger electrodes to which 
a paste of glycerine, bentonite, and salt water had been 
applied. The rationale given the subject for using these 
electrodes was that we were interested in examining his 
physiological reactions while making the judgments. 


RESULTS 
Experimental Manipulations 


On the questionnaire administered after each 
series of judgments the subject evaluated his 
own ability and the ability of the others, and 
guessed how the others would evaluate him. 
The same six-point scale, which ranged from a 
very positive to a very negative evaluation, 
was used for each of the three ability estima- 
tions. These data are presented in Table 1. Re- 
call that the ability manipulation varied the 
scores of the others while the subject’s own 
score was fixed at 70, Under Small Discrepancy 
we find, for both series, little difference in the 
ability attributed to the others. The differ- 
ential emerges in self-evaluation (p < .01 by 
an F test), This does not obtain under Large 
Discrepancy where we see a linear effect of 
ability upon the evaluation of the others for 
the first series (p < .05), an effect that disap- 
pears, however, for the second series. We also 
note that under Large Discrepancy the linear 
effect upon self-evaluation, which was sig- 
nificant for the first series (p < .01), tends to 


be reduced. No consistent effects appear for 
the other’s evaluation of the self. 

There is only indirect evidence on the suc- 
cess of the discrepancy manipulation. On each 
questionnaire the subject was asked to indicate 
the number of judgments concerning which 
there was disagreement. The average per- 
ceived disagreement under Small Discrepancy 
was 4.59 trials as compared to 5.78 for the 
Large Discrepancy treatment (p < .001 by i). 
These figures represent the combined average 
for each of the two series. The extent of the 
disagreement on the critical trials thus ap- 
pears to have been reflected in the perceived 
frequency of disagreement. 


Siress 


Tn view of metric considerations discussed by 
Lacey and Siegal (1949) we converted our 
resistance (R) measurements into conductance 
(C) as follows: 


1 
C(umhos) = Rohm} x 10° 


Table 2 presents the change in base conduct- 
ance from the end of the maze judgments to 
the end of the first series of line judgments and 
then to the end of the second series of line 
judgments. There were no significant dif- 
ferences among treatments at the end of the 
maze judgments. In spite of a polarizing 
tendency associated with the type of electrodes 
used, conductance increases over time (p < 
.001 by a sign test for each series). The rate of 
increase, however, decreases from the first to 
the second series (p < .01 by i). The ability 
variable shows an effect in that for the first 
series the increase in conductance is greater the 
lower the ability of the others relative to the 
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TABLE 2 


CHANGE IN Base LeveL OF SKIN 
CONDUCTANCE IN MICROMHOS 


Small Discrepancy Large Discrepancy 
Ability of Others 
Relative to Self 


Ast Series | 2nd Series | 1st Series | 2nd Series 
hi Change 


Change | Change Change 

Less +3.55 | +0.48 | +3.00 | +2.47 

Equal +2.73 | +1.62 | +1.67 | +1.44 

Greater +1.97 | +1.41 | +1.38 | +1.47 
TABLE 3 


PROPORTION or Supyects Havinc BALANCED 
SELF-OTHER EVALUATIONS 
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Ability of Small Discrepancy Large Discrepancy 
nee 

elative 

to Selt | sirka | Series} d [setts acts] d 
Less -13 | .73 -00 | .46 | .50 | +.04 
Equal -50 | .67 | +.17 | .77 | .90 | +.13 
Greater | .69 | .69 -00 | .93 | .93 -00 


subject (the linear component is significant at 
beyond the .01 level by an F test). This trend 
continues for the Large Discrepancy treatment 
(p < .01) whereas under Small Discrepancy we 
note a leveling off tendency where the others 
are lower in ability (the increase is only 0.48). 
The increase for the first series tends to be 
somewhat greater under Small as compared to 
Large Discrepancy (p < .10). 


Consistency of Self-Other Evaluation 


We see from Table 1 that the subjects tended 
to select their evaluation alternatives from the 
positive end of the six-point performance scale, 
the over-all median occurring between the 
second and third scale points. We therefore 
labeled an evaluation of 1 or 2 as positive and 3 
through 6 as negative, evaluations of 5 and 6 
being very infrequent. The Cartwright and 
Harary (1956) criterion applied here would 
classify as balanced those instances in which 
all three evaluations are positive along with 
those in which one is positive and the other two 
are negative. Of the eight possible configura- 
tions (each of the three evaluations can be 
either positive or negative), four are balanced. 
The balanced evaluations are the consistent 
ones, that is, those in which an unequivocal 
self-evaluation obtains. Table 3 shows the 
proportion of balanced evaluations occurring 


in each experimental condition for both seri 
of judgments. Under Small Discrepancy 
proportion for the equal ability treatment 
somewhat Idwer than for either of the oth 
treatments (p < .20 by chi square). This d 
ference disappears after the second seri 
Under Large Discrepancy a linear tenden 
obtains. Where the others are lower in abili 
the relatively small proportion of balanced 
evaluations does not change after the secor 
series. The other two treatments becon 
equated. We also note that where there ten 
to be a greater proportion of balanced eval 
tions, the proportion tends to be somewh 
greater in the Large as compared to the Small 
Discrepancy treatment. k 

The foregoing analysis can be refined if v 
assume that where the subject made ver 
positive or very negative evaluations 
degree of consistency would be felt mot 
strongly, so to speak, than where his evalua- 
tions were more moderate. With the aid of this 
assumption, we derived the following measure 
of the degree of balance: an evaluation of 1 
was scored +2 and 2 was scored +1, reflecting 
the fact that the former was a more positive 
evaluation. An evaluation of 3 was scored —1 
and 4 through 6 were scored —2. Balance 
scores of +8, +4, +2, +1, —1, —2, —4, and 
—8 were therefore possible. These scores 
reflect, we propose, the degree to which the 
evaluations are consistent. Using this measure 
we determined the mean degree of consistency ~ 
in all conditions for both series (Table 4). The 
trends here are generally consistent with those 
in Table 3. Under Small Discrepancy we note @ 
curvilinear trend with ability (p < .10 by #) for 
the first series which appears to accentuate 
somewhat for the second series (p < .05 by Ì). Í 
Under Large Discrepancy a relatively high 
degree of consistency appears in the Others- 


TABLE 4 


MEAN DEGREE OF OVER-ALL CONSISTENCY 
AMONG SUBJECT’S EVALUATIONS 


Ability of Small Discrepancy Large Discrepancy 
elative Ist 2n a 
fo alt ak ee d ii Series 
Series | Series Series 
Less 2.27 | 2.00 | —0.27 | 0.27 | 0.17 Ey 
Equal 0.92 | 1.06 | +0.14 | 2.85 | 3.80 Ti 
Greater | 1.15 | 2.42 | 41.27] 2.70 | 4.15 5 


DISAGREEMENT WITH OTHERS 


equal and Others-greater treatments as com- 
pared with the Others-less treatment (p < .01 
by 1). This difference, too, appears accentuated 
over time (p < .001 by i). A trend which ap- 
pears to be consistent for both discrepancy 
treatments is the linear effect of ability upon 
change in the degree of balance (pe< .01 by 
F). 


Discussion 


Hypotheses 1 and 2 appear to be substan- 
tiated only under Small Discrepancy where 
curvilinear trends were noted for both the 
conductance and consistency data. During the 
first series we saw a linear effect of ability upon 
conductance for both discrepancy levels. 
Under Small Discrepancy there was evidence 
of a leveling off during the second series in both 
ability treatments where a differential existed. 
In these two treatments greater consistency of 
evaluation was also observed, although in the 
Others-less treatment a continued increase in 
consistency was not in evidence. A fact that is 
possibly important in explaining some in- 
congruence between the conductance and 
consistency data concerns the timing of these 
measures. Conductance change is based upon 
skin resistance readings taken during the 
actual confrontation whereas the consistency 
data were obtained from a questionnaire ad- 
ministered after the subject had experienced 
the disagreement. Between the end of each 
series of judgments and the time the subject 
actually made his questionnaire responses, 
time had elapsed during which he had a chance 
to reflect upon what had happened and bring 
his evaluations into line with events. Thus in 
the Small Discrepancy-Others-less condition, 
the lack of coincidence between the conduct- 
ance and consistency data might be accounted 
for by this time lapse. 

Under Large Discrepancy, which shows the 
same linear effect of ability upon conductance 
change for the first series, the same leveling 
off trend does not appear in the Others-less 
treatment. Self-evaluation consistency like- 
wise shows a linear effect for the first series that 
Is sustained over time. Perhaps the most we 
can say here is to suggest that partial support 
for Hypotheses 1 and 2 exists under Large 
Discrepancy in the comparison between the 
Others-greater and Others-equal conditions. 
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We note some support for Hypothesis 3, 
which predicts both greater relative stress 
reduction and greater evaluation consistency 
under Large Discrepancy. The evidence shows 
less increase in stress under Large Discrepancy 
and also, where relatively high consistency does 
obtain (under the Others-equal and Others- 
greater treatments), the level is higher than 
under Small Discrepancy. Again we are faced 
with the results for the Large Discrepancy- 
Others-less condition which do not coincide 
with the prediction. 

In attempting an ex post facto interpretation 
of the discrepant results, two facts assume 
crucial importance. The first is that even 
though the ability manipulation varied the 
others’ scores and held the subject’s score 
fixed, the subject did the reverse. He apparently 
used their scores as a benchmark or average 
against which to judge his own performance. 
His own score was labile relative to the others. 
We thus have a situation in which the subject 
sees the others as of average ability and him- 
self as less, more, or equally able. 

The second fact, which requires under- 
scoring, is that the subject found himself in 
disagreement with three independent judges. 
No social influence among the others was 
possible since all judgments were displayed 
simultaneously. This undoubtedly had con- 
siderable impact upon the subject and would 
tend to reduce the credibility of his own judg- 
ment. 

What are the implications of these cir- 
cumstances? We can assume that the degree to 
which the others’ judgments were compelling 
was approximately the same for the three 
ability levels. The variable factor was the 
amount of credibility the subject could attach 
to his own choice. Seen in this light, the results 
may reflect increasing severity of a post- 
decisional predicament in which the subject 
must reconcile the disagreement with knowl- 
edge he has about himself. The more difficult 
it is for him to lower his commitment to the 
choice he had made, the greater the stress he is 
likely to experience and the more likely should 
his self-evaluation remain in question. These 
highly speculative remarks may help to under- 
stand the effects observed under Large Dis- 
crepancy. Under Small Discrepancy the sub- 
ject may have been able to discount the others 
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relative to himself since there was greater room 
for doubt as to who was correct. Some weak 
support for this supposition appears if, in 
Table 1, we examine changes in evaluations 
from the first series to the second series. A 
slight, but statistically insignificant trend 
exists for the subject under Small Discrepancy 
to increase his self-evaluation relative to the 
others over time whereas the reverse obtains 
under Large Discrepancy. 

In line with these remarks we find in Table 4 
a linear trend in both discrepancy treatments 
of the effect of ability upon change in degree of 
consistency, a finding that would suggest that 
the same predicament occurred to some extent 
with increasing self-evaluation in both treat- 
ments. 

This attempt at interpreting the unpredicted 
findings must be taken as suggestive. Research 
is in progress that studies, among other things, 
the effects of making the others’ judgments 
truly labile, of varying the impact of the others’ 
judgment, and of varying the time at which 
consistency measures are administered. In this 
research continuous skin resistance records are 
being made for each subject. 


SUMMARY 


In a setting where the subject was con- 
fronted with disagreement from three other 
subjects, we varied both the degree to which he 
was in disagreement with them and his relative 
ability to make the visual judgments in ques- 
tion. He was confronted with two series of nine 
judgments each and found himself in disagree- 
ment with the others on six of the nine trials. 
We determined, after each series, the internal 
consistency of his self-evaluation, his evalua- 
tion of the others, and the evaluation he 
guessed they had made of him. We also 
measured the stress the subject experienced as 
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reflected by changes in his skin conductance, 
Where there was a clear differential in ability, 
the subject evidenced both a relative dimin 
tion of stres$ and greater relative consistency 
of self-other evaluation. This relationship held 
where he was in mild disagreement with the 
others. Where the disagreement was large, the 
greater his ability relative to the others, the f 
more stress did he experience and the less 
consistent were his self-other attitudes, An 
interpretation for this finding is offered. ' 
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ONSIDERABLE attention has Been de- 
voted in recent years to the study of 
conformity. In the typical experiment 

the subject is confronted with a conflict in- 
duced by finding that his judgment concerning 
some object or event is different from the 
judgments of one or more other subjects. 
Interest has focused upon factors that tend to 
produce either independence or yielding. 
Subjects in these experiments are generally 
strangers to one another and assume that their 
association will terminate at the end of the 
experimental session. What would be the effect 
of projecting their association with each other 
into the future, beyond the one experimental 
session? What is the nature of the expectations 
that a person holds in regard to the other(s) 
when he anticipates future association? 

The other may serve the person as a referent 


` against which he may “check” his judgments 


and opinions. To the extent that he attributes 
to the other the skills or capacities necessary to 
make a judgment or to hold an opinion, the 
individual may rely upon him as a source of 
information about the object or event under 
consideration. 

The other may also serve to mediate other 
satisfactions for him. In the case of institu- 
tionalized associations such as marriage, 
employment, schooling, and club membership, 
some of the satisfactions may be easily identi- 
fied. Often, however, the bases for interde- 
pendence defy easy and immediate discovery. 
Tn casual human intercourse, the participants 
are continually appealing to each other’s needs. 
The continuation of a given relationship will 
depend upon the extent to which the needs of 
the persons concerned are being met as com- 
pared with their respective estimates as to the 
Possibility for meeting these needs via other 
activities, We have each learned that through 
association with others we do realize satisfac- 
tions of one kind or another. Thus, to the ex- 
tent that a person projects an association with 
another into the future, he should, in general, 
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anticipate dependence upon him. Disagree- 
ment between two individuals concerning 
something that each regards as important can 
be a threat to their continuing relationship. 
Indeed, there is a tendency over time for a 
person to ignore another whose opinion is very 
different from his own (Festinger, Gerard, 
Hymovitch, Kelley, & Raven, 1952; Schach- 
ter, 1955). In the face of disagreement we can 
expect a person to experience a greater force to 
resolve the disagreement, the longer the 
projected future association between them. 

The nature of the projected future associa- 

tion should affect the individual’s present 
disposition toward the other. If he expects their 
future association to be concerned with subject 
matter that is the same or similar to the subject 
matter with which they are currently con- 
cerned, he should estimate that the other’s 
present and future judgments should be highly 
correlated. If their present judgments disagree 
and he expects the same activity to continue, 
he should anticipate future disagreement as 
well. In general, to the extent that the future 
holds similar subject matter in store, he should 
anticipate that present disagreement presages 
future disagreement. Given that he does not 
reject the other as a source of information, the 
more continuous the future (as regards subject 
matter) the more will the individual try to 
resolve any discrepancy of judgment that may 
arise. 
Tn summary, the longer a person’s projected 
future with another and the more similar he 
expects the future activities to be to present 
ones, the more readily should he attempt to 
adjust to present discrepancies of judgment 
with the other. 

In addition to the effects of future perspec- 
tive, the present experiment was also con- 
cerned with studying the relationship between 
the subject’s self-other evaluations and his 
tendency to yield to the others’ judgments. Let 
us assume that his evaluation of the other(s) 
conditions the subject’s acceptance or rejection 
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of their opinions and evaluations, We can, with 
the aid of this assumption, designate as con- 
sistent or inconsistent the set of evaluations 
that includes the subject’s evaluation of him- 
self, his evaluation of the other, and the evalua- 
tion he believes the other has of him. To be 
consistent his self-evaluation must coincide 
with the evaluation he sees reflected by the 
other, this reflected evaluation being composed 
of two components. One component is the 
subject’s evaluation of the other, that should 
condition the subject’s acceptance of the 
other’s evaluation of him, and the evaluation 
he believes the other has of him (cf. Gerard, 
1961, for a more detailed discussion of direct 
and reflected self-evaluation). The Heider 
(1958) criterion of balance applies here. Of the 
eight possible configurations (each of the three 
evaluations can be either positive or negative), 
four are balanced or consistent and four are 
unbalanced or inconsistent. We would expect 
that a person who yields consistently or main- 
tains his independence consistently should tend 
to be consistent in his self-evaluation, If a 
sequence of similar decisions (to yield or not to 
yield) is made, the process should be circular 
and cumulative. That is, consistency of évalua- 
tion should condition the decision which, in 
turn, should tend to produce greater consist- 
ency of evaluation, 


METHOD 


Subjects 


Ninety-six Rutgers University undergraduates were 
used in the experiment as paid volunteers, Both men 
and women took part, the design being balanced for 
sex. In any given experimental group only men or only 
women were run. 


General Setting 


The present experiment employed a modified version 
of the so-called Asch (1951) situation. Subjects were 
tested four at a time and were seated in adjoining 
cubicles visually isolating them from each other. A 
slide consisting of a single line on the left and three 
comparison lines on the right, each of a different length, 
was projected on a screen 20 feet away. The subject’s 
task was to choose from among the three comparison 
lines the one most nearly equal in length to the line on 
the left. One of the three lines was actually equal to 
the single line. On the subject’s desk was a small en- 
velope that contained a card with the subject’s identifi- 
cation number on it. The experimenter asked the sub- 
ject to look at the card and informed him that the 
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was Subject #3 and he assumed that the responses he 
saw on his panel had been made by the other three 
subjects. Actually the responses of the others (1, 2, and 
4) were fictitious and appeared in a preprogramed se- 
quence such that “the others” were always in unani- 
mous agreement and in disagreement with the correct 
comparison line on six of the nine trials, They “chose”! 
the correct line on Trials 1, 2, and 5. 

The subject received his instructions over a headset, 
The entire sequence of events during a series was elec- 
tronically controlled by tones which were recorded on 
tape along with the verbal instructions. These tones, 
which were not heard by the subject, were used to 
operate the slide projector and to display each false 
judgment at the proper time. The subject’s choices 
were automatically recorded in the form of tones on a 
second tape recorder. 


Experimental Variations 


When the subject signed up for the experiment, he 
volunteered to take part in four sessions on four differ- 
ent days. This four-session perspective was used as a 
basis for inducing different expectations about the 
future, 

In order to induce a perspective of continued asso- 
ciation with the others who were present, half of the 
groups were told that the same individuals would be 
together during all four sessions. The other half of the 
groups were told that the group composition would 
undoubtedly change from session to session. b 

The expectation of continuity of activity during all 
four sessions was induced by informing the group that 
we were concerned only with visual discrimination, 
and each of the sessions would therefore require similar 
judgments, The first session would concern estimating 
the length of lines, the second the size of areas T 
geometric figures, the third estimation of the length of 
curved lines, and the fourth estimates of the length “ 
figures that had been distorted slightly. The experi- 
menter then presented illustrations of all four types 
of judgment. To induce expectations of varied activities, 
the group was told that we were interested in various 
kinds of discrimination. The first session would one 
Comparing the length of lines, the second the qual 
of odors, the third lifted weights, and the fourth ti 
length of musical passages. Illustrations of the first three 
types of judgment were presented. 


_— 


TIME PERSPECTIVE, CONSISTENCY, AND INFLUENCE 


The assignment of subjects was balanced in a 2 X 2 
factorial design with 24 subjects in each of the four 
treatments: continued membership-continued task 
(MT), continued membership-discontinued task (MT), 
discontinued membership-continued task (MT), and 
discontinued membership-discontinued task (MT). 


RESULTS AND Discussion 
e 


We examine first the number of errors made 
by the subject in each treatment as an indica- 
tion of a willingness to adjust to the dis- 
crepancy in favor of the others, then turn our 
attention to the attitudinal data. 


Analysis of Conforming Errors 


We predicted that the anticipation of con- 
tinued future membership would engender 
dependency and, in the face of disagreement, 
greater willingness to yield to others; the 
tendency should be heightened with the 
anticipation of a continuous task. Table 1 
presents the average number of yielding 
responses on each of the four series. An exami- 
nation of the data for the effects of membership 
and task continuity does not present a simple 
picture. Over all 36 trials we observe a slight, 
statistically insignificant trend opposite to the 
predicted effect of membership perspective, 
(M vs. M), whereas the effect of task perspec- 
tive (T vs. T) shows no trend. 

Interesting details emerge, however, when 
the tendency to yield over time is examined. 
The M treatment showed little change over 
time, albeit minor fluctuations did occur from 
series to series. The M treatment presents 
quite a different picture. An initially high level 
of yielding was maintained for two series, but 
began to drop off during the third series, and 
dropped sharply at theend (p < .01 by # for the 
difference between Series I and IV). A plausible 
Interpretation of these results is based on the 
assumption that the short run perspective 
Produces a sense of urgency. Since there is only 
a limited time during which the sharp dis- 
crepancy in viewpoint can be resolved, the 
Subject displays an all-out initial effort to 
Compromise. When this fails after repeated 
tries, he comes to realize that the others will 
Continue to disagree. The subject therefore 
begins to revert to his initial private judgments. 
The very sharp drop after the third series sug- 
gests that if the subject anticipated a fifth and 
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TABLE 1 


JupGMENTAL Errors as AFFECTED BY TASK AND 
MEMBERSHIP CONTINUITY AND SEATING 


ARRANGEMENT 
Series 

Variable* 

I 1 m IV Total 
M 1.69 | 1.52 | 1.60 | 1.56 | 6.37 
M 1.98 98 | 1.79 | 1.44 | 7.19 
T 1,92 | 1.83 | 1.69 | 1.46 | 6.90 
a 1.75 | 1.67 | 1.71 | 1.54 | 6.67 
MT 1.83 | 1.75 | 1.67 | 1.63 | 6.88 
MT 2.00 | 1.92 | 1.71 | 1.29 | 6.92 
MT 1.54 | 1.29 | 1.54 | 1.50 | 5.87 
MT 1.96 | 2.04 | 1.88 | 1.58 | 7.46 
Seat 1 1.33 | 1.42 | 1.08 | 1.08 | 4,91 
Seat 2 1.42 | 1.83 | 1.67 | 1.13 | 6.05 
Seat 3 2.04 | 2.04 | 1.88 | 1.79 | 7.75 
Seat 4 2.54 | 1.71 | 2.17 | 2.00 | 8.42 


® M = Continued membership, M = Discontinued membership; 
T = Continued task, T = Discontinued task. 


a sixth series, his tendency to yield would 
probably continue to decrease rapidly. For the 
M treatment, however, we would expect little 
change. 

A similar trend appears as a function of task 
continuity. However, the relationship is 
reversed. The T treatment showed a somewhat 
higher initial level than the T treatment, and 
then dropped off at an increasing rate from 
series to series. The T treatment, on the other 
hand, fluctuated from series to series with 
some tendency toward an over-all decrease. 
The initially higher level for the T treatment 
conforms to our expectation. The sharp de- 
crease, however, may reflect the subject’s 
reaction to an unbending majority (p < .01 
by # for the difference between Series I and 
IV). An initial willingness to compromise with 
no evidence of reciprocation may have re- 
sulted in a strong tendency to reject the infor- 
mation proffered by the others. Also, given 
that tasks similar to the present one are antici- 
pated in the future, it is important to establish 
initially a stable frame of reference so that the 
future tasks may be performed adequately. 
Since disagreement with the others was so 
marked, the subject tended to rely upon his 
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own visual sense impressions rather than those 
of the others. Had the disagreement been less 
pronounced, reliance upon the others rather 
than upon the self might have been enhanced, 
given a continuing task perspective. 

We observe the sharpest decrease in errors 
over time in the MT treatment (p < .01 by F 
for the linear trend), which results, we have 
suggested, from the initial urgency of a short- 
term time perspective and subsequent rejec- 
tion after unsuccessful overtures combined 
with the need to establish a stable visual frame 
of reference since the future presages similar 
judgments. Comparison with the MT treat- 
ment suggests the operation of constraints 
against rejection since the individual probably 
anticipates at least partial reliance upon the 
others as future sources of information. In 
comparing the MT and MT treatments, we 
again observe the differential decrease due to 
the membership variable. Both trends, how- 
ever, are tempered by the effects of the antici- 
pation of discontinuing the activity. 

These results were, for the most part, un- 
anticipated and the foregoing discussion must 
therefore be regarded as tenuous speculation as 
to the processes which underlie them, 

An unanticipated and puzzling effect of the 
seating arrangement also occurred. A progres- 
sive increase in yielding occurs with seat num- 
ber (p < .01 by F for the linear trend), There 
is a strong difference over all 36 trials and also 
over time from series to series, with the excep- 
tion of Series II. Since there was a free choice 
of seat as the subjects entered the room, there 
may have been a personality predisposition in 
the choice of seat. The physical arrangement of 
the cubicles may have, in and of itself, pro- 
duced the difference irrespective of possible 
self-selection. Cubicle 1 was closest to the door. 
The person in that cubicle, therefore, may have 
felt that he had easy access to an escape route 
so to speak. The person in Cubicle 4, on the 
other hand, had all three of the others between 

himself and possible escape. The greater the 
number of people between the subject and the 
door, the more “hemmed-in” might he have 
felt. The present data do not permit checking 
this interpretation. 


Consistency of Evaluation and Yielding 


After each series the subject answered a 
questionnaire concerning his reactions. The 
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1. Based on the previous nine 
your ability tp make these visual di 
the number opposite the most appro 
2. Based on the previous nine ju dg 
the ability of the others (on the average) 
discriminations. Circle one. 

3. Try to predict the average eva 
will make of your performance on the 
judgments. Circle one. 


Each item was followed by the 
point rating scale: 


1. Very satisfactory (+3) 

2. Quite satisfactory (+2) 

3. Somewhat satisfactory (+1) 
4. Somewhat unsatisfactory (-1) 
5. Quite unsatisfactory (—2) 

6. Very unsatisfactory (—3) 


The subject’s responses were v 
+3 to —3 to reflect the direction 
of the evaluation. For each subj 
questionnaire, we computed the | 
balance of his evaluation by mi 
signed values for each item: Thus, 
of balance in a triad of evaluations 
+2, —1, and —2 is +4. This procedure 
10 possible numerical values with 
positive or negative sign: 1, 2, 3, 4, 6, 
18, 27. The interval between adj 
becomes larger as the values increas j 
there is little basis for believing thi 
tervals between large values are ps 
cally larger. Increasing the value of 
bonds by some amount should 
the same effect regardless of the 
balance. In order to approximate 
affairs we performed a rank transform 
the balance values which we then ti 

metric scale assuming equal intel 
values, both positive and nega 


TABLE 2 
Mean DEGREE or Consist 


D 
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signed ranks from 1 to 10 in order of increasing 
magnitude. For example, a value of +18 now 
becomes +9 and a value of —27 becomes — 10, 

In examining balance tendencies over time 
we used the expedient of classifying the sub- 
jects as to the total number of errors for all four 
series. The maximum number of érrors is 24 
since there are six critical trials for each of the 
four series. Initially we examined the balance 
data on a series-by-series error rate, but the 
evaluation of differences became unwieldy. 
We therefore relied upon total errors as the 
basis for classification in order to obviate the 
difficulty encountered with those subjects whose 
error rate varied a gréat deal from series to 
series. The trends resulting from classification 
by total errors are in all essential respects 
similar to those based on a series-by-series 
error classification. 

Table 2 presents the average degree of 
balance, as determined from the questionnaire 
items, by total errors and series. The error 
categories were chosen in order to obtain ap- 
proximately equal numbers of subjects in each. 
We predicted that the subject would tend to 
reduce inconsistency over time. This is clearly 
evident in the column totals for each series, 
The variance due to series is significant at the 
10 level; the mean square due to the linear 
component of the three degrees of freedom 
(n) for series is significant at beyond the .02 
evel, 

There are certain obvious shortcomings of 
the balance metric used in the foregoing 
analysis. To what extent does a linear ordering 
of the 10 values violate the psychology of the 
situation? For example, does a +2, +2, +2 
configuration possess two units more of balance 
than a +1, +2, +3 configuration? Offhand, 
this does not seem reasonable. Our metric no 
doubt does considerable violence to the data. 
More probably the 10 states represent some 
partial order. Short of a detailed investigation 
of the psychological space represented by these 
10 configurations we relied upon our rough and 
ready metric. We decided, however, to use 
another, less powerful evaluation which makes 
fewer assumptions. We determined the propor- 
tion of consistent evaluation triads (those hav- 
ing a plus [+] value) for Series I and Series IV. 

proportion for Series I is .62 (60/96) 
Whereas for Series IV it is .80 (77/96) (p < .05 
by chi square). We can accept this difference 
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TABLE 3 
Nomper or Sunyects ix Eacu ov Tue PossmLe 
PATTERNS OF CONSISTENCY AND INCONSIST- 
ENCY IN SELP-OTHER EVALUATIONS 
Series 
oe ES ei V T 
I u m Iv R 
DE Aea ae E E E L E Ma A 

1 A 7 = - 13 
2 = ee = + 11 
3 a + - 4 
4 = - + + 6 
5 al + - - 2 
SMM, i hbo All = yey ote 0 
7 a iF + = 0 
8 - + + + 12 
DE nE Aj 0 
10 + - - + 1 
BM Ar icine Yank chet | uncee 2 
12 + - + + 3 
13 + + > - 1 
Te es Te i 3 
15 A + + - 1 
16 + F + + 37 


as adding some conviction to the conclusion 
reached via the detailed information offered by 
our questionable metric, 

Our other prediction is also confirmed. The 
tendency toward balance in self-other attitudes 
is associated with behavior evidencing minimal 
conflict. Let us consider a subject with 0-1 
errors as a consistent nonyielder, a subject with 
2-4 errors as somewhat nonyielding, a subject 
with 5-10 errors as somewhat yielding, and a 
subject with 11-24 errors as yielding. For the 
two extreme categories, where there is pre- 
sumably a strong commitment to the behavior, 
consistency tends to be greater than in the two 
middle categories. The variance due to errors 
is significant at beyond the .05 level. The 
quadratic component of the 3 df involved is 
significant at beyond the .02 level indicating 
the trend we predicted. 

Again we performed the simpler analysis 
based upon the proportion of balanced triads. 
The two extreme categories (0-1 and 11-24) 
were combined and compared with the two 
middle categories (2-4 and 5-10). This dual 
classification represents a difference in the de- 
gree of commitment to behavior reflecting, we 
have assumed, a difference in conflict, The 
proportion of consistent triads over all four 
series in the extreme categories is .78 (150/192) 
and in the middle categories is .66 (127/192) 
(p < .05 by chi square). This difference over 
the four series reflects a consistent series-by- 
series trend. 
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TABLE 4 


CHANGES IN EVALUATION COMPONENTS FOR SUBJECTS WHO INCREASE CONSISTENCY 
FROM SERIES I To IV By Error CATEGORY 
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= Self-Evaluation Evaluation of Others Others Evaluation of Self 
hte Increase Decrease Same Increase Decrease Same Increase Decrease Same _ 
0-1 1 2 7 1 9 0 2 7 1 
24 8 3 2 4 6 3 8 2 3 
5-10 6 1 6 3 2 8 6 2 5 

11-24 6 4 3 3 3 7 7 3 3 


The increase in self-evaluation consistency 
can be seen more vividly with another type of 
analysis. Table 3 presents all 16 possible 
balance patterns over the four series. The first 
8 patterns, which show initial imbalance, tend 
to become balanced. This effect can be tested 
by comparing the cell frequencies in the column 
at the far right for Patterns 2,4, and 8, which 
are those that show a change toward balance, 
with the frequencies for Patterns 1,3,5,6,and 7 
(p < .001 by a binomial summation test). We 
obtain this large difference in spite of having 
included those subjects in Pattern 1 who may 
have changed to a balanced configuration had 
the experiment continued. The table also shows 
that those subjects who start out with a con- 
sistent self-evaluation (Patterns 9 through 16) 
tend to remain consistent (p < .001). Those 
few subjects who do change to an inconsistent 
configuration tend to change back again, Out 
of a total of 48 subjects who start out as con- 
sistent only 4 are inconsistent on Series IV. 

One may inquire as to changes occurring 
over time in each of the three evaluation 
components for those subjects who show an 
increase in consistency. These data are shown 
in Table 4. The frequencies indicated are for 
an increase, a decrease, or no change in each 
of the evaluations in comparing Series I to 
Series IV for subjects who fall into the four 
error categories. The data for the subjects who 
did not change or who became less consistent 
are not shown. When the subject remained 
steadfast (0-1 category) he was clearly dif- 
ferent from the others in his mode of in- 
creasing consistency. He tended to decrease 
both his evaluation of the others and the 
evaluation he believed the others had of him. 
This incidentally is coupled with an initial 
high self-evaluation. The other three categories, 


which are similar to each other, present a 
different picture. Here we see that the sub- 
ject’s mode of increasing balance is to increase 
both self-evaluation and the evaluation he 
assumes the other has of him. 

We were also interested in determining the 
extent to which acceptance or rejection, ie,, 
degree of yielding, evidenced a diminution of 
conflict over time. There appears to be a 
progressive increase from series-to-series in the 
number of subjects who made either six errors 
or no errors, a trend that suggests that conflict 
did tend to decrease. To evaluate this change 
we compared the proportion of subjects who 
made six errors or none on Series I and on 
Series IV. The proportion on Series I is .29 
(28/96) whereas on Series IV it is .47 (45/96) 
(p < .02 by chi square). We also compared for 
the first and last series, the proportion of sub- 
jects who made 0, 1, 5, or 6 errors. These pro- | 
portions are .51 (49/96) and .75 (72/ 96), 
respectively, (p < .01 by chi square). This 
would seem ample evidence of conflict resolu- 
tion over time. Table 5 shows the complete 
picture of series-by-series errors. 


Questionnaire Evidence of Conflict 


We made an effort to obtain the subjects 
report of the degree of conflict he experience 
during each successive series. We asked the 
subject to indicate on each questionnaire the 
number of judgments in the previous nine 0n 
which he believed that he experienced T 
difficulty than the others and the number 0 
these judgments on which the average person 
would be perturbed. Neither question is direct 
but we believed that the subject would be less 
defensive in answering these indirect questions 
The difficulty item shows a curvilinear tren 
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TABLE 5 


NUMBER OF SUBJECTS IN EACH Error CATEGORY FOR 
EACH SERIES OF JUDGMENTS 


——— 
Series 0 1 2 3 4 5 6 
I 26 17 22S 10, ae. 
Ii 33 16 18 T E a ss) 
Tit 3 21 14 14 44 D 
IV 38 23 12 10 2 Seay 


which tends to mirror the consistency data. 
The responses were summed over the four 
series as were the errors. The trend is roughly 
summarized if we categorize total errors into 
our 0-1, 2-4, 5-10, 11-24 categories and 
examine the proportion of subjects in each 
category who claim to have had more difficulty 
on 10 or more judgments (of 36). The propor- 
tions are .17 for 0-1 errors, .36 for 2-4 errors, 
.63 for 5-10 errors, and .48 for 11-24 errors. 
We note a sharp increase through the 5-10 
category followed by an abrupt decrease. The 
two successive increases are quite strong (p < 
01 for each by chi square). The decrease does 
not quite approach significance. However, if 
the decrease were evaluated against an extrap- 
olated trend, the difference would be quite 
marked. 

On the intended “projective” question, “on 
how many of the previous nine slides would 
the average person be perturbed,” no such 
trend appeared. The data are arrayed in a tri- 
angular fashion. As the number of errors in- 
crease, the range of the number of projected 
perturbed responses tends to narrow. Trichot- 
omizing the total number of perturbed re- 
Sponses into 0-7, 8-16, and 17-36, produces 
approximately equal ’s in each category. If 
we examine the proportion of subjects falling 
into the middle category, i.e., having 8-16 
Perturbed responses, for the 0-1, 2-4, 5-10, 
11-24 error categories these proportions are, 
Tespectively, .30, .46, .54, and .60. We note 
that the Proportion doubles between the 0-1 
and 11-24 error categories (p < .05 by chi 
Square). It appears that among those subjects 
Who did not yield were the most calm and most 
perturbed individuals. 

These two questionnaire items do not ap- 
pear to reflect the same thing. Projected 
conflict and estimated difficulty may not 
necessarily be highly correlated. Furthermore 
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it is impossible at this point to know if the 
subject’s response referred to the initial 
difficulty of the judgment, diffculty in retro- 
spect after having made his choice, or a denial 
in an attempt to achieve congruence with his 
choice. 

The perturbance question introduces the 
further contamination of any projective item: 
to what extent is the subject projecting his 
own psychological state and to what extent is 
he imagining how someone else would indeed 
react? To what extent does he consider him- 
self average and in what sense does he interpret 
“average”? 

A further distressing problem is created by 
the skewed distribution of judgment errors. In 
balancing the data for category comparison 
we required wider categories with increasing 
errors. The minimum error category is 1 error 
wide whereas the maximum category is 14 
errors wide! Offhand, we would expect greater 
heterogeneity the wider the category. The 
perturbance data suggest the opposite. There 
is also a slight statistically insignificant trend 
for the 0-1 category to be most heterogeneous 
with respect to degree of consistency. Any 
reconciliation of this inverse relationship is 
beyond the present data. 

The consistency and error data present a 
fairly clear-cut case for our hypotheses. The 
questionnaire data, if we reluctantly accept 
them at face value, suggest complexities that 
go much beyond the present data. We are 
currently pursuing a number of these problems. 


SUMMARY 


We explored some effects of future perspec- 
tive with respect to group memberships upon 
the tendency of an individual to conform to the 
judgments of other group members when con- 
fronted with disagreement with them. Expec- 
tations were induced of future tasks that would 
be similar or dissimilar to the present one. 
Expectations of either continuing or non- 
continuing group membership were also 
manipulated. There was a tendency to reject 
the others’ judgments under both the anticipa- 
tion of a continuous task and of noncon- 
tinuing membership. 

We also found that the strength of the 
jndividual’s tendency to yield to the others or 
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FAMILIARITY AND THE ORDER OF PRESENTATION 
OF PERSUASIVE COMMUNICATIONS: 


ROBERT E. LANA 
American University 


ARL Hovland and his associates (Hov- 
land, 1957; Hovland, Janis, & 
Kelley, 1953) have developed a pro- 

gram of research in the field of communication 
and persuasion during the last few years that 
represents the most comprehensive experi- 
mental and theoretical work in this field to 
date. One of the most recent books by Hovland, 
The Order of Presentation in Persuasion, in- 
cludes a compendium of research examining 
various methodological and conceptual ques- 
tions concerning the position of various types 
of affirmative and negative arguments and 
their individual and combined effects on 
attitude and opinion change. One of the 
persisting problems which Hovland recognized, 
but did not directly examine, is that con- 
cerning previous familiarity of the subject with 
the topic of communication and the cor- 
responding effect of the order of presentation 
of conflicting arguments on attitude or opinion 
change. Two early key studies by Lund (1925) 
and Cromwell (1950) indicated that neither a 
primacy nor a recency effect is immediately 
evident in the situation where two communica- 
tions on a single topic, one affirmative and the 
other negative, are presented successively. 
Primacy refers to the situation where the 
argument given first, whether pro or con, is 
more effective in changing opinion in the direc- 
tion of the argument. Recency refers to a 
Similar effectiveness of the argument pre- 
Sented second. In Lund’s study arguments 
Were presented successively for and against an 
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issue, affirmative arguments first for half the 
group, and negative arguments first for the 
other half. Pretest questionnaires were filled 
out before the first communication and again 
after the first and second communications. 
The change resulting from the first communica- 
tion was greater than that resulting from the 
second. This was described by Lund as the law 
of primacy in communication. Cromwell later 
reported directly opposite results where 
speeches presented in the second position had 
a greater influence on the attitude of the 
listeners than those presented first. In this 
latter study the effects were evaluated after 
the two sides had been presented. In the Lund 
study effects were evaluated after each side 
had been presented. In replicating both of 
these studies Hovland and Mandell found no 
consistent primacy or recency effects. Among 
the various experiments discussed by Hovland 
(1957), attempting to determine the various 
factors influencing primacy and recency effects, 
none deal directly with the problem of previous 
familiarity of the subjects with the topic of the 
communication. However, Hovland and his 
coauthors recognized the need for research in 
this direction. In the summary of their book, 
they state, “The results of the present series of 
studies seem to suggest that the nearer one 
comes to achieving primacy in the sense of the 
first presentation of unfamiliar material, the 
more apt one is to obtain primacy effects” (p. 
139). Also, “Undoubtedly further research 
bearing directly on the relationship between 
prior familiarity with the topic under discus- 
sion and the relative effectiveness of the side of 
the issue presented first would greatly help to 
clarify the extent of generality of a law of 
imacy” (p. 140). 
ry is a report of research designed to 
investigate the effects of the order of presenta- 
tion of a pro and con communication on 
opinion change of adult subjects who have been 
differentially familiarized with the topic of the 
communications. The greatest difficulty in the 
construction of the experimental design was 
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the necessity of insuring minimal initial 
familiarity of the subjects with the topic 
utilized. Since college students were to act as 
subjects, the opinion areas such as ethnic 
prejudice, political preference, etc. usually 
tapped in studies such as this one could not be 
used since all subjects would most likely be 
familiar with these topics a certain large but 
indeterminable degree. What was needed was a 
topic about which the subjects had little 
information and had formed little previous 
opinion. This topical area would also have to be 
uncomplicated enough so that the subjects 
could easily form an opinion when confronted 
with conflicting arguments relevant to it. In 
other words, topics such as those concerned 
with technical problems in constructing an 
H-bomb, etc. could not be used. With this 
consideration, the topic of animal vivisection 
was chosen as the opinion area to be used in 
this study because it is a topic that is only 
rarely considered in public discussions through 
the mass media (at least at present). It is a 
simpler matter to manipulate subjects’ famil- 
iarity toward vivisection than it would be if a 
more popular or controversial topic were used. 
That is, initial familiarity toward vivisection 
should be close to zero for most people. An 
attempt to examine this assumption was made 
in the study. 


METHOD 


Materials 


Two questions, “Do you know what the word vivi- 
section means?” and “If you do describe its meaning 
in one or two sentences,” were duplicated on sheets of 
paper and all subjects were asked to respond to them. A 
reliable 10-item Likert-type vivisection questionnaire, 
developed by Molnar (1955), was employed as the 
pretest and posttest in order to evaluate opinion toward 


TABLE 1 
EXPERIMENTAL DESIGN 
Ai Ar As 
Two questions Two questions Two questions 
Pretest Pretest Pretest 
Long familiarization | Short familiarization 
talk and discussion | talk 
12 days 12 days 12 days 
Bı B: Bı B: Bı B: 
Pro Con Pro Con Pro Con 
Con Pro Con Pro Con Pro 
Posttest Posttest Posttest 
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TABLE 2 


SUMMARY OF PRETEST AND POSTTEST MEAN Sco: 
AND DIFFERENCE SCORES 


Sizi: As As 
(Very familiar) (Familiar) (Unfamiliar) 
Score 
Bi B: Bı B: Bı 
ro | (Con | (Pro | (Con | (Pro 
1st) Ast) Ast) Ist) Ast) 
Pre 35.08 | 35.90 | 35.88 | 36.00 | 36.73 
Post 38.42 | 37.50 | 38.31 | 37.56 | 37.53 
Gains 3.34 1.60 2.43 1.56 . 
(N) (12) (10) (16) | (16) | (15) 
Difference +1.74 +.87 
of difer- (primacy) (primacy) 
ences 


vivisection along a continuum of favorability. Two in- 
formational standardized talks on vivisection, with no 
direct positive or negative statements about the prac- 
tice, were administered to two of the groups. The first 
talk was 213 words in length and the second 92 words in 
length. The amount of information presented was 
roughly proportional to the number of words in each 
talk. A pro and a con argument, both of approximately 
500 words length, concerning the topic of vivisection 
were used as the persuasive materials in the study. 
These arguments were developed by Molnar (1955) for 
use in another context and take 4 minutes to present. 
These same materials were used by the investigator to 
examine the related pretest-treatment interaction ef- 
fect (Lana, 1959) and have proved to be excellent 
devices for that purpose. 


Experimental Design 

A total of 85 undergraduates in three Psychology I 
classes at the American University served as subjects 
in the experiment. The subjects were randomly assigned 
to one of six treatment groups. All subjects were asked 
to answer the two questions concerning the nature 
vivisection described above on a sheet of paper. After 
these papers were collected all subjects responded to 
the 10-item Vivisection Questionnaire which was 4 
collected. Group Ai was then read the long familiariza- 
tion talk mentioned above and discussed the nature of 
vivisection with the experimenter until questions an 
comments from the class stopped. The discussion lasted 
approximately 10 minutes. The experimenter carefully 
avoided making any value judgments about the topic 
during the discussion. The group was then di: a 
Twelve days later this group was subdivided into two 
groups, one of which (Bı) listened to the pro communi- 
cation and then, immediately following, the con one 
munication. The other subgroup (Bz) listened to 
two statements in reverse order, first con and then ee 
Both were then posttested with the identical questio 
naire that was used as the pretest. Group As h 
answered the initial questions and responded to i 
questionnaire for the first time, listened to the shor! 4 
familiarization talk immediately afterward but Mis 
allowed no discussion. Twelve days later the group Wi 
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TABLE 3 
RESULTS or ¢ Tests 
Groups 
A Ar Ai 
Difference of differ- +1.74 +.87 —1.51 
ences 
is 2.03 1.44 2.11 
? <.05 >.05¢ <.05 


®* Combined error term with 79 df utilized in ¢ tests. 


subdivided into two groups (Bı and Bs) and followed 
the same procedure as Group A. Group A; answered 
the questions and filled out the questionnaire only, 
Twelve days later the same procedure utilized with 
Groups Ai and A: was carried out. The design is sum- 
marized in Table 1. 


RESULTS 


The responses to the two questions on the 
meaning of the term “vivisection” indicated 
that 95% of all subjects have little or no idea 
what the term means. Since the term vivisec- 
tion has sometimes been used to imply a nega- 
tive value judgment in occasional newspaper 
campaigns against the practice of surgical 
operation on live animals, it is probably not an 
unfamiliar term to individuals who are aware 
of the phenomena about which it refers. It is 
thus concluded that the large majority of 
subjects used in this experiment knew very 
little about the topic of vivisection and hence 
Probably had formed, at most, only mild 
opinions about the matter. The experimental 
results were analyzed using a subtractive 
procedure described by Hovland and Mandell 
(Hovland, 1957). Pretest means for all groups 
are subtracted from their respective posttest 
mean scores. Each B, difference score is then 
subtracted from its appropriate Bə score 
yielding three “difference between differences” 
Scores, one for each of the three A groups. 
These final difference scores were then tested 
against the null hypothesis by utilizing a £ test. 
If the mean differences are positive then a 
Primacy effect has been demonstrated (if 
Statistically significant). If the mean dif- 
ferences are negative, a recency effect has been 
shown (if statistically significant). A summary 
of the mean pretest and posttest scores for ail 
groups and a summary of the analytic proce- 
dure are included in Table 2. 

A Bartlett’s test for homogeneity of variance 
Was run on all scores and x? was not significant 
at the .05 level. It was concluded that the 
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Scores were homogeneous with respect to 
variance. A combined error term for the 
posttest scores was then calculated and 
employed in the ¢ tests involved in testing the 
three pairs of difference scores. A significant 
primacy effect was found in the “Very familiar” 
group and a significant recency effect in the 
“Unfamiliar” group. The “Familiar” group 
indicated a primacy effect which was not 
Statistically significant. The .05 level of 
significance was set as the alpha level for the 
study. These results are summarized in Table 3. 


Discussion 


The results demonstrate that increased 
familiarity produces increased primacy effects, 
at least under the conditions of this study. A 
recency effect is apparent in the group that 
was unfamiliar with the topic before the 
presentation of the conflicting communications. 
It is likely that the informational talks on 
vivisection influenced opinion as well as the 
persuasive arguments, perhaps in terms of 
increasing stability rather than in influencing 
the means of the various groups. However, 
care was taken to make the familiarization 
talks as neutral as possible. In any case, the 
major thesis of this paper is not disturbed by 
this possibility. Thus it has been shown that a 
communication presented first, whether pro or 
con, is more effective the more familiar the 
subject is with the topic. It should be noted, 
however, that the range of familiarity em- 
ployed in this study probably extends from 
somewhere near zero to “somewhat familiar” 
at most. If a topic with which the subjects are 
very familiar, such as racial prejudice, were 
used, the degree of familiarity with the topic 
of the various experimental groups would vary 
from a base very much greater than zero to a 
very great familiarity compared with the 
“unfamiliarized” group in this study, which is 
close to zero familiarity. Perhaps different 
arrangements of primacy and recency effects 
for the various groups might result under these 
conditions. If, however, familiarity is greater 
than zero then primacy is not actually achieved 
in the sense of the first presentation of ma- 
terial. It may be that measurable primacy and 
recency effects only appear when one is dealing 
with a topic with which the subject is only 
slightly familiar or not familiar with at all. 
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Hovland and Mandell’s (Hovland, 1957) 
study, which showed no clear cut primacy or 
recency effects using a topic with which the 
subjects were already familiar, and the results 
of the current study seem to support this 
contention. 

A study by Luchins cited by Hovland (1957) 
is also relevant in this context. Luchins 
presented two different descriptions of the 
same individual to high school and college 
students. These descriptions were charac- 
terized by being polar in nature on an intro- 
version-extroversion continuum, hence they 
are essentially opposed communications. The 
subjects read the descriptions from a sheet of 
paper. Complete lack of familiarity with the 
subject matter was assured since the charac- 
ter who was being described (“Jim”) was 
fictitious. The subjects were then asked to 
describe Jim in their own words and answer 
questions about his personality characteristics. 
The presentation of the opposing descrip- 
tions were juxtaposed for the various experi- 
mental and control groups. Luchins found 
consistent primacy effects. He favors a “set” 
explanation of his results in that the initial 
description directed the latter opinions of the 
subjects in much the same way that a par- 
ticular initial solution to a problem effects 
later attempts to solve similar problems. 

Luchins’ experiment is the only one in the 
literature bearing directly on this problem. 
All of the groups in his study began with 
essentially zero familiarity toward the topic. 
In the present study there are three levels of 
familiarity operating, and yet the presence of 
primacy or recency is opposite to the effects 
obtained by Luchins. That is, with increasing 
familiarity primacy increases, and recency 
occurs with low familiarity. 

Luchins suggests certain possible reasons 
that may account for the discrepancies. The 
communications used in both studies were 
markedly different. In one case the subjects 
read the communication and in the other it 
was read to them. Luchins’ communication 
describes a person, the vivisection communi- 
cation a process. Any of these factors may be 
central in influencing the predominance of 
primacy or recency effects in a given communi- 
cation situation. Perhaps major differences 
between Luchins’ tasks and the present one 
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could have accounted for the opposi 
In Luchins’ study the subjects, bei 
to describe an individual in terms of ci 
characteristics, were probably not 
weigh both possibilities of interpretat 
they might be expected to weigh 
sides of an, argument with respect to 

question such as vivisection. When th 
questionnaire was administered in this 
is probably safe to assume that the 
realized they were expected to make a 
between the two major possibilities sug 
by the pro and con vivisection arg 
In Luchins’ study it is not probable th 
subjects perceived themselves as bein, 
similar kind of choice situation. Since 
have been relatively few experiments | 
problem of familiarity and its effect on 6 
of presentation of a persuasive con 
tion, and since the differences amo 
studies may be attributable to techn 
methodological difficulties rather 
retical ones, any attempt at a 


for further research before any 
theoretical interpretation of primacy, 
and familiarity can be attempted. Th 

tion of a recency effect when the top 
communication is unfamiliar to the 

and an increasing tendency for primac 
increasing familiarity shown in this” 
must be examined in context with sever: 
variables yet to be studied before any 
cal conclusions can be made. Recent 
have been demonstrated by other wo 
various situations which must be consi 


questionnaire was distributed after 
two opposing communications 

delivered when the topic included d 
descriptions of a single individual. 
Campbell (1959) showed that recency í 

were strongest when there was a long ! 
between first and second communicati 
immediate measurement after the 
communication. When the two arg 
presented successively and the testing 
for a considerable time, recency is at 
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mum. Film, tape recorded, written, and 
spoken presentation of persuasive arguments 
must be examined to determine any possible 
differential effects of these variables on opin- 
ion change. Communications about contro- 
versial topics such as racial prejudice or 
political problems may have different effects 
on order of presentation and familiarity fac- 
tors. The question as to whether the subject 
is aware of the intent of the communicator 
must also be answered. Some interview or 
questionnaire technique would have to be 
developed for this purpose. Before studies 
attempting to solve some of these problems are 
completed, anything approaching a compre- 
hensive theoretical formulation of communi- 
cation effects on opinion will be extremely 
difficult. 


SUMMARY AND CONCLUSIONS 


This study was designed to examine the 
influence of the order of presentation of a pro 
and a con communication about vivisection on 
adult subjects who have been differentially 
familiarized with the topic of communication. 

Eighty-five subjects were divided into three 
experimental groups which received a long 
familiarization talk and discussion on vivi- 
section, a short familiarization talk with no 
discussion, and neither a talk nor discussion, 
respectively, after all groups had answered 
two questions as to the meaning of the 
term “vivisection” and had filled out a 
vivisection opinion questionnaire. After 12 
days each group was divided into two sub- 
groups, one of which received a pro and then a 
con argument about vivisection, and the 
other of which listened to the arguments in 
reverse order. All groups then responded to the 
vivisection opinion questionnaire for the second 
time. An analysis of difference scores between 
Pre and posttest mean scores for all groups 
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indicated that the argument in first position 
(primacy effect) was most successful in chang- 
ing opinion for the group with the greatest 
familiarization. The argument in second 
position (recency effect) was most effective in 
changing the opinion of the group with no 
familiarization. 

It is concluded that: 

1. Under the conditions of the experiment, 
increasing familiarity with the topic of a 
communication on the part of the subjects is 
directly related to an increasing primacy effect 
when opposing arguments are presented suc- 
cessively. 

2. A recency effect is evident when the 
subjects’ familiarity with the topic of commu- 
munication is near zero and when opposing 
arguments are presented successively. 
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OPINION EVALUATION AND AFFILIATION! 
ROLAND RADLOFF? 


University of Minnesota 


HE present study was designed to test 

the hypothesis that a person’s needs for 

self-evaluation regarding the correctness 
of his opinions arouse affiliative tendencies 
in him. It stems directly from results and 
discussion contained in Schachter’s (1959) 
recent book The Psychology of Affiliation and is 
related to Festinger’s (1954) theory of social 
comparison processes. 

Schachter (1959) found a positive relation 
between emotional arousal and affiliative 
tendencies; specifically, under conditions of 
experimentally manipulated anxiety, highly 
anxious subjects wanted to be with others more 
than did nonanxious subjects. From a series of 
experiments designed and executed to explore 
the determinants of this relationship, Schach- 
ter concluded that the affiliative tendencies he 
found were a manifestation of “needs for 
anxiety reduction and needs for self-evalua- 
tion; that is, ambiguous situations or feelings 
lead to a desire to be with others as a means of 
socially evaluating and determining the 
‘appropriate’ and proper reaction” (p. 132). 

Festinger (1954) has postulated that: 
“There exists, in the human organism, a drive 
to evaluate his opinions and abilities” (p. 117). 
Further, Festinger proposes that in the absence 
of objective or physical evaluative criteria, 
opinions are evaluated by comparing them 
with those of other people. The opinion is then 
taken as correct or incorrect in terms of agree- 
ment or disagreement with the opinions that 
the others hold. The drive for self-evaluation 


1 This study is part of a dissertation (Radloff, 1959) 
submitted to the faculty of the Graduate School of 
the University of Minnesota in partial fulfillment of the 
requirements for the degree of Doctor of Philosophy. 

This study is also part of a program of research on 
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Laboratory for Research in Social Relations at the 
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direction of Stanley Schachter and is supported by a 
research grant from the National Institute of Mental 
Health, United States Public Health Service. 

2 The author is now with the United States Public 
Health Service. 

The author wishes to thank Stanley Schachter for 
support and advice on all phases of the study. 
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has been successfully employed to account for 
a variety of behaviors in a number of labora- 
tory studies. Specifically, group members have 
been found to change their opinions, (Back, 
1951), exert influence to change the opinions of 
others (Festinger, Gerard, Hymovitch, Kelley, 
& Raven, 1952), and reject unchanging de- 
viates in order to create uniformity of opinion 
within the group (Schachter, 1951), thereby 
providing validity for opinions by consensus, In 
all of the studies just cited, opinion compari- 
sons were made in existing groups. No experi- 
mental evidence is yet available that bears 
directly on the question of opinion evaluation. 
and affiliation. The present study was therefore 
planned to demonstrate that needs for evalua- 
tion of opinions are important determinants of 
group formation in addition to their function 
as sources of pressure to uniformity within 
groups. The major hypothesis of the study is: 
A person who is uncertain about the correctness 
of one of his opinions, for which he finds no ob- 
jective criteria available by which to evaluate 
its correctness, should seek affiliation with other 
people in order to evaluate his opinion via 
social comparison. y 
In an experimental test of this hypothesis, 
specific predictions regarding affiliative tend- 
encies would follow from a manipulation of 
evaluative needs. With initial need arous 
held constant, if for some subjects information 
that satisfies evaluative needs is provided while 
for others it is not, affiliative tendencies should 
be stronger in the latter group than in the 
former. Since people are assumed to satisfy 
evaluative needs by comparing their opinions 
with those of others, one method of manipula- 
ting evaluative information would be to 
provide subjects in different conditions Wi 
information about the opinions of various 
groups of other people. Such information can 
be ordered on a continuum of adequacy for 
purposes of comparison and evaluation. Points 
ranging from inadequate to adequate could 
represented by giving the subjects information 
about the opinions of: (a) no others, (6) it- 
relevant others or inferiors, (c) peers, (@) & 
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perts. (Inferiors are defined as people who have 
less education, information, and experience 
than the subjects; and whose inferiorities are 
relevant to the validity of their opinions on the 
issue in question. Experts are defined as people 
who have greater education, information, and 
experience than the subjects, and whose 
superiorities are relevant to the issue in ques- 
tion.) 

The magnitude of evaluative needs (and 
therefore of affiliative tendencies) are predicted 
to vary in the following manner according to 
the conditions just specified a > b > ¢ > d. 
These predictions depend in part on the 
degrees of inferiority and expertness that 
the subjects attribute to groups b and d. Ac- 
cording to the strength of the manipulation 
and the subject’s perception, could be closer 
to a or to c, and d could be close to or distant 
from c in terms of adequacy as comparison 
groups. In the absence of prior knowledge 
regarding the strength of the manipulations, 
predictions can be made only for the rank order 
of the groups and not for the magnitude of the 
differences. 

The manipulations just considered assume 
that the subjects’ commitment to an opinion is 
held constant. If commitment is allowed to 
vary a person committed to an opinion may be 
expected to have greater evaluative needs than 
someone who is not so committed. 

Differences can be predicted within the 
groups according to the subjects’ agreement 
or disagreement with the opinions of the others 
provided him. According to Festinger (1954), 
opinion evaluation is impaired when the only 
comparison available is a very divergent one. 
Therefore, subjects in the peer comparison 
Condition who disagree with the opinions of a 
large majority of the comparison group should 
have evaluative needs that are relatively un- 
Satisfied as compared with those who agree 
With the opinions of the large majority of the 
comparison group. Evaluative needs should be 
less effected by agreement or disagreement with 
a large majority of the members of inferior or 
expert comparison groups than they are by 
agreement or disagreement with peers. The 
magnitude of such differential effects is again 
dependent upon the extent to which these 
groups are perceived to be different from the 
Peer group. Information about the opinions of 
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inferiors should be irrelevant for both those 
who agree and those who disagree if the in- 
feriors are sufficiently low in status. Receipt of 
information regarding the opinions of experts 
provides adequate evaluative information for 
both those who agree and those who disagree if 
the experts are sufficiently high in status. 

Further within-group differences may be 
predicted on the basis of individual differences. 
Schachter (1959) found an interaction between 
birth order, anxiety, and affiliation. Specifi- 
cally, he found that affiliation tendencies of 
early born subjects were more strongly effected 
by the manipulation of anxiety than were 
those of later born subjects. Schachter at- 
tributed these differences to the greater 
dependency needs of first born and only 
children. Relevant to the present study the 
question may be asked, is this effect specific to 
anxiety or does it apply to a variety of situa- 
tions in which the need for social evaluation is 
a motive? A partial answer to this question can 
emerge from a test of the following hypothesis 
in the present study: First born and only 
subjects, who are presumably more dependent 
on other people for the evaluation of their 
opinions than are later born subjects, should 
manifest greater differences in affiliative 
tendencies as a result of varying needs for 
social evaluation. 


METHOD 


The experiment designed to test these hypotheses 
was conducted as a combined survey of student opinion 
and a recruiting session for participants in discussion 
groups. Subjects were first asked to give their opinions 
on an issue in writing. Opportunities for evaluation were 
manipulated by telling them the opinions of groups of 
other people. Affiliation tendencies were measured by 
asking the subjects to volunteer for discussion groups 
that were to talk about the issue on which they had 
just given their opinions. 


Procedure 


Subjects were 480 females from introductory psy- 
chology classes at the University of Minnesota, who 
were randomly assigned to experimental groups within 
the time limits their schedules would permit, 

Subjects were assembled in groups of 25-30, After 
they had filled out a short questionnaire giving data 
on birth order, the experimenter read to all groups a 
statement designed to arouse interest in the topic, im- 
press upon subjects the importance of the opinion they 
were asked to give, and provide a rationale for the proce- 
dure to follow. The gist of the remarks was that the 
number of potential students and costs of college educa- 
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tion would present serious problems by 1970, Data were 
given to emphasize the gravity of the situation. The 
specific aspect of the problem the subjects were asked 
to consider was, who shall pay these increased costs, 
students or governmental units? Their stake in the 
problem as future parents and taxpayers was empha- 
sized, and they were told that their special information 
on and concern for the problem would add weight to 
their opinions, 

After hearing the statement, each subject received 
an opinion survey form asking her to “Please indicate 
by a check mark on the following scale what per cent 
of the cost of their education you feel college students 
or their parents should pay.” The scale ranged from 
0% to 100% with intervals of 5%. Subjects were also 
asked to indicate how certain they were of their opin- 
ions on a seven-point scale with categories ranging from 
“absolutely certain” to “completely uncertain.” 


Manipulation of Evaluative Information 


At this point subjects were provided with differential 
opportunities to evaluate the correctness of their opin- 
ions by giving them information about the opinions of 
others, in the form of a frequency distribution that was 
drawn on the blackboard and explained in detail to the 
subjects. The abscissa of the distribution was the same 
scale on which subjects had made a check of their 
opinions; the ordinate represented frequency of response 
in the particular comparison group. For all groups given 
information the curves were identical in shape, skewed 
with a highly peaked mode. As a control for possible 
interaction between the subject’s position on the opin- 
ion scale and agreement or disagreement with others, 
half the groups in each condition were shown a distribu- 
tion with the mode between 20% and 40%, while the 
other half were shown a distribution with the mode 
between 60% and 80%. Seventy-five percent of the 
responses of the comparison group were ostensibly in 
this 20% range, with 12% in the long tail and 13% in 
the short tail of the distribution. By means of this 
graph each subject could see that either an overwhelm- 
ing proportion of the comparison group (75%) agreed 
with her, or that an even larger proportion (87% or 
88%) disagreed with her, according to the position she 
had checked on the opinion survey form. 

These manipulations produced the following condi- 
tions: 

No Information. Subjects in this condition were 
not given any information about the opinions of 
other people. That is, no frequency distribution 
was shown. g 

Inferior. Subjects in this condition were shown a 
distribution purportedly summarizing the opinions 
of high school sophomores. 

Peer. Subjects in this condition were shown a dis- 
tribution purportedly summarizing the opinions of 
college sophomores at their university. 

Expert. Subjects in this condition were shown a 
distribution purportedly summarizing the opinions 
of leading economists, university presidents, and 
state legislators concerned with financing higher 
education. 

No Decision. In a fifth condition, the extent to 
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which the subjects held an opinion on 
manipulated by omitting administra 
vey form. Subjects in this condition wei 
any information about the opinions of 
Their interest in joining a discussion g 

ited immediately after the interest aro 
ment had been read. a 


Measuring Affiliative Tendencies 


Affiliation tendencies were measured by a: 
jects to volunteer to take part in discussiot 
that were to talk about the problem of finane 
education. The experimenter stated that the di 
groups would be composed of six to eight “stu 
yourselves,” The ostensible purpose of the 
groups was to “generate some original ideas 
be put to use.” An information form was. 
on which the subject was asked to give hi 
address and to indicate the amount of int 
in volunteering for a discussion group. The fou 
available were: j 

1. I would like very much to take p 

cussion group. 4 
2. I wouldn’t mind taking part in- 

group. 4 
3. I could take part in a discussion gron 


sary. 
4. Ihave no interest in taking part ina d 
group. f 


This instrument provided the measure of the ¢ 
ent variable, 


Independent Measure of Need fo 
Evaluation 


Subjects were asked their reasons for 


social evaluation. An open-ended questi 
distributed with the explanation that the 
Opinion Research Center, sponsors of the st 
ducted many such discussion groups, and was int 
in the problem of why people do and do not 
in such groups in order that more effective 
techniques might be employed. Since this ques 
was not introduced until the experiment had 
progress for some time, data are available 
portion of the subjects. 

At the conclusion of the experiment subj 
asked not to discuss the issue with other 
students since the experimenter wanted to hay 
expression of student opinion, uncontam 
formation about how others felt. A questi 
beled “Previous Interest” revealed that sul 
later groups had had no discussions wi 
previous groups, and that they did not 
true nature of the experiment. 


RESULTS X 
Differences in Affiliation Tendencies 
Conditions 


The major hypothesis of a positi 
between need for opinion eval 
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TABLE 1 
Means oF INTEREST IN JOINING A DISCUSSION Group 
AND PERCENTAGE IN EACH GROUP INDICATING 
Maximum INTEREST | 


Condition NU Dicustn Manman 

roup' Interest 
No Information 114 2.02 40% 
Inferior 93 2.30 BA 

Peer 107 2.47 19' 
Expert 87 2.63 14% 
F = 6,98; df = 396,3; p < .001 

No Decision 79 2.53 12% 


® Lower scores indicate greater interest in joining discussion 
groups, 


affiliation tendencies is strongly supported by 
the data reported in Table 1. Means indicating 
the strength of affiliation tendencies are in the 
predicted order of magnitude for the four 
conditions and the variance of the condition 
means around the grand mean is highly 
significant. The exact probability of the four 
means being ranked in the predicted order is 
.042 since there are 24 possible ways in which 
they could have been ordered by chance. 
Thus it can be concluded that providing 
subjects with differential opportunities to 
evaluate their opinions strongly effected 
affiliative tendencies. 

A comparison of the No Information and No 
Decision condition affiliation means reveals 
that holding an opinion also effects affiliative 
tendencies. Subjects who were asked to form 
and record their opinions exhibited signifi- 
cantly stronger affiliative tendencies than 
those who were not asked to do so. 


Effects of Agreeing or Disagreeing with Others 


According to the point they had checked in 
recording their opinions, subjects were divided 
into two groups for the purpose of an analysis 
of differences within conditions. If the check 
mark indicating the subject’s opinion lay within 
the 20-point modal range of the comparison 
distribution that she had been shown, the 
Subject was classified as a “conformer”; if it 
lay outside this range, she was classified a 
“deviate.” 

_ Results presented in Table 2 show no 
Significant differences between the affiliation 
tendencies in any of the three conditions in 
which information was provided. However, a 
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straight “conformer-deviate” division assumes 
that the subjects who were deviates at the 
time their opinions were measured were also 
deviates at the time affiliation tendencies were 
measured. Festinger’s (1954) theory of social 
comparison processes holds, and numerous 
experimental studies (Back, 1951; Festinger 
et al, 1952; Festinger & Thibaut, 1951; 
Gerard, 1953; Hochbaum, 1954) have found, 
that one manifestation of pressures toward 
uniformity is a change of opinion toward the 
group norm by deviates. It may therefore be 
presumed that in the present study, many 
subjects who were deviates according to their 
opinion statements may no longer have been 
deviates at the time their affiliation tendencies 
were measured, having changed their opinions 
to conform to the norm of the comparison 
group. Since it was not feasible to include a 
direct measure of opinion change in the study, 
the measure of certainty of opinion was em- 
ployed to separate those who presumedly 
changed their opinions from those who did not. 
In previous studies in which opinion change 
has been measured, Hochbaum (1954) and 
Ehrlich (1958) found that subjects who were 
uncertain of their opinions changed to conform 
to a normative opinion significantly more than 
did those who were certain. Therefore, in the 
present study, subjects were dichotomized 
near the median according to their responses to 
the scale measuring certainty of opinion. 
Subjects who indicated that they were “ab- 
solutely,” “very,” or “quite” certain were 
called certain; while subjects checking “fairly” 
and “not at all” certain and “very” and 
“completely” uncertain were designated un- 
certain for purposes of analysis. 

Comparisons of the affiliation means re- 
ported in Table 3 give strong support to the 
hypothesis that deviates in the Peer condition 
who were certain of their opinions have un- 
satisfied evaluative needs. The affiliation 
tendencies of subjects in this group were nearly 


TABLE 2 


MEAN AFFILIATION SCORES FOR CONFORMERS 
AND DEvIATES 


Condition  Conformers N Deviates N 


i BO ae 27 112,285) 66 
Por 258k ak) 8 
Expert 2.70 23 2.61 64 
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TABLE 3 


MEAN AFFILIATION SCORES FOR CERTAIN 
AND UNCERTAIN DEVIATES 


Deviates Who Deviates Who 
Condition Were Certainof MN Were Uncertain N 
eir Opinions of Their Opinions 
Inferior 2.24 34 2.47 32 
Peer 2.09 32 2.71 41 
Expert 2.54 35 2.69 29 


equal to the affiliation tendencies of subjects in 
the No Information group. Peer deviates who 
were certain of their opinions manifested 
stronger affiliative tendencies than did Peer 
conformers (p < .10) or Peer deviates who were 
uncertain of their opinions (p < .01). Also, as 
predicted, there were no differences within 
either the Inferior or Expert conditions be- 
tween deviates who were certain of their 
opinions and deviates who were uncertain of 
their opinions and conformers. 


Analysis of Subjects’ Reasons for Joining 
Groups 

Data presented in Table 4 provide a direct 
measure of the inferred construct “need for 
social evaluation.” To simplify the analysis for 
“reasons” and birth order, subjects were 
divided into two groups—one group predicted 
to have high needs for opinion evaluation and 
the other to have low needs, according to the 
theory of social comparison processes. Needs 
for evaluation were expected to remain high 
for subjects in the No Information and Inferior 
conditions and for deviates in the Peer condi- 
tion who were certain of their opinions; they 
should be low for subjects in the Expert condi- 
tion, for deviates who were uncertain of their 
opinions and conformers in the Peer condition. 
Affiliation means for the “High Need” and 
“Low Need” groups, respectively, were 2.14 
and 2.63 (p < .001). The Ns in Table 4 are 
quite small compared to these in previous 
tables, since these data were gathered on only 
a portion of the subjects in each condition and 
very few subjects expressed interest in joining 
discussion groups in the “Low Need” group. 
Reasons were given in response to the open- 
ended request: “Please indicate why you are 
or are not interested in joining a discussion 
group to talk about this issue.” Each subject 
was free to give as many reasons as she desired, 
but the mean number of reasons given by the 
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High Need and Low Need groups and { 
proportional frequencies of subjects sta n 
specific number of reasons were ideni 
A reason was coded as expressing a need 
social evaluation only if the subject mention 
wanting to hear the views of other peop 
wanting t» compare her views with th 
others. Other categories of reasons for wan 
to join discussion groups were: social or “4 
aspects of groups, interesting nature of discu 
sion groups, information seeking (no mentit 
of getting ideas from other participant ) 
egocentric—wanting to make certain the 
opinion was heard, wanting to help own | 
nancial position, wanting to help younger si 
or their own children, performing a duty f 
society, q 
The data support the hypothesis that ne 
for social evaluation account for differences 
affiliation tendencies between the two group: 
Among subjects who desired to join discu: 
groups, those in the High Need group g 
reasons indicating needs for social evaluat 
significantly more often than did those in th 
Low Need group (p < .05). In other words, 
greater affiliative tendencies of subjects in 
High Need group may be directly attribute 
to greater needs for social evaluation. 
The classification in terms of High Need and 
Low Need divides subjects from the Peer 
dition on theoretical grounds. While 
internal analysis was necessary in order t 
provide an NV of sufficient magnitude to permit 
a reliable measure of the “reasons” index 
similar differences with respect to “reasons” 
apply as well for the treatment groups thes 
could be classified in their entirety as “High 
“Low” in need. Thus subjects from the No 
Information and Inferior conditions who dé 


TABLE 4 

Soctat EVALUATION REASONS VERSUS OTHER 
REASONS FOR WANTING TO JOIN 

Discussion GROUPS* 


c 
Group Serine Rese Reason) yaa 
i 59 
High Need 20 39 : 
Low Need 3 21 24 
Total 23 60 


X = 3.89, df = 1, p < .05 


^ Two coders agreed on the assignment of reasons to categories 
in 100% of the cases. 
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sired to join groups gave reasons indicating 
needs for social evaluation more frequently 
than did their affiliatively inclined counter- 
parts from the Expert condition (.10 > p > 
.05). 


Individual Differences: The Effects of Birth 
Order 

Data presented in Table 5 indicate that, in 
support of the hypothesis concerned with birth 
order, manipulation of evaluative needs pro- 
duced a large and significant difference in 
affiliation tendencies for early borns (p < .001) 
and a smaller, nonsignificant difference for 
later borns. The interaction between birth 
order and needs is significant (p < .01). The 
results of the present experiment complement 
Schachter’s (1959) findings that early and later 
borns do not differ in overall affiliative tend- 
encies; they only differ when dependency-re- 
lated needs are aroused. 

Again, it should be noted that while the 
analysis according to birth order includes 
subjects from the Peer condition, differences of 
similar magnitude are obtained when only the 
conditions that could be assigned entirely to 
“High” or “Low” need are considered (Table 
6). Although the interaction of birth order with 
need is less reliable (.05 > p > .01) in Table 6 
than it is in Table 5, the mean affiliation scores 
are virtually identical in the two tables. 

An examination of the reasons for joining 
discussion groups provides evidence that the 
greater affiliative tendencies of early borns in 
the High Need conditions were motivated by 
needs for social evaluation. Among subjects 
from whom reasons were solicited, there were 
17 early borns and 14 later borns who showed 
maximum interest in joining groups in the 
High Need conditions. Of these subjects, 9 
early borns and 3 later borns gave reasons 
indicating need for social evaluation. The exact 


TABLE 5 
AFFILIATION MEANS BY BIRTH ORDER AND 
CONDITION 
Bi ka ly Row 
rth Order Condition | and | N | Later | N | Means 
First 
High Need 2.03 | 118 | 2.26 | 116 | 2.14 
w Need 2.81 74 | 2.50 | 82 | 2. 
Column Means 2.33 2.36 
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TABLE 6 
AFFILIATION MEANS BY BIRTH ORDER AND 
CONDITION 
i Onl 
Birth Order Condition | and’ | | tater | v | Rom, 
No Information and | 2.00 | 99 | 2.28 | 108 | 2.14 
'enors 
2.83 | 35 | 2.50 | 52 | 2.63 
umn Means 2.22 2.35 


probability of a distribution this extreme or 
more so occurring by chance, given fixed 
marginals, is .075. 

Discussion 


The results of the present experiment sup- 
port the hypothesis that the drive for self- 
evaluation with respect to opinions can be an 
important determinant of affiliation. 

As is generally the case in experimental 
social psychology, the manipulations in the 
present study were relatively weak, One could 
therefore not expect to arouse affiliative 
tendencies in all subjects in the High Need 
group. The contention is plausible, however, 
that most people at some time experience 
evaluative needs which can be satisfied only 
by social comparison, of sufficient strength to 
motivate them to seek out others. The relation- 
ship established would thus seem to have wide 
generality. 

A theoretically significant contribution of 
the present study is the demonstration that 
the affiliative consequences of evaluative needs, 
first employed by Schachter (1959) to account 
for affiliative responses under conditions of 
emotional arousal, are apparently applicable in 
a variety of seemingly disparate contexts. That 
is, whenever evaluative needs capable of being 
satisfied by social comparison are aroused, the 
arousal of affiliative tendencies is a potential 
consequence. The arousal of evaluative needs 
concerning abilities, for example, should also 
produce affiliative tendencies in the absence of 
adequate nonsocial means of evaluation. 


No Decision versus No Information 

The large difference in affiliation tendencies 
between subjects in the No Information and 
No Decision conditions raises the question, 
what are the consequences of stating an 
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opinion? One probable consequence is an 
increase in the salience of the issue; thus sub- 
jects in the No Information condition may 
have a greater desire to discuss an issue which 
is more important or interesting to them. 
Another possible consequence is the generation 
of cognitive dissonance (Festinger, 1957) which 
may arise when an individual commits himself 
to a position on a complicated, multisided 
issue. In the present experiment, dissonance 
may have been created in the following ways. 
First, the person may be aware that he could 
have taken a position other than the one he has 
assumed; second, he may feel that other people 
will not agree with his stand. There is evidence 
(Festinger, Riecken, & Schachter, 1956) that 
dissonance arising from these sources can 
produce affiliative behavior. 


A Contradiction (?) 


The hypothesis that unchanging (certain) 
deviates in the Peer condition would want to 
join a discussion group composed of other 
people from their psychology classes contains 
a possible logical contradiction if it is presumed 
that the subjects were perceptive and logical. 
The distribution of opinions that these sub- 
jects, Peer deviates, had seen told them that 
nearly 90% of the potential discussants would 
disagree with them. Festinger (1954) states 
that for social comparison to produce stable 
evaluation of opinions, the opinions of others 
must be close to one’s own. Clearly there 
would be few if any people in a discussion group 
composed of six to eight people with opinions 
close to those of the deviate, given the distri- 
bution shown. Why, then, did the subject’s 
behavior conform to the predictions of this 
apparently contradictory hypothesis? 

Several explanations are possible. One is that 
the subjects may simply not have made the 
logical inference necessary for them to have 
predicted the probable composition of the 
discussion groups. At the other extreme of the 
subjects’ perceptive ability, it would probably 
be implausible to assume that subjects would 
hypothesize that groups would be composed 
largely of deviates; that subjects would realize 
that conformers would not want to join discus- 
sion groups because their opinions had already 
been evaluated. 

A theoretically more interesting and ex- 
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perimentally testable hypothesis is that | 
changing deviates considered but disregar 
the probable, composition of the discuss 
groups. They may have had such strong ne 
for more evaluative information that a 
tential comparison even with quite diver 
views was’ sufficiently attractive to a 
affiliative tendencies. According to th 
reasoning, it would be expected that, given 
choice, unchanging deviates would rather 
a discussion group composed of people 
opinions were similar to their own. 


Birth Order 


As mentioned in the introduction, 
hypothesis concerning the effects of ordin 
position stemmed from Schachter’s (19% 
experiments on anxiety and affiliation. H 
invoked dependency as a construct to accou 
for the greater affiliative tendencies of e 
borns under conditions of high anxiety. 

With reference to birth order and dep 
ency, the present study fills in the missing 
in a 2 X 2 table summarizing the resul 
four experiments in which emotions 
opinions were in need of evaluation, 
changes as a result of social comparison 
affiliative tendencies were measured. Comp: 
with later borns: early born subjects showed 
greater changes in (a) emotional reactions 
(Wrightsman, 1959) and (6) opinions (Ehrlich, 
1958) as a result of social comparison; eat 
borns also manifested greater affiliative te 
encies when in need of (a) emotional com 
parison (Schachter, 1959) and (b) opinion 
comparison (present study). The complemen È 
tary results of these four studies strongly 
suggest that the static variable of birth order 18 
related to the more dynamic concept ok 
dependency. i 

SUMMARY ; 

The hypothesis that self-evaluative needs 
concerning opinions are an important determi 
nant of group formation was tested experi 
mentally. fi 

Subjects were presented with a problem an “fl 
asked to state their opinions. Adequacy © 
opinion evaluation was varied at four leves i 
along a continuum by telling subjects 1 i 
ferent conditions the opinions © ae 
groups of other people. Affiliation tendencii 
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were measured by assessing the level of interest 
the subjects had in joining discussion groups to 
talk about the opinion in need of evaluation. 

Results gave strong support to the hypothe- 
sis. Affiliation tendencies varied according to 
the strength of evaluative needs. 

Consonant with Schachter’s (1959) findings 
regarding emotional evaluation and affiliation, 
the affiliative responses of first born and only 
subjects were more strongly effected by 
varying evaluative needs than were those of 
the later borns. 
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N a series of experiments, Schachter (1959) 

has studied some aspects of affiliation by 

inquiring into the instrumentalities served 
by being with others when a person is exposed 
to a stressful situation. The subject in his ex- 
periments anticipated either a weak or a strong 
shock and then indicated his desire to wait 
either with others or alone before getting 
shocked. Schachter suggests five plausible 
“explanatory possibilities” that might each 
account for the experimentally demonstrated 
positive relationship between fear and affilia- 
tion. Being together with others might provide 
a person with: a collective means for avoiding 
the experiment, i.e., escape; the attainment of 
cognitive clarity regarding the impending dan- 
ger; comfort and support; distraction; or a 
means for attaining clarity regarding his 
“worked up” body state, i.e., self-evaluation. 
By varying aspects of the projected waiting 
situation described to the subject, and on the 
basis of internal analyses of his data, Schachter 
rules out all but the comfort and self-evalua- 
tion hypotheses and proceeds to lay heavier 
emphasis upon self-evaluation. The present 
experiment was prompted by the following 
difficulties inherent in Schachter’s method of 
inferring such a desire for self-evaluation via 
social comparison: the unduly circumscribed 
list of “plausible explanatory possibilities,” the 
reliance upon internal correlational analysis to 
eliminate the three other possibilities, and the 
possible presence of experimental constraints 
limiting the operation of other motives which 
may, under ordinary circumstances, produce 
affiliation. 

Let us accept the basic assumption underly- 
ing comparison theory, namely, that the per- 
son will attempt to reduce cognitive unclarity 
regarding some aspect of himself. If informa- 
tion from others is available to him regarding 
that aspect of themselves, he should tend to 


1 We wish to thank George S. Rotter for help in 
running the experiment and Stanley Schachter and 
Arthur R. Cohen for helpful, critical comments. 
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employ the information in attempting 
ate himself in that respect. A number of 
of internal cognitive unclarity rega 
aroused state may exist: The individ 
want to /abel the emotion, measure its 
determine whether, in a normative se 
appropriate, or discover whether or no 
reacting adaptively to the impending sit 
For the present, we single out the a 
intensity. We assume, in line with comp 
theory, that intensity information from 
provides the individual witha frame of ref 
within which to evaluate the intensity 
own reaction. ti 
Schachter (1959) created his high 
fear treatments by verbal instructio 
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control for a considerable amount of ci 
cation, as some of Schachter’s own } 
indicates. We therefore decided to run 
periment under restricted conditions, tl 
jects receiving their instructions in is 

Tf a person is uncertain as to the 
of his emotional reaction, he should’ 
formation from others that will help h 
ure it. To the extent that he has info 
from others regarding the intensity 
reactions, his desire to be with them 
the impending experience should be 


METHOD 


> 
Two levels of fear were induced by telling ! 
ject that he would receive either a strong Or 
electrical shock. He was led to believe that, 
aid of certain electrodes attached to him, W 
accurately determine the extent of his emotior 
actions. Within the two fear treatments three 
tion treatments were administered. In one 
the subject received information as to the degi 
own reaction and the reactions of three other 
who were seated in adjacent cubicles. In as 
ment the subject received information as to his 
only. In the third treatment no information ¥ 
although the same electrodes were in place. 
treatment was in all essential respects a Tep 
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Schachter’s basic experiment with the exception that 
subjects were isolated from one another during the fear 
induction and four subjects were used per group wher- 
ever possible (Schachter used from five to eight sub- 
jects). 

i The subject was asked to state whether he wanted 
to wait alone or together before getting shocked and 
to what degree he desired the waiting situation he 
chose. Questionnaire and other data were then obtained. 


Subjects 


Twenty-nine male and 71 female undergraduates at 
Brooklyn College served as paid volunteers. The sex 
distribution was determined by the number of available 
volunteers. Each group consisted of either all male or 
all female subjects. 


General Setting 


Subjects were seated in adjoining cubicles. A pair of 
Scherping finger electrodes were attached to the index 
and ring fingers of the subject’s nonpreferred hand. In 
addition, an EKG electrode was attached to his fore- 
arm, which was immobilized by strapping it into a 
specially designed arm rest. The subject was informed 
that the finger electrodes registered skin resistance and 
that the arm electrode measured muscle tremor. By 
employing this information, he was informed, we could 
accurately determine his degree of emotionality. We 
were interested, the experimenter continued, in deter- 
mining his emotional reaction to various kinds of situa- 
tions that he would soon experience. In actuality, the 
forearm electrode was a dummy. The finger electrodes, 
however, were real and were connected to one of the 
inputs of a four-channel constant current galvanometer 
circuit designed to measure skin resistance.’ Since 
changes in skin resistance are known to reflect stress, 
we were provided with a physiological check on the 
fear manipulation. All of the instructions for each of 
the six experimental conditions were prerecorded on 
magnetic tape. Constant within-condition exposure 
was thereby achieved. Instructions were received by 
the subject over a headset with a rubber cup over each 
earpiece that fitted tightly against his head. This 
ee achieved some degree of auditory isola- 
ion, 


Manipulation of Fear 


Upon entering the laboratory, the subject saw a 
rather impressive array of electrical equipment with 
many switches, variously colored lights, and many 
lengths of wire and cabling. This was actually the con- 
trol and recording equipment used in the experiment 
but could easily be viewed in retrospect by the subject 
as equipment that could be used to deliver electrical 
shocks, In the high fear treatment a bold sign saying 

Caution, High Voltage” was prominently on display. 
The two experimenters were dressed in white laboratory 
Coats and introduced themselves to the subjects. In 
the High Fear treatment one of the experimenters had 
a stethescope bulging out of his pocket and a sphygmo- 
manometer was conspicuously on display. Both these 


* The instrument was designed by J. Kouzoujian. 
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features were absent in the Low Fear treatment, A 
short waiting period ensued which was used to establish 
the subject’s base level skin conductance. In both 
treatments the subject was told that the experiment 
was concerned with the effects of electrical shock. 

In the High Fear treatment the instructions con- 
tinued: 

We would like to give each of you a series of elec- 
trical shocks. Now, I feel that I must be completely 
honest with you and tell you exactly what you are 
in for. These shocks will hurt; they will be somewhat 
painful. As you can guess, if in research of this sort 
we are to learn anything at all that will really help 
humanity, it is necessary that our shocks be intense. 
What we will do is hook you up to the apparatus, 
give you a series of electrical shocks and then deter- 
mine your reaction to the shocks. Again, I do want 
to be honest with you and tell you that these shocks 
will be somewhat painful, but of course they will do 
no permanent damage. 

In the Low Fear treatment the following instruc- 
tions were substituted for the foregoing ones: 

We would like to give each of you a series of very 
mild shocks, I assure you that what you feel will not 
in any way be painful. It will resemble more a trickle 
or a tingle than anything unpleasant. We will put 
electrodes on your hands, give you a series of very 
mild shocks and then determine your reaction to the 
shocks. I assure you there is absolutely nothing to 
be worried about. 

At the beginning of the Low Fear instruction the 
subjects were told that “they would enjoy the experi- 
ment.” This remark was omitted in the High Fear 
treatment. In all conditions it was emphasized that the 
electrodes which were attached to the person were for 
recording purposes only and would not be used for the 
administration of the shocks. 


Manipulation of Information 

After the fear induction we established three infor- 
mation treatments: the subject received his emotion- 
ality index and the indexes of the others (Inf-S + 0), 
only his own emotionality index (Inf-S), no information 

Inf). 
Oe Las -+ O treatment was introduced as follows: 
As you no doubt may guess, even the anticipation 
of getting a shock may affect you emotionally and 
hence produce physiological reactions, In front of you 
there are four microammeters which have been cal- 
ibrated to register the emotionality index I referred 
to earlier. We thought you would like to see how you 
yourself and the others are reacting to the prospect 
of getting shocked. As you can see each meter goes 
from 0 to a maximum of 100. The first meter to regis- 
ter will show your own index. Then the other meters 
will be connected so that you may see how emotional 
the others are. - 

At this point the subject’s meter was switched on 
and a few seconds later the other three meters were 
energized. Each subject was in exactly the same paral- 
lel situation. The third meter from the left was the 
first to read, this presumably being oe aes ore 

tionality index. Preset potentiometers in the meter 
seats aged the meters to read 79, 80, 82, and 81 
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in the High Fear condition and 29, 30, 32, and 31 in 
the Low Fear condition. The third figure was presum- 
ably the subject’s own index. In both treatments we 
arbitrarily set the subject’s reading slightly higher 
than the rest. On an a priori basis we believed that 
either a lower reading or one which was at the average 
would tend to reduce the subject’s actual emotional 
reaction and thus confound the fear manipulation, 
Since we know absolutely nothing, as yet, about the 
disturbing or reassuring effects of this kind of informa- 
tion, the choice seemed reasonable. In the Inf-S treat- 
ment, the subject was confronted by a single meter 
which registered 82 in the High Fear and 32 in the 
Low Fear treatment. 

After the information manipulations were admin- 
istered the subject was told that there would be a 10- 
or 15-minute wait while we prepared the equipment 
and while he awaited his turn to be shocked. He was 
given a choice of waiting together with the others in a 
room nearby or alone in a separate room. He was told 
that, on the basis of the choices made, we would make 
the most appropriate waiting arrangement. He de- 
pressed one of two buttons to indicate his choice. He 
was then asked to indicate the strength of his desire 
either to wait alone or together (depending upon which 
he had chosen). He did this by setting a dial in the 
center of his console at a position between 1 and 11. 
His dial varied the setting of a potentiometer which 
caused a milliameter, which was calibrated from 0 to 
100, to register differentially. We now had the basic 
information which we could use to test the comparison 
hypothesis, 

Check on the Fear Manipulation 

In addition to the skin resistance record, other 
checks on the success of the fear manipulation were 
included, 

After the questionnaire was completed the subject 
was asked, in the two information treatments, to answer 
the following question printed on a separate sheet of 
paper: 

On occasion we have found the individual’s actual 
feelings, that is, his subjective experience, does not 
exactly correspond to his physiological emotionality 
index. If you had been asked to guess your meter 
reading before seeing it, assuming 100 to be maxi- 
mally emotional, what would you have guessed? 
A blank was left for the subject to indicate his guess. 
The final event in the experiment was a further 

check on the fear induction. The subject was given an 
opportunity to change his mind as to whether or not 
he wished to participate. If he chose not to participate, 
however, he would forfeit the honorarium of $1.50 
which he was promised. After the subject made this 
choice, the experiment was over. 

The experimenter then revealed the deception and 
described, in brief, the purpose of the experiment. The 
subject was then asked not to reveal anything about 
the experiment to those who had not as yet participated. 


RESULTS AND DISCUSSION 
Effectiveness of the Fear Manipulation 


Three independent checks of the fear in- 
duction were made: change in skin conduct- 
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ance, the subject’s guess (in the informatior 
treatments) as to what his meter reading would 
be, and the subject’s decision to remain 9 
leave prior to the supposed shock administra- 
tion. 

We subtracted the subject’s base conduct- 
ance immediately before the fear induction 
from his level at the end of the induction, The 
average increase in conductance was greater in 
the High Fear than in the Low Fear treatment 
(p < .001 by 2). 


through with the experiment. However, in he 
High Fear treatment 11 subjects chose to leave 
as compared with only 1 in the Low Fear treat- 
ment ($ < .01 by chi square). $ 

These measures leave little doubt as to the 
overall success of the fear induction. 3 


Fear, Information, and Affiliation 


We first turn to an examination of the data 
relevant toa test of the comparison hypothesis. 
Only 12 of our 100 subjects chose to be alone, 
with no systematic relationship to our treat- 
ments. With regard to the choice of “together” 
or “alone,” there is a basic procedural differ- 
ence between this and Schachter’s experiments. 
When asked to choose “together” or “alone” 
Schachter (1959) also provided the subject 
with a “don’t care” alternative. In his analysis 
he combined the “alones” and “don’t cares” 
and contrasted them with the “togethers. 
The percentage of subjects choosing “alone 
in our No-Inf treatment is the same as in 
Schachter’s first experiment (approximately 
9% with no difference between High and Low 
Fear). His results are based principally upon & 
comparison between those subjects choosing 
“together” and those choosing “don’t care.” 
The small proportion of “alones” in both this 
and Schachter’s experiment suggests that these 
individuals are probably deviant in some re- 
lated respects. We shall return later to a Ger 
sideration of those subjects who chose “alone. 
The omission of the “don’t care” alternative, 
thereby forcing a choice between “together 
and “alone,” makes our dependent meast 
not strictly comparable with Schachter S. tg 
discussing the strength of affiliation we will} 
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TABLE 1 


FEAR, ĪNFORMATION, AND STRENGTH OF AFFILIATION 
FOR SUBJECTS CHOOSING “TOGETHER” 


No-Inf Inf-S s Inf-S+0 
High Fear | 66.80 (15) | 70.50 fe) 55,09 (11 
Low Fear | 54.53 (17) | 64.12 (17 {7.67 (12 


fer to the data for those 88 subjects who chose 
“together.” 

Table 1 presents the mean desire to be to- 
gether for all six treatments. A difference be- 
tween fear treatments is indicated (p < .10 by 
a one-tailed ¢ test; using a median split, p < 
05 by chi square). A test of the comparison 
hypothesis involves evaluating the difference 
between the two information treatments. This 
difference is in the predicted direction (p = .05 
by #) thus supporting the hypothesis. The data 
in the No-Inf treatment complicate the picture 
in that a quadratic trend is suggested with 
increasing information (p < .07 by 2), ie., 
both the No-Inf and Inf-S + O treatments are 
low relative to the Inf-S treatment. Given 
that the individual was uncertain as to the 
extent of his emotion, providing him with in- 
formation about himself would serve to reduce 
this uncertainty and hence reduce his desire 
for comparison. The addition of information 
about the others would be expected to reduce 
his comparison desire even further. How can 
we then account for the relatively high level of 
affiliation engendered in the Inf-S treatment? 

Other data, which are at hand, might shed 
some light on the matter. We determined, for 
each subject in the Inf-S and Inf-S + O treat- 
ments, the difference between his meter read- 
ing (82 in the High Fear and 32 in Low Fear 
treatment) and what he later guessed it would 
be. We correlated the absolute value of this 
difference with his desire to be together. The 
Coefficients are: High Fear Inf-S, .22; Low Fear 
Inf-S, .72; High Fear Inf-S + O, —.03; Low 
Fear S + O, .15. The Pearson coefficient for 
the Inf-S treatment is higher than for Inf- 
S + O (p < .02 by 2). When the two sources 
of information (his subjective experience and 
his meter reading) about the subject’s reaction 
were discrepant, it seems likely that ambiguity 
was induced which, in turn, engendered com- 
Parison motivation. Given that the information 
was already available (in the Inf-S + O treat- 
ment), the ambiguity was resolved. The cor- 
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relation is quite high, interestingly, within the 
Low Fear Inf-S condtition whereas it is not 
significantly different from zero in the High 
Fear Inf-S treatment. In both fear treatments 
discrepancy is, to a great extent, a function of 
guessed emotionality. In the Low Fear treat- 
ment a large discrepancy occurred when the 
subject guessed that his meter reading would 
have been considerably higher than 32, whereas 
in the High Fear treatment a large discrepancy 
occurred when the subject guessed that his 
meter reading would have been considerably 
Jower than 82. In the Low Fear treatment, 
therefore, guessed emotionality and discrep- 
ancy act in concert, whereas in the High Fear 
treatment they act in opposition. 

On the basis of this possible confounding of 
discrepancy with emotionality, a higher corre- 
lation under Low Fear than under High Fear 
is to be expected. This difference which is 
evident under Inf-S does, however, not occur 
under Inf-S + O. That this is not so in the 
latter treatment suggests a reduction of com- 
parison motivation. Since a covariance analysis 
of the present data is not appropriate, addi- 
tional research is required in which discrepancy 
and the individual’s absolute phenomenological 
level of fear are varied independently. 

One may tentatively suggest, on the basis of 
these data, that the Inf-S treatment may have 
increased the salience of the individual’s emo- 
tional state by confronting him with a discrep- 
ancy. It is also possible that the mere fact of 
knowing the absolute value, so to speak, of a 
measurement of some aspect of the self induces 
a desire for knowledge of relative standing. 
Results in a recent experiment by Radloff 
(1959) suggest an analogous process in the use 
of opinion comparison. When our subject found 
confirmation for his subjective feelings, i.e., 
where there was agreement between the meter 
reading and what he would have guessed it to 
be, he had little need for additional informa- 
tion. However, where the discrepancy was 
large his curiosity was aroused as to how the 
others were reacting. The self-information 
treatment may have induced ambiguity where 
none or very little existed in the first place. 
Given a discrepancy, however, it is not clear 
just what kind of information (assuming the 
subject was seeking some) he wanted. Perhaps 
he wanted to reach some compromise, i.e., to 
use the information others might provide as a 
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means for reconciling the two conflicting 
sources of information with which he was faced. 
In so doing he might assign different weights 
to the two sources and then proceed to average 
them in some way in order to reach a compro- 
mise. Another motivation for affiliating may 
have been to determine if he was interpreting 
the scale correctly. Perhaps the emotional 
levels he coordinated to the end points were 
somehow wrong. Still another variant of com- 
parison motivation here might be the individ- 
ual’s desire to determine if some constant 
discrepancy was experienced, i.e., perhaps the 
others’ subjective feelings also differed by an 
amount similar to the discrepancy with which 
he was confronted. These represent some of the 
possibilities which should be explored. 

Two additional difficulties, one methodo- 
logical and one statistical, occur in interpret- 
ing the correlation between the discrepancy 
of subjective feelings from the meter reading, 
and affiliation. The emotionality guess was a 
retrospective view by the subject of himself 
and is therefore subject to a number of sources 
of contamination. Also the variance of guesses 
within the Inf-S + O treatment tends to be 
smaller than the variance within the Inf-S 
treatment (p < .10 by F). This homogeniza- 
tion would follow since additional reference 
information had been provided and all the 
readings were close to one another. The chances 
for a correlation in the Inf-S + O treatment 
would be reduced relative to the Inf-S treat- 
ment by allowing individual error a wider 
relative latitude to affect the correlation. In 
order to explore the problem further, this 
feature of the situation would have to be 
controlled. 


Subjects Who Chose “Alone” 


Of the 6 subjects in the High Fear informa- 
tion treatments (Inf-S and Inf-S + O combined) 
who chose “alone,” 5 had emotionality index 
guesses of 70 or higher. Of the 27 subjects who 
chose “together” in these treatments only 12 
had guesses of 70 or higher. The data on 
“alone” choices are sparse. However, the 
interaction just noted suggests that under 
certain circumstances, which we are unable as 
yet to specify, an emotionally aroused indi- 
vidual prefers seclusion to the company of 
others who are faced with the same plight. 
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Another line of evidence suggests a positive 
relationship between fear and preference for 
isolation. For each of the eight subjects in the 
information treatments who chose “alone” 
we compared his guessed emotionality with 
the strength of his desire to be alone and 
found a Spearman coefficient of .76, which 
for an N of eight, is significant at the .02 level, 
Thus high emotionality for “alone” subjects 
relative to those who chose “together” and the 
correlational evidence just discussed, suggest 
that an “alone” choice did not reflect indif- 
ference toward being together as Schachter’s 
(1959) grouping of the data might lead us to 
suspect. Sarnoff and Zimbardo (unpublished) 
report that the induction of oral anxiety (where 
a psychosexual conflict is aroused) predisposes 
the subject toward being alone rather than 
together, the reverse being true for fear (where 
arousal does not involve a psychosexual con- 
flict). The latter is comparable to our manipu- 
lation. Those few subjects in our experiment 
who chose to be alone in the High Fear treat- 
ment may thus have suffered a diffuse anxiety 
reaction of the kind generated by Sarnoff and 
Zimbardo in their anxiety manipulation. Also, 
we have noted that the preponderance of 
subjects who chose “alone” were men and 
that their emotionality was high. A public 
display of emotion is, in our culture, considered 
unmanly. Experiencing the emotion in private 
obviates the necessity of hiding it from others 
and removes the risk of having it displayed 
inadvertently. This suggestion brings into 
focus the normative aspect (the reaction others 
would presumably approve) of emotional 
comparison which has thus far received little 
attention. 


Sex, Birth Order, and Affiliation 


Schachter (1959) found a positive relation- 
ship between a questionnaire item measure © 
anxiety and the tendency to affiliate. We 
decided to examine, irrespective of experi 
mental treatment, the relationship between 
emotionality as measured by an increase 1 
skin conductance and desire to be together. 
Here we find an interesting male-female 
differential. Using median splits for both 
variables we find a positive relationship for 
women, namely, that a large increase in skit 
conductance during the fear induction © 
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associated with a strong desire to be together, 
whereas a small increase in skin conductance is 
associated with a weak desire to be together 
(p < .02 chi square). For menman opposite 
trend occurs (p < .10 by chi square). The 
considerably smaller N for men makes sta- 
tistical comparison unstable. This „difference 
in trend is quite interesting and brings us to a 
closer examination of the effect of the sex 
variable and of the birth order effect reported 
by Schachter. 

On the questionnaire we asked the subject 
to indicate the age and sex of siblings, if any, 
and to indicate his own age. The subjects in 
Schachter’s (1959) experiments were all 
University of Minnesota coeds whereas we 
used both men and women. We find a difference 
in accordance with Schachter’s finding, i.e., 
using a median split in classifying for high and 
low affiliation we find that 42% (21/50) of 
first born and only children have low affiliation 
scores whereas the percentage for later borns 


< is 61 (23/38, p < .05 by chi square). If we 


examine men and women separately we find 
quite a striking contrast in the relationship. 
Thirty-four percent (14/41) of first born and 
only women had low scores whereas among the 
later born women the percentage is 71 (20/25, 
p < 01 by chi square). For men the relation- 
ship appears reversed. Of the 29 men, 10 
chose “alone” (only 2 of 71 women chose 
falone”), Among the 19 men who chose 

together,” 77% (7/9) of the first born and 
only children and 30% (3/10) of later borns 
had low scores (p < .07 by an exact test). The 
Mean affiliation scores, combining across fear 
treatments, are: first born and only men, 45.1; 
first born and only women 67.0; later born 
Men, 61.4; later born women, 55.7. A large 
difference occurs between men and women 
first borns, whereas what appears as a reversal 
for later borns is not a significant difference. 


_ Of the 10 men who chose “alone,” the 4 first 
_ borns had an average desire to be alone of 


52.7 whereas the average desire of the 6 later 
borns was 44.3. 

Schachter (1959) finds some evidence for a 
Telationship between birth order and emo- 
tional reactions to stress, namely, that first 
born and only women tend to react more 
Strongly than later borns. We examined, for 
all subjects, the relationship between birth 
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TABLE 2 
BIRTH ORDER, Sex, FEAR TREATMENT AND 
CONDUCTANCE CHANGE (micromhos) * 
DURING THE FEAR INDUCTION 
High Fear Low Fear 
Men Women Men Women 
First Born | 1.67(6) | 2.58(20) | 0.00(7) | 0.6721) 
Later Born | 3.74(5) | 1.31(14) | 0.79(11) | 0.30(12) 


®Skin conductance data for four women are missing due to an 
equipment failure. 


order and change in skin conductance during 
the fear manipulation. These data are pre- 
sented in Table 2. We observe the High versus 
Low Fear difference referred to earlier as well as 
sex and birth order effects. In the High Fear 
treatment we see an interaction between sex 
and birth order which parallels the affiliation 
data (p < .05 by i). The relationship between 
birth order and affiliation might therefore be 
attributed to a difference in emotional reac- 
tion to the fear induction and not to a difference 
in the reaction which occurs given emotional 
arousal. 

Pursuing the problem further becomes 
difficult due to the rapidly shrinking cell size 
with further breakdowns. Examining the data 
for men becomes impossible for this reason so 
we shall confine the immediately following 
analysis and remarks to the data for women. 
In order to determine if the relationship be- 
tween sex, birth order, and affiliation was to be 
accounted for solely by the difference in 
emotionality just observed, we attempted to 
control for emotionality. In order to have a 
sufficiently large sample to examine, we com- 
bined the fear treatments. Before doing so, we 
dichotomized both the affiliation and con- 
ductance variables using a median split within 
fear treatments and then examined the result- 
ing contingency for first born and only versus 
later born women. This within-treatment 
split corrects for the differential effect of the 
fear treatments upon conductance change and 
affiliation. We find that a conductance dif- 
ferential for first born and only women is 
correlated with affiliation (p < 02 by chi 
square), whereas the trend for later borns is 
not significant. This evidence leads us to 
suspect that the relationship between birth 
order and affiliation is due, in part at least, to 
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a difference in reaction to fear. This corrobo- 
rates Schachter’s finding. 

In the light of Schachter’s (1959) conclusion 
concerning the relative overprotection of first 
born children, and the subsequent develop- 
ment of “greater dependence in first born than 
in later born children,” how can we account 
for the present difference in reaction between 
first born and only women and their male 
counterparts? One possible interpretation 
involves not so much the attitude learned by 
the child due to the strength of the succorance 
relationship with his mother as those that she 
communicates. The parent conveys society’s 
norms to the child. One set of norms that is 
communicated is how a male or a female should 
behave. In interacting with her male child the 
mother treats him as a male, rewarding male- 
like behavior to the exclusion of female-like 
behavior. The reverse is true for females. The 
behavior of first born and only children in our 
experiment exemplifies the Victorian norms of 
the stalwart man and dependent woman which 
still probably characterize the ideal man and 
woman in contemporary western culture. What 
about the behavior of later borns? A distin- 
guishing feature between first and later born 
children is that, during the formative years 
(months?) the first born child has contact only 
with his or her parents whereas the later born 
child is exposed both to parents and to one or 
more older siblings. An older sibling will not 
normally display different behavior toward the 
younger child because of that child’s sex; he 
or she will treat the other as just another 
child. At some age an individual, after fully 
incorporating society’s sex norms, may as- 
sume the responsibility, so to speak, of com- 
municating them to younger siblings. Our data 
in the present experiment are unfortunately too 
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sparse to study the effects upon affili: 
the age differential between a child a 
next older mpima: ; 


SUMMARY 


The present experiment was stim 
some wor k by Schachter (1959) in 
applied Festinger’s (1954) theory 
comparison to account for the a 
affiliative tendencies when the indi 
exposed to a threat. We attempted 
directly the tenability of explaining 
tionship between fear and affiliations by 
comparison theory. According to the tl 
the individual is motivated to comp 
others some aspect of himself, e.g., an o 
he holds, an ability he possesses, or a 
tional experience whenever other mea 
achieving certainty about that asi 
lacking. Using a modified version of Si 
fear arousing situation, in which elec! 
was threatened, we provided the i 
with information as to the intensity of en 
experienced by three other individuals seat 
adjacent cubicles. According to the 
this should reduce the desire for afi 
Support was found for the theory. O 
tives also appear to have been operativi 
as strong sex differences in reaction to t 
The effect of sex appears to be linked with 
order of birth in the family. 


REFERENCES 


FESTINGER, L. A theory of social comparison prot 
Hum. Relat., 1954, 7, 117-140. 

Raptorr, R. W. Opinion and affiliation. Unpublt 
doctoral dissertation, University of Minn 
1959. 4 

SCHACHTER, S. The psychology of affiliation. 
lis: Univer. Minnesota Press, 1959, 


(Received May 24, 1960) 


Journal of Abnormal and Social Paychology 
1961, Vol. 62, No. 8, 593-597 


A COMPARISON OF GENERALIZATION AND ADAPTATION-LEVEL 
AS THEORIES OF CONNOTATION 


© JEROME E, PODELL 
San Francisco State College 


o connote commonly means “to saggest. . . 
in addition to the explicit or denoted 
| meaning: as, the word mother means (de- 
notes) ‘female parent,’ but it generally connotes 
love, care. . .” (Webster's New World Diction- 
ary of the American Language, 1951). In other 
words, the connotation of a word may be ex- 
pressed qualitatively by another word. Re- 
cently, however, Osgood, and his associates 
(Jenkins, Russell, & Suci, 1958; Osgood, Suci, 
& Tannenbaum, 1957) have shown that conno- 
tation may be expressed quantitatively as val- 
ues on certain dimensions of meaning. In addi- 
tion, these investigators have shown that the 
major dimensions of connotation, are, in order 
of importance, evaluation (a good-bad con- 
tinuum), strength, and activity. Since the ma- 
jor portion of variance in ratings on connota- 
tional scales has been accounted for by these 
dimensions, it seemed quite worthwhile to ex- 
amine in detail the relationship between verbal 
(qualitative) and quantitative connotation. 
Osgood et al. (1957) have explained the 
connotation process by means of a mediated 
generalization model. This view suggests the 
following relationship between verbal and 
quantitative connotation, viz. if two words 
have the same quantitative connotations, i.e., 
equivalent sets of values on the dimensions 
of the semantic space, then each word should 
be a verbal connotation of the other. More 
concretely, when a person is set to respond 
by giving a verbal connotation, the most likely 
response is that word having quantitative 
connotation most like that of the stimulus 
word. This statement summarizes what will 
be hereafter referred to as the generalization 
view. Although generalization is a well-sub- 
stantiated theory in the field of learning, its ap- 
plication to the field of meaning, as Osgood 
et al. (1957) point out, demands further experi- 
mental investigation. Furthermore, one may 
- accept the dimensional approach to connota- 
tion without being bound logically to accept 
the generalization view. 
A likely alternative to this view isthe adapta- 
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tion-level theory of Helson (1947). This 
theory, while developed largely within the 
context of perception, has been shown to apply 
also to the field of meaning (Helson, Dwor- 
kin, & Michels, 1956). It differs from the 
generalization view in that it takes account of 
the quantitative character not only of the 
stimulus, but also of the population of possible 
responses. The present study aimed to discover 
which of these two theories might yield the 
more accurate deduction relative to verbal 
connotation of words referring to personal 


traits. 

Studies by Asch (1946) and Kelley (1950), 
prototypes of the experiment reported here, 
may be viewed as experiments in verbal 
connotation, if the following assumption be 
granted. Since responses in these studies were 
rarely synonymous to the stimulus words, 
impressions of persons formed from trait-terms 
involve connotation. Each subject in the 
Asch (1946) and Kelley (1950) experiments was 
given one of two short initial descriptions of 
persons, each consisting of identical lists of 
traits except for the inclusion of the word 
corp in one list, and the word warm in the 
other. Subjects formed impressions on the 
basis of these initial lists, and subsequently 
selected additional traits appropriate to the 
person described earlier. It was found that 
impressions formed under the “warm” con- 
dition differed markedly from those formed 
under the “cold” condition, and the theory was 
advanced that the differential effects of the 
traits warm and COLD were attributable to their 
being “central” traits (Asch, 1946). y 

A general explanation of the results obtained 
by Asch (1946) and Kelley (1950) can be 
provided by invoking either the mediated 
generalization theory (Osgood et al., 1957) or 
adaptation-level theory (Helson, 1947), Ac- 
cording to the generalization view, the process 
by which the initial impression influences 
subsequently chosen traits depends upon 
mediated generalization along various dimen- 
sions. Since this view implies that the response 
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tendency is a direct function of stimulus 
response similarity, it follows that the prob- 
ability that a trait will be chosen is a direct 
function of the similarity between the trait 
and the initial impression with respect to 
relevant dimensions of meaning. Furthermore, 
if the evaluative dimension (favorableness) was, 
as usual, the dominant determinant of connota- 
tion, we can simplify our discussion by infer- 
ring that the traits chosen would be similar in 
favorableness to the traits initially presented. 
If we assume that warm and cold are good and 
bad, respectively, then the warm and cold 
impressions would mediate responses similar 
to themselves with respect to favorableness. 
This interpretation of the results obtained by 
Asch (1946) and Kelley (1950) is based on the 
hypothesis that stimulus words and response 
words in this situation should have identical 
values on the relevant dimension of meaning. 
Adaptation-level theory implies additionally 
that the subsequently chosen traits should be 
affected not only by the favorableness of the 
initial impression, as above, but also by the 
favorableness of the whole set of potential re- 
sponses. In adaptation-level terms, the initial 
list functions as a background (frame of refer- 
ence), while the check list functions as the 
comparison series stimuli; each list having a net 
stimulus value equal to the average of the 
values of its component parts. According to 
this view, the average favorableness of the 
traits chosen should fall somewhere between 
that of the initial list and that of the check 
list. Accordingly, the present study deals with 
evaluative connotation as affected by the net 
favorableness of the initial list of traits (back- 
ground), and the net favorableness of the check 
list of traits from which the subjects subse- 
quently select their responses (series stimuli), 


METHOD 


Construction of the Favorableness Scale 


Seventy-eight adjectives were scaled by a method 
similar to that of Mosier (1941). Each word was typed 
on a 3” X 5” card, and judges were required to sort 
them according to a seven-category scale of favorable- 
ness of personal traits; 1 being the most favorable, 
and 7 being the least favorable. No restriction was 
placed on the number of traits assigned to each cate- 
gory. Twelve undergraduate students served as judges, 
six from the University of California at Berkeley, and 
six from San Francisco State College. Half the judges 
from each institution were men, and the other half, 
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women. The mean scale value was con 
word, and then a scale was constructed bys 
far as possible, one word for every two-tenth 
interval. The scale is shown in Table 1, Three 
words near the'ènds of the scale (indicated by: 
formed the Initial Lists (background stimuli) 
below. 


Experiment 

Subjects. Forty-eight students in the du 
tion Psychology course at San Francisco St; e C 
served as subjects. i 

Procedure. All subjects were given an Ini 
three trait-terms, and requested to form 
impression of a fictitious person. They then 
sented with the Check List (Table 1 minus th 
isked terms), and instructed to check four te 
would best describe the person, There were t 
pendent groups of subjects; one group given the 
able Initial List consisting of the words CAP; 
FORMED, and CONSIDERATE, and the other 
the Unfavorable Initial List consisting of 
MEDIOCRE, SMUG, and HARSH. These six words an 
scale values are indicated by asterisks in Table 
mean favorableness values of the three lists wen 
follows: Favorable Initial List, 1.8; Unfavorable 
List, 5.3; and Response Check List, 3.6. Word orde 
all lists was counterbalanced by presenting 
List in the above order and its reverse, and byp 
the Check List in a random order and its 
four resulting Initial Lists and the two Check 
administered in all of the eight possible comi 
with six subjects receiving each combination. A 
jects were given a two-page pamphlet which bore of 
first page the following instructions: i 
This is a test of your ability to judge 1a 
Suppose that all you knew about a person 
people say he is capable, informed, and i 
(mediocre, smug, and harsh). Now turn the 
(The second page bore the following.) Now ii 
what sort of person this must be. Look at this 
words. Then select four words from the list 
strike you as most likely to be true of the 
Here is the list. Check four words which 
would be true of the person described on th 
page. (The Check List followed.) 


RESULTS 


The median scale value of the four 
words was employed as the score for 
individual since the response distributio 
were skewed. The mean median of the 
obtained by the group receiving the Favoral 
Initial List was 1.9, while the mean medii 
obtained by the group receiving the Unf 
able Initial List was 5.1. The two disti b 
showed no overlap, and the probability 
outcome under the null hypothesis is les 
0.1%. 


Under the Favorable condition, 
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TABLE 1 
Trart-TeRM List 
Tait ge om la 
Genuine 1.2 || Quiet 3.8 
Self-controlled 1.4 | Rural 4.0 
Efficient 1.5 || Crafty 4.2 
Capable* 1.6 | Abrupt “ | 4.5 
ind 1.8 | Ordinary 4.7 
Tnformed* 1.8 || Erratic 4.9 
Considerate* 2.0 || Domineering 5.0 
Adaptive 2.0 || Mediocre* 5.0 
Warm 2.2 | Cold 5.1 
Reentye 5 š ja mainaa F E $ 
intent $ mi j; 

Zealous 3.0 Harsh* 5.7 
Forward 3.1 || Inconsistent 5.7 
Proper 3.2 || Sour 5.9 
Robust 3.4 || Offensive 6.6 

Reserved 3.6 


Note.—Asterisks indicate three additional words near the ends 
of the scale forming the Initial Lists (background stimuli). 


ference of 0.1 between the mean median re- 
sponse and the mean of the Initial List, 1.8, was 
not significant. However, under the Unfavor- 
able condition, the difference of —0.2 between 
the mean median response and the mean of 
the Initial List, 5.3, was significant beyond 
the 1% level (Wilcoxon, 1949). 

Since under both the Favorable and the 
Unfavorable conditions the central tendency of 
response fell between the means of the Initial 
and Check Lists, the data were further analyzed 
according to adaptation-level theory by com- 
puting the relative weights of the Initial and 
Check Lists as determiners of response. This 
was done by means of a modification of the 
formula developed by Helson (1947) for lifted 
weights. This formula (Formula 1) states that 
the log of the weight of the stimulus judged as 
neutral (neither heavy nor light), plus a 
correction for the stimulus interval, is equal to 
the weighted mean of the mean log of the 
series stimuli and the log of the comparison 
stimulus, with the series stimuli being weighted 
me times as much as the comparison stimu- 
us. 


log (AL + 0.75d) 
= [(32 log X;/n) + log C]/4 


This formula was modified to suit the present 
study as follows. Logarithms were eliminated 
because the words had values on a psychologi- 
cal rather than a physical scale. Hence there 
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was no need to employ the Weber-Fechner 
psychophysical relationship. The triple weight 
for the series (X;), and the unit weight for the 
comparison stimulus (C) were changed from 
constants to variables, since the calculation of 
these weights was a problem for the present 
study. In addition, the stimulus-interval 
factor (0.75d) was ignored as too refined for 
an exploratory study. Finally, the sum of the 
weights (w) was made equal to 1. The results 
of these modifications are shown in Formulae 2 
and 3. It may be noted that Formula 2 is 
identical in form to that used in physics for 
the equilibrium of parallel forces (Sears, 1946). 


AL = KEX :/n)w: 2 
+ EX ,;/n)wl/w: + w; 
witwj=1 B] 


In these equations, X; and X; refer to favor- 
ableness values of traits in the Initial and 
Check Lists, respectively; n; and n; refer to 
the number of traits in each list; and w; and 
wyj refer to the weights of the two lists. The 
adaptation-level was taken as the mean median 
response under the two experimental condi- 
tions, i.e., 1.9 for the Favorable condition and 
5.1 for the Unfavorable condition. 

The weight of the Check List under the 
Favorable condition was 0.05 while its weight 
under the Unfavorable condition was 0.16. 
The corresponding weights for the Initial 
Lists were 0.95 and 0.84. Thus the weights of 
the Initial Lists in determining the adaptation- 
level ranged from 5 to 19 times as great as that 
of the Check Lists, or conversely, the weight of 
the Check List in determining the adaptation- 
level ranged from 14 to Mg that of the Initial 
List. 


DISCUSSION 


Dimension Theory of Meaning 

The present results provide support for the 
dimension theory of meaning. Specifically, it 
was found that the favorableness of words 
chosen to match an initial list was strongly 
and reliably affected by their favorableness. 
Despite the fact that the traits were not con- 
trolled for “loading” on other dimensions such 
as activity and potency the favorableness 
dimension alone had a very substantial effect 
in determining judgments. This result is 
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consistent with the factor analytically derived 
primacy of the favorableness dimension, and 
the theory that verbal and quantitative con- 
notation are related. 

Certain other studies which also concern 
dimensions of meaning differ from the present 
one in that the responses were glandular 
secretions (Razran, 1935; Riess, 1940) or motor 
responses (Osgood, Suci, & Tannenbaum, 
1957, pp. 155-160) rather than words. In 
contrast to these studies, the present one 
concerns symbolic responses to symbols rather 
than sign responses to symbols. 


Theories of Adaptation-Level and Generalization 


One may logically adopt the dimension 
theory of meaning without necessarily accept- 
ing the theory of mediated generalization; 
adaptation-level providing a view alternative 
to the latter. With respect to the present 
experiment, the difference in implications of 
the two theories concerns the factors that 
determine the central tendency of the responses. 
While the generalization theory implies that 
the response depends only on the stimulus, 
the adaptation-level theory implies that the 
response depends both on the initial stimulus 
and the set of possible responses. Specifically, 
the latter theory implies that the average 
response should fall between the stimulus value 
and that of the average of the potential re- 
sponses. 

The present results provide conflicting evi- 
dence with respect to the more applicable 
theory. On the one hand, the average response 
under the Unfavorable condition was clearly 
consistent with adaptation-level theory since 
it was reliably different from the value of the 
relevant Initial List in the direction implied by 
the theory. On the other hand, the average 
response under the Favorable condition was 
not reliably different from the value of the 
relevant Initial List, and this outcome is 
consistent with the generalization theory. 
However, the obtained difference, though not 
significant, was in the direction implied by the 
adaptation-level theory. Taken together, these 
two essentially independent results indicate 
that both theories require consideration when 

further studies are planned in this area. 
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Comparison with Psychophysical Methods 
The similarity of the present results to those 


involving judgment of physical qualities, egoi 


weight, loudness, is impressive because of the 
radical difference in stimulus quality, and 
because the present procedures were sub- 
stantially different from those observed in 
conventional psychophysical experiments. For 
example, although the traits were scaled for 
favorableness, and the judgments were 
demonstrated to have been determined by this 
factor, subjects did not deliberately judge on 
this basis. In fact, most subjects were found 
to be unaware of it. This lack of awareness 
of the basisof judgment is in contrast to conven- 
tional psychophysical situations in which the 
subject judges stimuli with respect to an ex- 
plicitly agreed upon dimension. Also, although 
the Check List was treated as a series of com- 
parison stimuli, the subjects were not actually 
required to compare the items one by one 
with their initial impressions as in the usual psy- 


chophysical experiment. Nevertheless, judg- — 


ments obtained under these relatively loosely 
controlled conditions were analyzable by means 
of Helson’s technique, and the results were 
interpretable in terms of the adaptation-level 
theory. 


Impressions of Persons 


Favorableness of personal traits was shown 
by the present results to be an important 
determinant of behavior in the impressions of 
persons situation. Therefore, the effects of the 
words WARM and corp in previous studies can 
be explained in terms of the substantial 
differencein favorableness (warm, 2.2; cold, 5.1) 
between the two terms, rather than by the 
“central trait” theory advanced by Asch 
(1946). More specifically, a difference in 
impressions is produced by a difference in the 
favorableness of the initial trait-names. 
Following this same reasoning one can explain 
Asch’s (1946) finding that, in contrast to 
WARM and COLD, POLITE and BLUNT had little 


differential effect on traits chosen on the basis — 


of an initial impression. Assuming, as before, 
that the favorableness dimension was dominant, 
we hypothesized that the difference between 
POLITE and BLUNT with respect to the favor- 
ableness dimension is smaller than that be- 
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tween WARM and cop. In order to test this 
hypothesis, 52 subjects from the same popula- 
tion as that sampled in the main experiment 
rated 18 trait-names including WARM, COLD, 
POLITE, and BLUNT, on a seven-point scale of 
favorableness. 

The mean difference between the ratings of 
corp and warm proved to be 4.0 scale inter- 
vals, while the corresponding difference be- 
tween the ratings of POLITE and BLUNT was only 
2.5 intervals, The difference of 1.5 scale inter- 
vals between these two values was significantly 
different from zero beyond the 1% level, and it 
indicates that POLITE and BLUNT are less dif- 
ferent from each other with respect to favor- 
ableness than are WARM and corp. Therefore, 
it is reasonable to expect that there will be less 
difference between traits chosen under “polite” 
as compared to “blunt” conditions than under 
“warm” as compared to “cold” conditions. 


SUMMARY 


Twenty-four subjects were given an Initial 
List of three very favorable trait-names, while 
another 24 subjects were given a different 
Initial List consisting of three very unfavorable 
trait-names. Each subject was instructed to 
form an impression of a fictitious person, and 
then to select from a check list four other 
trait-names appropriate to the initial impres- 
sion. The Check List consisted of 25 trait- 
hames that had been scaled previously on the 
favorableness dimension. Trait-names selected 
under the Favorable condition were consistently 
and considerably more favorable than those 
Selected under the Unfavorable condition. In 
addition, under the Unfavorable condition 
the average value of selected trait-names was 
Consistently more favorable than the average 


value of the associated Unfavorable Initial 


List. Under the Favorable condition, however, 
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an analogous deviation toward unfavorableness 
was not statistically reliable. 

The relationship between verbal and quanti- 
tative connotation was discussed in relation 
to Osgood’s dimension theory of connotation, 
mediated generalization, and Helson’s adapta- 
tion-level theory. In addition, Asch’s impres- 
sions of persons situation was reinterpreted in 
terms of the dimension theory of connotation. 
Finally, it was shown that the lesser effect of 
POLITE versus BLUNT as compared to that of 
WARM versus COLD is explainable in terms of 
quantitative connotation. 
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-PSYCHOTHERAPEUTIC AND OTHER HOSPITAL 
‘ TREATMENT CRITERIA: 


THE DILEMMA 


RALPH P. FORSYTH! ann GEORGE W. FAIRWEATHER? 
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researchers have used many different 

assessment devices as criteria for the 
adjustment of the mentally ill patient. Because 
of this diversity of criteria, most studies 
evaluating different neuropsychiatric treat- 
ments are not comparable even when similar 
patient populations have been used. The most 
common indices of adjustment are hospital 
behavior rating scales, personality inventories, 
projective devices, and follow-up ratings. Two 
recent studies (Fairweather, Simon, Gebhard, 
Weingarten, Holland, Sanders, Stone, & 
Reahl, 1960; Lasky, Hover, Smith, Bostian, 
Duffendack, & Nord, 1959), in which a small 
number of diverse measures were used, have 
indicated that these measures are not related to 
one another, a finding which cautions against 
generalizing from test responses or other 
hospital behaviors to community adjustment. 
This study investigates the relationships among 
historical indices, personality inventory test 
responses, projective devices, within-hospital 
behaviors, and follow-up ratings in order to 
arrive at an empirical criterion for assessing 
patient adjustment. 


METHOD 


The data used in this study were collected at a large 
Veterans Administration hospital in order to compare 
the relative efficiency of four psychotherapeutic pro- 
grams for three different patient groups. The results of 
these comparisons were given in a monograph (Fair- 
weather et al., 1960) which Presents a detailed account 
of the design, methods of data collection, and develop- 
ment of instruments. 

In the aforementioned study, each of the 96 male 
veteran patients was tested on a variety of Psychologi- 
cal instruments during his last week of hospitalization, 


1 Now at the University of North Carolina, 

*The authors are deeply indebted to Robert C. 
Tryon for his invaluable advice and interest in all 
phases of this research. Also, we are grateful to Ralph 
Simon, Mildred E. Gebhard, Eric Weingarten, John L. 
Holland, Richard Sanders, Gideon B, Stone, and Jo- 
seph E. Reahl for their aid in collecting the data for 
this study. 


T recent years, psychotherapists and 


One-third of the patients were nonpsychotics—behavior 
disorders and neurotics; one-third were long-term 
(chronic) psychotics—mostly schizophrenics who had 
been hospitalized for more than one year; the remaining 
third were short-term (acute) psychotics—mostly schizo- 
phrenics who had been hospitalized less than one year, 

The tests administered were the Thematic Appercep- 
tion Test (TAT), Q sorts of both ideal and real self, 
the Minnesota Multiphasic Personality Inventory 
(MMPI), and the Holland Vocational Personality In- 
ventory (HVPI) (Holland, 1958). Also, ratings on both 
job and ward behavior were obtained. For those 48 pa- 
tients participating in group therapy, a group therapy 
rating was completed every 30 days (Finney, 1954). A 
record was kept of time in treatment and tranquilizing 
drugs received while in the hospital. In addition, his- 
torical information was collected for each patient in 
the study, and 23 items were selected as possible prog- 
nostic indices. Finally, 6 months after each patient's 
discharge, a questionnaire was completed by an indi- 
vidual with whom the patient was living at that time. 

Thus, the data for this study are composed of 66 
items or scales, each of which was dichotomized to re- 
flect the adaptive-maladaptive dimensions of the vari- 
able. The purpose is to determine the relationships 
among these 66 frequently used clinical variables in 
order to arrive at a meaningful set of criteria for evalu- j 
ating patient adjustment. 


RESULTS 


A 66 X 66 tetrachoric correlational matrix 
was computed and a cumulative communality — 
(CC) cluster analysis performed (Tryon, 
1958). Factoring by the key cluster method 
Was accomplished on an electronic computer. 
Seven orthogonal dimensions were found. 
Seven oblique clusters were located on these 
dimensions. In order to discover the relation- 
ship between each variable and the seven 
cluster domains, its unaugmented oblique 
loadings with the domains were computed 
(Tryon, 1958). Variables with unaugmented 
oblique loadings of .50 or over with the clusters 
are presented in Table 1. 

Cluster 1 is the MMPI domain of self- 
evaluation; it includes all the scored MMPI 
scales (except L, K, Pd, M; "f, Ma, and Welsh R), 


* Factoring was terminated in accordance with the 
objective criteria set forth by Tryon (1958). 
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TABLE 1 
THE SEVEN CLUSTERS 
Cluster d tiye MMPI-CS* < 98)» || Cluster 4 
a. Adaptiv 5 -98)* 
b. Low MMPI-F EA a. Maladaptive follow-up drinking .86) 
c. Low MMPI-Pi es b. High I 9 (Mf-masculinity) 17) 
d. Low AME es o Cae) c. Nonpsychotic diagnosis upon admis- -64) 
e, Low -Pa : sion 
f Low MMPL-Sc Y a d. Low HVPI 7 (Control) (.60) 
£ Low MMPI-Welsh A¢ : as: e. Few total weeks of previous hospital- (.57) 
h, EN Bose’ sort correlation ER t ree ee ; ise 
i, Low x . à ive Finney scale score , 
j. High MMPI-Es 175) Adaptive ward behavior (155) 
k, Low MMPI-Hy -13 E Low HVPI 11 (Infrequency) 53. 
l. Low MMPI-Hs 64 i. Low MMPI-Welsh R! 51 
m, Few previous arrests (51 Custer Lie (.83) 
Cluster 2 a. service . 
a, High HVPI 5 (Verbal Activity) 94 b. High gain in service rank ? R 
b. High HVPI 12 (Responds like others) 91 c. Low HVPI 11 (Infrequency) 63 
c. High HVPI 4 (Conformity) 87 d. More education completed ER 
d. High HVPI 6 (Emotional) (.87 e. Few service courts-martial Wee 
È High HYPI 3 (Responsiiey) ‘ey [ices p ri 
. esponsil . 
g High HVPI 9 (Mf-masculinit ) a) cs Followup EEN ees 8 
h. High HVPI 2 (Intellectuality, f . Follow-up, well most of time é 
i. High HVPI 1 (Physical Activity) (a c. Follow-up, was not rehospitalized 2 
ate High HVPI 10 (Status) (.53) ee eae Finney scale score 3 
luster luster 
a, Older age at first hospitalization (.96) a. Few previous arrests ae 
b. Older onset age k eR, b. Few jobs held in Dee et 
c. No service connected pension RA c. Few service courts-martial | hospital t $0} 
d. Nonpsychotic diagnosis (upon ad- -60) d. Maladaptive job behavior in hospit š 
mission) 
e. Few total weeks of previous hospital- (.57) 
ization 
f. Older age upon admission (.52) 
g. Fewer number of previous hospital- (.51) 
izations 


^ Unaugmented cluster loadings in parentheses. 


b In addition to the usual MMPI scales, configural interpretations o! 
were asked to place each test 


John L. Holland, experts on each of these tests, 


the expected follow-up adjustment of each patient. These two sci 


(MMPI-Cs) and, similarly, HVPI-CS. 


© SIM is the MMPI scale of Inner Maladjustment (Simon, 1952). 


À MMPI Welsh A and R refer to Welsh’s (1956) Anxiety and 


the Q sort correlation between ideal and real 
self and number of arrests prior to hospitali- 
zation. All the measures in Cluster 1 reflect 
the adaptive ends of their scales; ie., if a 
Patient scores adaptively on one MMPI 
Scale he tends to score adaptively on all of 
them. Conversely, of course, the same is true 
for maladaptive scores. 

Cluster 2 is a domain of vocational prefer- 
€nces, the HVPI. All the HVPI scales except 
HVPI 7 (Control), 11 (Infrequency), and the 
HVPI-CS have high loadings. As in Cluster 1, 
Subjects who respond adaptively on one HVPI 


‘These correlations were computed for each patient 
and then dichotomized at the median. Those correla- 
tions higher than average were considered adaptive, 

‘ose lower, maladaptive. 


f the MMPI and HVPI were obtained. George S. Welsh and 
profile into an adaptive or maladaptive group in terms of 


ales were designated, respectively, as an MMPI Configural Scoring 


Repression scales. 


scale tend to score adaptively on all HVPI 
scales, Holland (1958) discusses the personality 
correlates of this test. Wish oat 
Cluster 3, containing seven historical items, 
refers to the neurotic-psychotic dimension. It 
seems to differentiate ee proa who us 
ntly hospitalized with neurotic or be- 
E Sordid from those hospitalized with 
psychotic reactions. These older, nonpsychotic 
veterans had fewer previous hospitalizations, 
fewer total weeks of hospitalization in the past 
and, of course, were older at time of onset of 
illness and when first hospitalized. They also 
tended not to receive a service connected 
pension since their breakdown usually did not 
occur in service. This appears to be a non- 
psychotic vs. chronic-psychotic cluster. 
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Cluster 4 seems to be a psychopathic index. 
These patients, also generally nonpsychotic, 
adjust well on the hospital ward, interact well 
in group psychotherapy (Finney scale), and 
have rarely been hospitalized in the past. They 
incline to alcoholism when leaving the hospital. 
The psychological test scales in this cluster 
indicate that patients high on this cluster chose 
more masculine occupations (HVPI 9-Mf), 
were “impulsive and undercontrolled” (HVPI 
7), chose the more usual occupations (HVPI 
11-Infrequency), and were low in repression” 
(MMPI Welsh R). 

Cluster 5 seems to be an achievement do- 


Their lower score on HVPI 11 (Infrequency) 
indicates that these patients selected the 
more usual occupational choices on this 
inventory. 

Cluster 6 refers to the domain of posthospital 


sample hospital job 
maladjustment. High scores describe indi 
viduals who have historically conformed to 


are so high that there is little difference between 
the intercorrelations of the cluster domains 
and cluster scores.* Table 2 shows the relative 


* The values in parentheses are approximations of the 


Ratrn P, Forsyra AND GEORGE W, FAIRWEATHER 


independence of the clusters; only 

cluster domain intercorrelations reach , 
Both the adaptive HVPI cluster (2) and W 
“nonpsychvtic” cluster (3) correlate in th 
magnitude with the “psychopathic” cluste 
(4). The adaptive MMPI cluster (1) and thi 
historically adaptive cluster (7) are similar 
related. Three other cluster intercorrelation 
reached a magnitude of .26 or .27. Both th 
psychopathic (4) and historically adaptive (7) 
dimensions are negatively related to 
achievement factor (5). Also, the adaptive 
MMPI cluster (1) and the adaptive follow-u 
cluster (6) are positively related to a moderat 


legree. 

Twenty of the 66 variables were not includ 
in any of the seven clusters. Four were tes 
variables (MMPI L and Mf, HVPI CS, an 
the TAT). Seven were historical items (raa 
received tranquilizing drugs in previou 
hospitalizations, reared by both parents, p 
centage of time gainfully employed in last 
years, weekly salary of last job, type of di 
charge, marital status). Two hospital variable 
(time in treatment, and received tranquilizing 
drugs during current hospitalization) and 
follow-up items (has some, or no close friends 
and frequently or rarely talks to people w 
whom he lives) were also not included in any 
cluster. 

Seven of the scales were in two of the sevél 
clusters. These were: number of weeks 0 
previous hospitalization (Clusters 3 and 
number of arrests (1 and 7); number of service 
courts-martial (5 and 7); type-of admission 
diagnosis (3 and 4); group psychotherapy 
scale (4 and 6); HVPI 9-Mf (2 and 4); an 
HVPI 11~Infrequency (4 and 5). 

Table 3 presents the original tetrachorlt 
correlations between the five within-hospil 
treatment variables and the six follow-u 
items. High scores for each variable are adap 
tive so that a positive correlation indica 
product-moment correlations between cluster series b 
cause the tetrachoric correlations used in the anal; 


mations should be fairly accurate, however, since t 
of most variables was of an observed co 

tinuous distribution. 
* Eight selected TAT cards (1, 2, 4, 9, SMF, 7B 
BM, and 12M) were given to each patient and be 


w1 
TABLE 2 
INTERCORRELATIONS BETWEEN THE Custer Douars (Oblique Factors) 
Panir Followup | Contormity 
ca A ie | saaa 
12 (10) 203 (.09) 26 (35) ee) 
«07 (.06) «32 (30) 2 (a) 8 Gat) 
1.00 (.90) At (.29) = (=.09) | o (07) 
+3129) | 4,00 (.95) CEEI) «94 09) 
=:9 (=.18) | —=.27 (=.25) HT (AB) | =a C8) 
04 (—.03) | 07 (07) 1.00 (.96) Mt ( 
09 (.07) 4 (05) rw | o Ca 
Dowary Varrorres axo Couwoxarrmies 


Communality of cluster scores 
Domain validity of cluster score 


* Intercorrelations between cluster scores in parentheses, 


TABLE 3 


‘Within hospital 


rugs (on) 
Time in treat- 
ment (long) 


Note.—Enclosed area contains Cluster 6. 


cement between theoretical levels of 
tation. Table 3 points out the general 
of agreement between the within-hospital 
follow-up measures. Only the Finney 
Bos) scale, of the within-hospital measures, 
vas related with any magnitude to more than 
me follow-up measure. 
€ follow-up items tended to be relatively 


‘TETRACHORIC CORRELATION MATRIX OF Hosprrat AND Fouow-ur Imus 


-4 

=a 

8 

‘H 

-%4 

E 

u 

E 

than might be expected. 

py age Of 
people and problem drinking page 
OT A DOET main ite ceil 
ADtlatioal fats one finda thst the follow- 
up drinking item was related much more 
frequently to historical and psychological test 
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scales than any other follow-up item. These 
were: .49 with few previous weeks hospitalized, 
49 with past drinking problem, .62 with non- 
psychotic diagnosis, —.51 with MMPI L, 
—.44 with MMPI K, .67 with MMPI Ma, 
and —.58 with HVPI 9 (Mf). 


Discussion 


This cluster analysis of 66 indices of patient 
adjustment emphasizes the relative inde- 
pendence of frequently used criterion measures. 
For the hospitalized patient population used 
in this study, the common criteria of ward and 
job behavior ratings, usually given highest 
weight in hospital discharge policy, are 
relatively unrelated to any follow-up area of 
adjustment. Further, prognostic indices such 
as past achievement, projective devices, and 
personality inventories are generally not 
related to community adjustment. Follow-up 
items themselves show a wide range of cor- 
relations, i.e., follow-up full time employment 
correlates .66 with remaining out of the hospital, 
but only .12 with number of friends. 

In view of the heterogeneity of criteria, it 
appears that studies attempting to evaluate 
hospital treatments can only be generalized 
whenacommon criterion is chosen. Inasmuch as 
hospitalization for mental illness, whether 
voluntary or committed, represents a failure 
of the self-maintenance system, it seems only 
logical that an evaluation of hospital treatments 
should be made in the areas of community 
living representing this failure. An approxima- 
tion to such a criterion is found in the follow- 
up cluster which involves simple adjustmental 
indices of rehospitalization, work, and an 
estimate of the former patient’s current degree 
of illness. 

While the follow-up cluster would be valu- 
able in evaluating hospital treatments, it would 
seem to have little merit for assessing out- 
patient treatments. Hollingshead and Redlich 
(1958) found that most analysands are mem- 
bers of higher socioeconomic groups and hence 
would be out of the hospital, almost universally 
employed, and characterized by a limited 
range pathology. Further, outpatients fre- 
quently enter treatment voluntarily because of 
subjective discomfort rather than because of 
any readily observable breakdown of the self- 
maintenance system. Thus, it appears that a 
social criterion for the evaluation of outpatient 
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logically meaningless. Moreover, by the 
nature of many outpatient populations, sub: 
jective evaluations are essential. It s 
clear that a universal criterion for evalua 
treatment. success is nonexistent and si 
tional criteria, at least within or outs 
hospitals, are necessary. W, 

The only hospital measure found ini 
follow-up cluster is the Finney (1954) 
Therapy Rating. The high loading of 
scale with follow-up items may be accouni 
for by two independent but equally impo 
variables. First, the scale itself comprises item 
related to readily observable behaviors in 
emotionally loaded interpersonal situatio 
This is in direct contrast to ward and 
behavior ratings, which usually involve 
formity behavior with less emotional invol 
ment. Second, the therapy rating scale v 
completed every 30 days, and the rating 
in this analysis was the one prior to dischi 

Figure 1 compares all patients who recet 
group therapy when divided into the mosi 
least adaptive follow-up groups. This 
demonstrates that follow-up adjustment 
most accurately predicted by the last o 
series of observations in the therapeutic sit 
tion. It also suggests that there may be a point 
in time when therapy should be terminated, if 
progress in interpersonal behavior has no 
occurred. In Figure 1, the most adaptive 
follow-up group becomes significantly 
ferentiated from the least adaptive at 120d 
(¢ of 3.26 with 25 df, p < .01). 

It is not surprising, then, that ratings made 
once or twice during the patients’ hospitaliaa 
tion are not highly related to follow-up. Furt 
the therapy rating loading on the follow 
cluster suggests that social-psychological n 
dices of behavior in close interpersonal contexts 
within the hospital is the area of measuremens 
most deserving of further research and expan 
sion. 

Future studies must also take cognizance 
the significantly poorer follow-up adjustm 
of the long-term psychotics when com 
with short-term psychotics and nonpsychot 
groups. A previous study (Fairweather et a 
1960) clearly showed that schizophrenl 
cannot be compared without the amount 
patients’ previous hospitalization being C0” 
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NEY SCAL 


Nf FINI 


60 90 120 150 
DAYS IN THERAPY 


Fic. 1. Group therapy ratings for patients who 
demonstrated most and least adaptive community ad- 
Justment, (All nine follow-up items were dichotomized 
into adaptive and maladaptive alternatives, with adap- 
tive alternatives receiving a score of one and mala- 
daptive zero, The community adjustment score for a 
patient was the sum of the nine items. The distribution 
of community adjustment scores was dichotomized at 
the median and those above the median were labeled 
most adaptive and below the median least adaptive.) 
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trolled because short-term psychotics demon- 
Strate significantly more adaptive follow-up 
than long-term psychotics, irrespective of 
treatment. It may be that norms for each group 
will have to be independently established. 

The three diagnostic groups (nonspychotics, 
Short-term, and long-term psychotics) may 
also be compared on the seven clusters found 
ìn this study. Figure 2 shows that the acute 
Psychotics scored much more adaptively than 
the other groups on the MMPI cluster (1), 
the follow-up cluster (6), and the historically 
adaptive cluster (7), while looking slightly 
better on the HVPI cluster (2). The non- 
Psychotic group, of course, has high scores on 
the nonpsychotic cluster (3), the psychopathic 
(4), and achievement (5) clusters. The chronic 
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p} 
IMMPU IMVU NEUROTIC- PSYCHO ACHEVE- FOLLOW- CON- 
ne VOCA RCM SORES MENT UF FORTY 


CLUSTERS 


Fic, 2. Comparison of the three diagnostic groups 
on the seven clusters. (The ordinate, Per Cent Re- 
sponses in Clusters, is the ratio of hits in a cluster to 
total number of possible hits. Because each variable is 
dichotomized, only’.5 of any distribution can be in a 
cluster. Thus, on the MMPI cluster with 13 scales and 
32 patients in a diagnostic group, there are 13 X 32 
or 416 possible hits. Figure 2 shows that the Ps group 
hit 283 of the possible 416 or 68%, while the P] group 
hit 162 of the 416 or 39%.) 


psychotics were least adaptive on the test 
clusters (1 and 2) and the follow-up cluster 
(6); they were also very low on Clusters 3 and 
4, which is indicative of their chronicity and 
lack of psychopathy. 

As previously mentioned, the two test 
clusters are relatively orthogonal to the fol- 
low-up cluster. The number of highly interre- 
lated scales in Clusters 1 and 2 suggest that 
the major portion of the variance of both tests 
may be explained by one general variable which, 
in this case, is unrelated to the to-be-predicted 
follow-up response. In this regard, the MMPI 
configural scoring was the best single measure 
of the MMPI factor, but had little or no 
relationship to follow-up. Thus, the MMPI 
configural scoring predicted the rest of the 
MMPI scales well but had little association 
with hospital behavior or follow-up. This 
finding questions the value of profile analysis 
over single scale scores. The very high loading 
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of the Q sort correlation in Cluster 1 suggests 
that self-evaluation items are common to these 
two forms of measurement. 

The fact that much-quoted and frequently- 
used tests such as the TAT failed to appear in 
any cluster demonstrates the problem of 
relating fantasy and self-evaluation reports 
to each other and to complex behaviors. It 
seems that much higher correlations need to be 
demonstrated before psychologists use such 
tests as decision making instruments where 
complex behaviors are the predicted response. 

The lack of relationship between length of 
time in treatment and follow-up corroborates a 
recent study by Lasky et al. (1959). Similarly 
the use of tranquilizing drugs seems to have 
little relation to posthospital adjustment. 


SUMMARY 


Sixty-six frequently used clinical variables, 
including historical data, personality in- 
ventories, projective devices, hospital be- 
haviors, and follow-up ratings, were collected 
from 96 male NP patients from a large Veterans 
Administration hospital. These variables were 
intercorrelated and cluster analyzed. Seven 
clusters accounted for all but negligible vari- 
ance. One of these clusters, which included 
three follow-up items, was proposed as a rela- 
tively meaningful criterion which could be 
used to evaluate psychotherapeutic and other 
hospital treatments for this type of hospitalized 
population. The lack of relationship between 
psychological tests and follow-up was discus- 
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sed. The importance of measuring inter- 

personal interactions within the hospital was 

suggested as a meaningful avenue of research 
because of the promising prognostic results 
obtained with the Finney (1954) group psycho- 
therapy scale. 
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INTRAJUDGE CONSISTENCY IN RATINGS OF 
HETEROGENEOUS PERSONS? 


CECILY F. GROSS 
Teachers College, Columbia University 


scores concerning social perception, Cron- 

bach (1955) noted that observer bias 
merits attention beyond its role as a source of 
constant error. He suggested that the mean 
and standard deviation of the ratings by a 
judge (J) of a number of other persons (Qs) 
constitute an implicit personality theory. Al- 
though research on specific determinants of 
impression formation has demonstrated the 
relevance of a number of attitudes and cog- 
nitive modes of the judge, attributes of the 
stimulus, and characteristics of the social 
interaction between them, evidence for a 
reliable response set with generality over 
heterogeneous others has not been clear-cut. 

Unmeasured specific biases of J toward O 
seem likely to have increased the consistency 
of judgment reported in some studies. In the 
research of Crow and Hammond (1957), for 
example, where the Os were hospital patients 
and the Js medical students, a factor may have 
been the Js’ attitudes related to their role vis- 
a-vis patients. Likewise, in Alfert’s (1958) 
study, consistency may have been inflated by 
the fact that, although unknown to one another 
at the time, the Js were nonetheless rating 
their own classmates. 

More satisfactory evidence of individual 
consistency of response toward heterogeneous 
other persons has been shown in studies by 
Beier, Smock, Izard, and Tougas (1957) and 
Izard (1959) in which stimulus photographs 
were selected from a pool of over 1,000 taken 
of males and females of widely differing ages. 
Yet a set of just as carefully selected photos 


1This paper is based on a dissertation submitted in 
partial fulfillment of the requirement for the degree of 
Doctor of Philosophy at Teachers College, Columbia 
University, 1960, The writer wishes to express gratitude 
to the members of the Dissertation Committee, Robert 
L. Thorndike and Edward J. Shoben, Jr., and particu- 
larly to thank the Chairman, Joel R. Davitz, for his 
stimulating encouragement. Deep appreciation is also 
extended to Rosedith Sitgreaves for her generous help. 
. The graciousness of H. Schmierer of Pratt Institute 
1s gratefully acknowledged. 


Į an analysis of components of accuracy 


led to inconclusive results in a study by Camp- 
bell and Burwen (1956). Although these 
investigators found highly significant between 
judge differences within a group of WAFs, 
they found no significant respondent effects 
within any of their three groups of male Air 
Force personnel. It also seems noteworthy 
that, in contrast to their apparent difficulty 
in demonstrating consistency of response to 
heterogeneous others on the part of three of 
the four groups studied, Campbell and Burwen 
obtained impressively high reliability co- 
efficients for the ratings of specific age and sex 
subgroups of Os, despite proportionately 
small Ws. Perplexingly, the male subjects in a 
study by DeSoto, Kuethe, and Wunderlich 
(1960), responding to the same stimulus pic- 
tures used by Campbell and Burwen, showed 
marked intrajudge consistency. 

The second parameter of Cronbach’s (1955) 
“implicit personality theory,” the standard 
deviation of a judge’s responses to a large 
number of heterogeneous others, has been the 
topic of very little research other than the 
previously mentioned study by Crow and 
Hammond (1957) in which reliable intrajudge 
variability was found. The question whether 
judgmental consistency is general across 
descriptive dimensions has also received little 
attention. 

The purpose of the present study was to 
investigate the existence and relative power of 
a unitary attitude toward heterogeneous 
others within and across descriptive dimen- 
sions, employing Cronbach’s (1955) operational 
definition. The major hypothesis was that on 
any given descriptive dimension, individual 
judges tend to be consistent in their ratings of 
a group of stimulus persons. Since individual 
differences in intrajudge consistency can be 
assumed to obtain across all traits being 
judged, the second hypothesis was that the 
tendency to consistency on one trait is posi- 
tively related to consistency on all. Subordinate 
hypotheses were that no differences would be 
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found related to the sex of the judge nor the 
sex of the stimulus person. 


METHOD 


Judges 

The subjects serving as judges were 35 female and 27 
male freshmen and sophomores enrolled in a general 
psychology course at Pratt Institute. The average age 
of the women was 18.7 and that of the men 22.4 years. 
The statistically significant 4-year difference is ac- 
counted for by the previous military experience of some 
of the men, Of the women, 24 were majors in art teacher 
education, the remainder being majors in fashion, in- 
terior design, and art. Of the men, 19 were architecture 
students, with the rest distributed among art teacher 
education, industrial engineering, mechanical engineer- 
ing, and fashion design. 


Procedure 


The design required the 62 Js to view and rate a 
number of Os on various scales. The Js were shown a 
series of 30 30-second motion pictures in each of which 
a single stimulus person was seen in a pattern of stand- 
ardized, relatively neutral activity. The sequences, 
taken outdoors, showed a full view of the O walking a 
distance of approximately 10 feet to a bench and then 
sitting down. During the last 10 seconds of film, the 
camera panned in for a close-up. 

The 30 stimulus persons (Os) ranged in age from 
the twenties to the forties, five persons of each sex fall- 
ing in each of the three decades. Because the purpose of 
the experiment was to investigate the generality of antic- 
ipatory set, heterogeneity among the superficial char- 
acteristics of the Os was essential. However, the selec- 
tion of Os was restricted to Caucasians, and their dress 
to street clothes in order to avoid contamination by 
racial or socioeconomic attitudes. The order in which 
the Os appeared on the screen was randomly deter- 
mined, except that no more than two Os of the same sex 
were shown in succession. 

The rating scales upon which the Js’ impressions 
were recorded were derived from the Guilford-Zimmer- 
man Temperament Survey. The seven factors on which 
the scales were based were general activity, rhathymia, 
ascendance, sociability, emotional stability, mascu- 
linity-femininity, and friendliness. These traits were 
chosen for their importance and relative independence 
as dimensions of personality according to Guilford’s 
(1949) research. Furthermore, since they occur com- 

monly in personality description, their use was pre- 
sumed to have the advantage of appearing plausible to 
the Js. A brief group of synonyms and behavioral state- 
ments was used to define, illustrate, and anchor each 
end of the seven bipolar scales. 

In order to prevent the Js from seeing their previous 
ratings, the rating scales were presented in booklet 
form, with a set of seven scales for the rating of one O 
to each page. An attempt was made to eliminate order 
and context effects by varying the order of presentation 
and the left to right position of each scale from page to 


page. 
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The data were collected at a single meeting of the 
class. The following instructions, reproduced on the 
cover of the booklets, were read aloud: 

Whenever we see a stranger for the first time, we 
get an idea about what kind of person he is, In a few 
minutes we are going to show you movies of a num- 
ber of men and women, one at a time. We are inter- 
ested in finding out how they impress you. We'd like 
you to tél us your hunches about what each person 
is really like by rating him on certain traits. 

Each one of the pages in this booklet is to be used 
to describe one of the persons you will see in the 
movie. On each page you will find seven pairs of 
traits, along with a matching set of seven rating 
scales. The scales run from the extreme of each trait 
all the way to its opposite. After you see each person 
on the screen, rate him on ALL SEVEN of the rating 
scales on the page. Do this for each scale by putting 
an “x” in the middle of the one box that fits your im- 
pression best. 

You will have only 30 seconds to look at each per- 
son, and then the projector will be stopped for you 
to make your ratings. So don’t take time to puzzle 
over your ratings or worry if you are not too sure. 
Your snap judgment is exactly what we want! 

Supplementary informal verbal instructions were de- 

signed to encourage frankness and to reduce the Js’ 
tendency to give the socially acceptable response. So 
that tendencies to evasiveness might also be minimized, 
the Js were asked to try to avoid using the midway 
point of the scales unless it seemed precisely to fit their 
impression of the O. After a practice trial to be certain 
that the Js understood the use of the scales, the films 
were run, the lights being turned on between each sec- 
tion of film to permit the Js to record their ratings of the 
O just viewed. 

The statistic used as a coefficient of consistency was 
the intraclass correlation (r’) (Haggard, 1958) obtained 
by analysis of variance of repeated measures on sub- 
jects who thus serve as their own controls. The signifi- 
cance level of this coefficient is determined by the F 
test. Although the significance of the difference between 
two r”s can be estimated only by examining the degree 
of overlap between the confidence intervals of a pait of 
r’s, the frequency with which differences fall in a pre- 
dicted direction is amenable to the nonparametric sign 
test (Siegel, 1956). 


RESULTS 


Intradimensional Consistency 


Testing Hypothesis I, that individual 
judges tend to consistency of response toward 
all Os within each rating dimension, involv 
separate analyses of variance of the J: s’ ratings 
on each scale. Statistically significant between 
judge differences at better than the .01 level ap- 
peared for all seven of the scales. The Between 
Judge Mean Square (BJMS) for each scale was 
then used to obtain an intraclass correlation a$ 
the summary index of consistency of 


INTRAJUDGE CONSISTENCY 


TABLE 1 


COEFFICIENTS OF CONSISTENCY FOR 62 Js JuDGING 
30 Os on Eacu oF SEVEN Scales (eè = WMS) 


Scale Single Item Overall 
I .049 61 
IL 057 64 
I O11 2.24 
IV -032 49 
v 082 13 
VI 092 75 
vit 036 53 
All Scales 162 57 

TABLE 2 


COMPONENTS OF VARIANCE FOR 62 Js’ JUDGMENTS 
on EACH OF SEVEN SCALES 


Judges Objects Error 
Scale area Total 
Variance | % |Variance | % | Variance | % 
E 615 5 55 |21| 1.91 | 74| 2.6 
Il} .16 6] .23 8| 2.30 | 86 | 2.7 
UL | .05 1 71 |24| 2.20 | 75} 2.9 
Iv| .11 4 78 |28| 1.90 | 68 | 2.8 
Vij .22 8| .39 | 15] 1.93 | 77] 2.5 
VI} .24 | 10 28 |11| 1.98 | 79) 2.5 
VIE | .09 4 45 |21| 1.59 | 75 | 2.1 


Js’ responses on that particular trait. Since this 
study was concerned essentially with intra- 
judge consistency as compared with all other 
sources of variation, known or unknown, the 
appropriate error term in computing r’ was 
the Within Mean Square (WMS) obtained 
after the Between Judge Sum of Squares was 
subtracted from the Total Sum of Squares. 
In Table 1 are the reliability coefficients of the 
seven scales, both for ratings of a single O and 
for the total of 30 ratings. 

Tt can be seen that if the series of stimulus 
persons is regarded as a test, the reliability of 
a single item (0) on any one scale is practically 
nil; while the reliabilities for whole “tests” 
of 30 items per scale range from .24 to .75. 

Because the between objects differences in 
these seven analyses of variance were highly 
significant, r’’s were also computed in which the 
the error terms were the Residual Mean 
Squares (RMS) remaining after Sums of 
Squares for Objects and Judges both were 
removed. For all scales, the RMS coefficients 
were larger than those based on the WMS. 
The increment in the intraclass correlations 
following the removal of object effects can be 
attributed to the fact that the BOMS ranged 
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from 2 to 12 times as large as the BJMS. The 
relative slightness of the increments, despite 
the magnitude of these ratios, probably can 
be accounted for by the large proportion of 
error variance appearing in the analysis of the 
components of variance for each scale shown 
in Table 2. So great a proportion of error 
variance is only to be expected in a design 
providing insufficient information for veridical 
judgment. However, the finding that the 
stimuli, despite their seeming scantiness, 
contributed from 8% to 28% of the total 
variance seems surprising indeed in the context 
of expectancy theory. In sharp contrast, the 
proportion of variance attributable to observer 
bias is miniscule, although the research method 
had been selected because of the likelihood 
that it would maximize judge factors. 

In summary, the data show the presence of 
intrajudge consistency of response within each 
scale. However, stimulus properties appear to 
be at least equal in importance to judge factors, 
if not strikingly more important, in determin- 
ing the ratings of heterogeneous others, even 
when minimal information is given upon 
which ratings can be based. 


Cross-Dimensional Consistency 

Hypothesis II, that the tendency to intra- 
dimensional consistency on one scale is posi- 
tively related to intradimensional consistency 
on all, was tested separately for the means and 
the variabilities of ratings made by the Js 
over all Os on all scales. 

Consistency of means across scales. An analysis 
of variance was done in which each J’s unit 
scores were the means of his 30 ratings on 
each of the seven scales. The BJMS was then 
used to compute the r”s of .16 for a single 
scale and .57 over all scales which appear for 
“All Scales” in Table 1. These r”s, similar to 
those obtained for the individual scales, reveal 
only slight reliability for a single mean, but 
fair consistency across the total of seven means. 

Further support for the hypothesis that 
consistency of means is stable across descrip- 
tive dimensions was found ina comparison of the 
confidence intervals of the seven intrascale 
coefficients of consistency. When the number 
of r’’s which failed to fall within the confidence 
interval of each coefficient was summed, it was 
found that in the case of none of the seven 
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TABLE 3 
INTRACLASS CONSISTENCY COEFFICIENTS AND 
CONFIDENCE INTERVALS FOR MEANS AND 
VARIABILITY INDICES ACROSS SCALES 


ll ingle Scali nena 

Overa! Single Scale (Sinele Se le) 
Means 57 16 -10-.32 
Variability 91 58 -45-.69 


scales did that number exceed the frequency 
that might have occurred by chance. The over- 
lap among the confidence intervalsattests to the 
fact that the seven r’’s based on between 
judge differences for each scale are of approxi- 
mately the same order of magnitude and, 
consequently, that consistency of mean 
response tends to be relatively constant across 
scales for the group of Js as a whole. 

Consistency of variability across scales. Seven 
variability scores, one for each scale, were 
computed for each judge, representing his 
residual variability after the mean of his 
judgments of all Os, plus the mean group 
judgment of each O, respectively, plus the 
grandmean, wereall partialed out of his ratings. 
The variability scores for the 62 Js were then 
ranked separately for each scale. An analysis 
of variance by ranks indicated highly signifi- 
cant between judge differences. The computed 
rs were .58 for a single scale and .91 over all 
scales, The contrast between these r”’s and 
the modest r’’s for the consistency of the 
means is shown in Table 3, where the wide 
gap in the confidence intervals also can be 
seen. It will be noted that the r’ for single 
scale variability is approximately equal to 
the r’ for the total of seven means. 


Sex Differences in Consistency 


The investigation of the hypothesized “no 
difference” in consistency between the various 
subgroups of Js and Os consisted of applying 
the sign test to the frequencies with which 
the r’ of one group exceeded that of the other 
on each of the seven scales. Since the widest 
split found between groups for any of the 
eight comparisons was 5:2, with a chance 
probability of .45, no significant trend seemed 
to be indicated. Thus no difference in con- 
sistency appears to distinguish the ratings of 
men Js from those of women, regardless of the 
sex of the O; neither do there appear to be 


differences in the consistency with which male 
and female persons are rated, regardless of the 
sex of the J. 


> 


Discussion 


Although marked individual differences in 
variability indicate that the data do not 
perfectly fit the assumptions of a parametric 
test, the results of this study, in conformity 
with those of Beier et al. (1957), Crow and 
Hammond (1957), and Izard (1959), reveal the 
presence of intrajudge consistency in impres- 
sions of heterogeneous persons. 

However, the relatively slight power of ob- 
server bias is shown by the analysis of com- 
ponents of variance in which the proportion 
of variance found to be contributed by between 
judge differences is in line with the 3% found 
by DeSoto, Kuethe, and Wunderlich (1960). 
The frailty of individual bias is further paral- 
leled in the findings of Campbell and Burwen 
(1956), whose reliability coefficients are of 
approximately the same order of magnitude as 
the intraclass correlations in the present study. 
Although the results do not explain the non- 
significance of respondent effects in three of 
Campbell and Burwen’s four groups, they do 
Suggest that, since the contribution of a general 
response set is small, any gross measure of 
this variable would be fragile, its appearance or 
honappearance depending largely upon other 
factors accounting for more of the total vari- 
ance. Incontrast tothe small influence of general 
Tesponse sets, the strength of specific attitudes 
is indicated by evidence such as Campbell and 
Burwen’s finding of higher reliability coeffi- 
cients for judgments of homogeneous as 
compared with heterogeneous groups of Os, 
and the positive findings of the numerous 
studies investigating specific determinants of 
social perception (e.g., Cohen, 1956; Scodel & 
Austrin, 1957; and Secord, 1958). It seems 
likely that the generalized O represents too 
high a degree of abstraction and that gen- 
eral response set is organized at too tentative 
a level of integration to exert a consistent in- 
fluence upon behavior, 

‘ The highly significant stimulus effects found 
in this study have also been noted in previous 
research, e.g., by Beier et al. (1957), Campbell 
and Burwen (1956), and, most outstandingly 
of all, by Izard (1959), despite his care in 
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maximizing stimulus ambiguity. What can be 
the nature of this stimulus effect? Though 
no criterion data are available, it is improbable 
that the judgments had much validity. It 
seems much more likely that cultural similari- 
ties in the experience of the Js resulted in 
their drawing upon commonly held stereotypes. 
Indeed, the relative proportion of stimulus to 
respondent effects shown in the analysis of 
components of variance shows that, at least 
under conditions of low to moderate ego in- 
volvement, the Js relied more upon these 
commonly held sets than upon unique ones. 
The ratio of stimulus to respondent effects 
may possibly have been exaggerated by homo- 
geneity among the Js, resulting not only in 
reduced r’’s but also in increased agreement 
upon the ratings of the stimulus persons. Still, 
while heterogeneity of age, education, social 
class, etc. would shrink interjudge agreement, 
any resultant reduction in stimulus effects 
would be likely to accrue not to the judge 
factor, but to random variation. These results 
seem in conflict with the common assumption 
that the presentation of ambiguous stimuli 
evokes largely autistic elements in perception. 
They thus accentuate the importance of 
normative data in the interpretation of 
projective tests. 

Although modest but statistically significant 
intrajudge consistency of means was found 
across scales, even larger between scale dif- 
ferences indicate that the group of Js tended to 
use the scales in the same way. Scale effects 
probably cannot be attributed solely to social 
desirability since the correlations found be- 
tween the rank order of the scale means and 
Cowen, Budin, and Budin’s (in press) norma- 
tive values for “Personal” and “Social” 
desirability response sets toward the scale 
anchor terms were nonsignificant. (V = 6, no 
norms being available for the Masculine- 
Feminine vs. Effeminate-Mannish scale.) 
Another possible source of between scale dif- 
ferences may have been the intrajudge con- 
sistency of covariance patterns which Cron- 
bach (1955) proposed as the third parameter of 
an “implicit personality theory.” 

The high reliability of individual variability 
appears to call for considerable further in- 
vestigation. Although the phenomenon may be 
a manifestation of personal style in the use of 
the instrument rather than of the J’s char- 
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acteristic range of social judgments, Broen and 
Wirt’s (1958) correlation of .03 between two 
measures of the tendency to spread ratings on 
rating scales indicates that style of responding 
on rating scales, per se, is unlikely to have been 
a major factor in the consistency of variability 
scores. From the greater stability of these 
indices compared with the mean ratings, one 
might further speculate whether impulsivity, 
or expansiveness in judgments of others, may 
not be a more salient response set than favor- 
ability. 

In sum, these findings cast doubt on theo- 
retical positions like those of Kelly (1955), 
Sullivan (1956), and Cronbach (1955), ac- 
cording to which the judge’s unique system of 
constructs, personifications, or implicit per- 
sonality theory are central in determining his 
impressions of people. The modification of 
expectancy theories to accord with the results 
of this investigation would require a mini- 
mizing of the emphasis upon people’s outlook 
on others in general and, instead, increased 
attention: (a) to the pull of aspects of the 
stimulus regarding which there is a large 
measure of interjudge agreement, and (b) to 
the role of individual differences in range of 
response. Finally, although less clear-cut on 
this last point, the results of this study, along 
with those of others, suggest that the theoreti- 
cal concept of a unitary attitude toward the 
generalized other may be too loose to have 
more than severely limited significance and 
that such terms as “implicit personality 
theory” might with greater precision be used in 
the plural rather than in the singular. 


SUMMARY 


The existence and relative power of a 
unitary attitude toward diverse persons, 
implied in statements of Kelly (1955) and 
Sullivan (1956), were investigated by utilizing 
Cronbach’s (1955) operational definition of 
“implicit personality theory.” Although the 
results supported the concept of a general 
response set, the fragility of intrajudge con- 
sistency relative to stimulus factors points to a 
construct difficult to validate. No differences in 
consistency were found to be related to sex of J 
or O. The findings suggest less theoretical 
emphasis upon J bias and greater attention to 
response variability and “reality” as evinced 
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by interjudge agreement concerning stimulus 
attributes. 
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AN EXPERIMENTAL TEST OF SCHEMA INTERACTION* 


DONALD P. SPENCE? 
Research Center for Mental Health, New York University 


HEN a person is asked to describe a 
strange face, he notes the obvious 
physical features and from them 
makes assumptions about the person’s possible 
feelings and other characteristics. These as- 
sumptions are partly implicit, based on past 
associations of physical features with specific 
experiences. Murray (1933) has described the 
working of these associations thus: “Certain 
physical features of the object which the sub- 
ject does not consciously perceive are neverthe- 
less physically affecting his body, and though 
he may be unable to report upon these internal 
happenings, they are nevertheless affecting his 
conscious appraisal of the object” (p. 312). 
Thus it would appear that the description of 
a strange face may be partly determined by 
stimulation outside of awareness. In slightly 
different language, it seems that the meaning 
attributed to a given facial expression depends 
on which schemata are influencing perception, 
where a schema, following Bartlett (1932), is a 
semipermanent aggregate of connotations not 
necessarily in awareness. Klein and Holt (1960) 
have suggested that an incidental stimulus can 
activate such a schema; hence the stimulus 
should be able to change the expression at- 
tributed to a strange face. Smith, Spence, and 
Klein (1959) showed that such is the case; 
when the words happy and angry were pre- 
sented below threshold, they influenced the 
description of a neutral face. Eagle (1959) 
showed that a subliminal aggressive picture 
can influence the attribution of aggressive 
traits to a neutral figure. Evidently the 
schema activated by the incidental stimulus 
interacts with the schema aroused by the face 
and results in a change in description. The 
more expereince the subject has had with the 


1'The study was carried out as part of a research 
project supported by grants from the Ford Foundation 
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2 The writer is indebted to George S. Klein for his 
help in setting up the experimental design, testing sub- 
jects, and for analysis and interpretation of the find- 
ings; to Robert R. Holt and Fred Pine for their sug- 
gestions and criticism; and to Paul Lippmann for his 
assistance in testing subjects and analyzing data. 


incidental stimulus, the more established is the 
associated schema, because schemata are built 
up from experience (Bartlett, 1932). 

How do we learn about the schema activated 
by the incidental stimulus? By using the sub- 
ject’s description of the stimulus in full aware- 
ness. Previous experiments with subliminal 
stimulation (Eagle, 1959; Smith et al., 1959) 
have indicated that the stimulus that registers 
below awareness corresponds in large degree 
with the description of the stimulus in full 
awareness. Thus a rating of the stimulus should 
be a good approximation of the schema it 
activates when presented incidentally. 

In the present study, the subject is asked to 
rate three emotional objects. This rating is 
supposed to crystallize a set of feelings about 
the objects—i.e., reinforce the object schema. 
In a later session, each object is presented 
incidentally, immediately prior to an ambig- 
uous face. The study tests the hypothesis that 
the incidental object reactivates the schema 
mobilized by the initial rating, and that this 
schema then interacts with the face schema to 
bring about changes in description, causing the 
face to take on some of the properties of the 
object. 


METHOD 


Design and Procedure 


The experiment consisted of two sessions averaging 
90 minutes each and separated by an average interval 
of 2 months. In the first session the subject was asked 
to describe three pictures of faces which subsequently 
became the focal stimuli. The faces were relatively ex- 
pressionless and yet easily discriminable. Each was ex- 

in a three-field tachistoscope for 15 seconds at an 
illumination of .8 footcandles, with the following in- 
structions: 

Im going to show you a face for a short look. 
Study it carefully. When I turn it off, I want you to 
Jean back in the chair and tell me whatever comes to 
mind—what this face reminds you of, how it makes 
you feel, anything you like. Keep talking until I ask 
you to stop. 

‘Associations to the face were recorded for 2 minutes, 
Then after another, 5-second exposure of the face, the 
subject described it on a rating form of 41 items, Each 
item was a dichotomy (eg., ‘“modest-boastful”) ex- 
panded to a seven-point scale which provided three 
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degrees of choice on each side of a middle, “can’t-tell” 
position. Items were chosen from Osgood (1958) to 
cover the three main factors of Evaluation, Potency, 
and Activity. $ 

After the subjects had given associations and ratings 
to all three faces, he was asked to leave the room tem- 
porarily and in his absence 11 stimulus objects were 
arranged on a table. The objects were selected to elicit 
a wide range of feelings and consisted of pipe, baby 
bottle, earring, lipstick, female stationery, pistol, 
whiskey bottle, sympathy card, exam bluebook, ink 
bottle, charred stick. Then the subject was brought 
back and given the following instructions: 

Now I want you to look at the objects here on the 
table. Look them over carefully and choose one. 
When you have picked it, I want you to sit back, 
study it and think about it—what it is, and what it 
reminds you of. Then I want you to lean back, close 
your eyes and talk about whatever comes to mind— 
what the object makes you think of, and how it makes 
you feel. Keep talking until I say stop. 

Order of selection was recorded for each subject. 
After he had given associations to the first object, it was 
shown. to him for 5 seconds in the tachistoscope. He 
then rated it on the Osgood checklist. The same pro- 
cedure was repeated for the remaining objects. 

‘The associations and checklist ratings were used to 
choose the two pre-face incidental stimuli given to each 
subject. There were three criteria for selection: check- 
list profile with a high number of extreme ratings, free 
associations that contained many feelings and personal 
references, and extreme positive or negative connota- 
tions in the associations. If the stimulus was high on all 
three criteria, it was assumed to have a particularly 
polarized schema that would have a substantial effect on 
the face schema. The two most extreme objects for each 
subject became his later incidental stimuli; one was 
positive (e.g., baby bottle) and the other was negative 
(e. g., pistol). 

The initial rating and free association to the faces 
and objects had the function of reinforcing one particu- 
lar set of associations for each stimulus. Of the many 
possible schemata activated by each face or object, one 
was made relatively explicit by being recorded on a 
rating scale. When the face or object was shown in a 
later session, it was assumed that this particular schema 
would be activated. 

The second session constituted the main part of the 
experiment. One to 3 months after the first session, 
each subject was re-exposed to the three faces, each 

face immediately preceded by one of the incidental 
stimuli previously chosen. Each face was exposed at 
330 milliseconds and the incidental object at less than 
30 milliseconds. It had been found in a previous study 
(Klein, Spence, Holt, & Gourevitch, 1958) that such a 
combination of speeds prevents the first, briefly exposed 
object from appearing in conscious awareness. To de- 
termine the exposure level for the pre-face stimulus, we 
established the subject’s recognition threshold for a line 

drawing of a diamond, and then presented the stimulus 

object a certain level above the diamond threshold. The 

increment was determined by a small pilot study. 
Exposures of the objects varied from 6 to 28 milli- 
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seconds. Median exposures were 11 milliseconds for the 
positive object, 12 milliseconds for the negative object, 
and 11 milliseconds for the neutral object. Since no sub- 
ject reported seeing the pre-face object, our exposures 
appear to havé been adequately incidental (in fact, 
probably subliminal most of the time). We may assume 
that the subject did not experience the objects as rele- 
vant and focal stimuli in his dealings with the face 
stimulus. o 

Each subject was exposed to the three faces in a 
constant order. The object-face pairing was counter- 
balanced with roughly half the subjects getting Face 1 
with the positive object and Face 2 with the negative 
object. All subjects were given Face 3 with the neutral 
object (the charred stick). The object was mounted on 
pegboard, slipped into channels placed at either side 
of one exposure field of a three-field tachistoscope. The 
face was placed in the second field. The subject was 
asked to fixate on the third, or blank field; at a given 
signal the object was briefly exposed followed immedi- 
ately by the face at 330 milliseconds, An adjustable 
electronic circuit controlled the rapid pairing of the 
stimuli, details of which may be found in Klein et al. 
(1958). ; 

The subject was told that he would see a face, one 
he had seen before, at four brief exposures. After the ex- 
posure of Face 1 and its pre-face object, the subject 
was asked to associate to it for 2 minutes. After the as- 
sociation, face and object were presented two more 
times and the subject was asked to describe the face on 
the Osgood checklist. Faces 2 and 3 were presented 
with the same instructions. At the end of the session, 
the subject was asked to list all objects he could re- 
member from the first session, Order of recall was re- 
corded. 


Subjects 

Subjects were paid volunteers, 13 men and 10 
women, who were undergraduates in a school of educa- 
tion. All subjects were part of a larger pool intensively 


studied on a variety of personality measures (see Gold- 
berger & Holt, 1958, for details). 


RESULTS 


Were four brief presentations of the stimulus 
sufficient to activate the object schema and 
influence the rating of the face? It was ex- 
pected that the object schema would interact 
with the face schema in such a way that the 
final rating would be a combination of the two 
earlier ratings. If we assume that the object 
was first rated “very strong” and the face was 
first rated “somewhat weak,” a combination 
of these two ratings would make the face, on 
the second rating, move toward the “strong” 
end of the scale. Thus the combination of ob- 
ject and face schemata would result in a shift 
toward the original object rating. We tallied 
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all ratings that changed two or more units on 
the checklist from the first rating of the face to 
the second, and subtracted shifts toward the 
object from shifts away from the object. By 
chance, both types of shifts are possible; by 
hypothesis, only shifts toward the object are 
expected. Therefore, shifts toward the object 
should exceed shifts away. Data were tallied 
separately for each object, each subject being 
given a score according to whether he had more 
shifts away from the object or toward the ob- 
ject. 

The majority of shifts were toward the object 
in all three object-face conditions, and the dif- 
ference was significant with two. When paired 
with the negative object, the ratings of the 
face shifted toward the earlier ratings of the 
object; 15 out of 23 subjects (p < .05, one- 
tailed sign test) showed a preponderance of 
shifts toward the object. When paired with the 
neutral object, the face’s ratings moved pre- 
dominantly toward the object’s ratings in 14 
out of 23 cases (p < .05, one-tailed sign test). 
When paired with the positive object, the 
ratings of the face moved toward the ratings of 
the object in 12 out of 23 cases (ns) but 7 of 
these cases were women; thus the women 
behaved as predicted, and the interaction of 
sex with shift toward the object approaches 
significance (p < .10, Fisher exact test, two- 
tailed). In the main, therefore, the hypothesis 
was confirmed. 


Additional Effects 


When the explicit properties of the incidental 
stimulus (e.g, “very modest,” “slightly 
healthy,” etc.) are not taken into account and 
we consider only the general evaluative di- 
mension, its influence on the face is much less 
predictable. Ratings of the face were scored on 
a 21-item evaluative scale, i.e., items were 
considered as positive or negative, according 
to a consensus of four judges. When the ob- 
jects were scored on this scale they were 
markedly different, with the positive objects 
having a mean rating of 18 and the negative 
objects a mean rating of 5. Thus the positive 
objects were positively evaluated and the 
negative objects the reverse. When the final 
ratings of the faces are scored on this scale, we 
find they were not influenced by the global 
properties of the objects; on the contrary, the 
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faces are not significantly more positive after 
being paired with the positive object nor 
significantly more negative after being paired 
with the negative object. The specific proper- 
ties of the object schema, rather than general 
“goodness” or “badness,” seem thus to have 
been transferred to the faces in the interaction 
of object and face schema. 


Memory 

Memory of the neutral incidental object was 
significantly related to its effect on the as- 
sociated face. At the end of the object-face 
pairings, subjects were asked to list all the 
objects they could remember from the earlier 
session. Some mentioned the neutral object 
early on the list, some later, and some not at 
all; subjects were ranked according to ease of 
recall, with a low rank indicating early recall 
and a high rank indicating late recall or for- 
getting. 

Subjects were ranked for degree of incidental 
effect by summing the algebraic total of shifts 
away from and toward the neutral object. The 
subject with the largest net number of shifts 
toward the object was given a rank of 1. 

Order of memory of the neutral object cor- 
related .56 (p < .02, two-tailed test) with 
degree of effect; subjects whose impresions of 
the face showed most direct influence by the 
object were also most likely to remember the 
neutral object. 

Memory of the positive and negative objects 
was not significantly related to amount of 
effect. 


Discussion 


The hypothesis that an incidental stimulus 
activates schemata and thus can influence the 
description of a face has been supported by the 
negative and neutral stimulus objects, and by 
the positive object for female subjects. The 
schema activated by the incidental object ap- 
pears to combine with the face schema in a 
predictable fashion and to influence the 
description of the face. The combination of 
schemata seems to follow fairly straightfor- 
ward additive principles, with one attribute 
increasing or decreasing the strength of a 
related attribute: e-g., “active” added to 
“passive” shifts the balance toward “active.” 

The results are in line with those of earlier 
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studies by Smith, Spence, and Klein (1959) 
and Bach and Klein (1957) in which sub- 
liminal stimuli appear to influence descriptions 
of supraliminal figures. In these studies a 
verbal stimulus outside of awareness (happy or 
angry) apparently activated a congruent 
schema and when the subject was asked to 
describe the supraliminal figure his description 
was influenced by the activated schema. The 
present study differs from these two in showing 
that the discrete attributes of the activated 
schema may interact with attributes of the face 
schema, thus supporting the assumption that 
an incidental stimulus can be faithfully 
registered outside of awareness in more or less 
specific detail, and that a generalization of 
effect does not necessarily take place. Thus the 
negative object did not make the face appear 
generally more negative, nor the positive 
object make it more positive; on the contrary, 
the effect was tied to specific object properties 
which represented a mixture of goodness and 
badness. 
When the stimulus is sufficiently compelling, 
a generalized incidental effect may occur. 
Eagle (1959) has shown that a subliminal 
exposure of a man being stabbed makes for a 
general increase in aggression attributed to the 
following stimulus; Pine (1960) showed that 
an oral-passive incidental stimulus makes for 
stories characterized by an oral-passive theme, 
but not necessarily containing specific ele- 
ments of the stimulus. In both cases, the 
incidental stimulus was heavily loaded with 
drive content and, through contact with 
instinctual wishes, its effect may be increased. 
Drive energy is presumed to follow the rules of 
the primary process—hence it is less bound by 
realistic notions of form and content and is 
more easily displaceable. Under such condi- 
tions, it seems reasonable that an incidental 
drive stimulus would have less precise effects. 
An incidental stimulus that made less contact 
with drive material would result in a more 
veridical effect, as was the case in the present 
study. It is interesting, in this connection, 
that the positive object, in some ways the 
most drive related because of its strong oral- 
passive connotations, showed the least veridi- 
cal effect. 
It should be noted here that the more 
veridical effects of the present study are easier 
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to trace because the prior object rating pro- 
vides a baseline. In Pine (1960) and Eagle 
(1959), there was no baseline and the stimulus 
that registeréd may have differed from the 
objective stimulus presented. In these studies 
the response may have been closely related to 
the regisigring stimulus and not, strictly 
speaking, a transformation. In the present 
study, the rating profile enables us to get 
closer to the registering stimulus and cuts down 
one possible source of error. Thus the greater 
veridical effect may be partly a consequence of 
method as well as the result of less drive. 


Congruence 


The failure of the positive object to conform 
to the hypothesis may stem, as already noted, 
from its greater drive relevance, but another 
possibility should be considered. All positive 
objects (e.g., baby bottle, female stationery) 
had (inadvertently) strong feminine connota- 
tions and they were paired with masculine 
faces. Now the connotations of any attribute 
used to describe a feminine object are quite 
different from its connotations when it is used 
to describe a male object. “Passive,” for 
example, may endow a feminine object with 
pleasant and favorable qualities, but when 
applied to a masculine face it has other im- 
plications. Passive obviously has different 
connotations in the two contexts and it would 
be misleading to assume that it has the same 
meaning in both contexts. Yet that is the 
assumption behind the experimental hypothe- 
sis—that passive has identical meanings in all 
contexts, and attributes can be added and 
subtracted like so many counters. 

This example suggests a general rule: the 
way in which two schemata interact depends 
on their overall contexts. If two schemata 
have roughly equivalent contexts, attributes 
from Schema I should have roughly the same 
connotations as attributes from Schema II, 
and the two sets of attributes can be treated as 
equivalent and combined without loss of 
meaning. What Eagle (1959) calls congruence 
seems to be an important variable in predicting 
the interaction of stimuli. If the congruence is 
low, the contexts are different, and the attri- 
butes of the separate schemata have different 
connotations; as a result, the final tating may 
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shift away and toward the object and the final 
effect should be difficult to predict. 

In general, we should expect to find more 
additive effect when congruence is high, and 
the results bear out this expectation. The 
negative and neutral objects that produced 
the strongest direct effect were either mascu- 
line in content or in form—e.g., pistol, whisky 
bottle, square sympathy card, and angular 
stick—and thus more congruent with the 
masculine faces. The positive objects were 
more feminine, less congruent, and less ef- 
fective. 


SUMMARY 


Tn an attempt to test propositions about the 
interaction of schemata, three stimuli, in- 
cidental to the stimulus to be rated, were 
paired with three supraliminal faces in a 
partly balanced design. Twenty-three sub- 
jects rated the stimuli and faces initially on an 
Osgood Semantic Differential. In a second 
session, several months later, they rated each 
face again, preceded by one of the objects 
below the level of focal awareness. It was 
predicted that the object and face schemata 
aroused by the first rating would be reac- 
tivated, and would interact in a predictable 
way to produce a new rating of the face. For 
two of the three types of objects, ratings of the 
face following the negative and control in- 
cidental stimuli moved in the direction of the 
object rating, in support of the hypothesis. 


The findings were discussed in terms of a 
theory of incidental schemata that influence 
ongoing cognition. 
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ATTITUDES TOWARD OLD PEOPLE IN AN OLDER SAMPLE! 


NATHAN KOGAN? 
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N a recent paper (Kogan, 1961), the 
present author described the development 
of a scale measuring attitudes toward old 

people (OP scale). For college samples, a 
positively and a negatively worded version of 
the OP scale were significantly correlated in 
the content direction, and scale reliabilities 
were generally satisfactory. In addition, 
significant relationships that could not be 
attributed to response set effects were found 
between OP scale scores and other attitudinal 
and personality variables. 

This paper reports results obtained from 
the administration of the OP scale to a sample 
of apparently healthy older adults of better- 
than-average intelligence and education. With 
college students’ responses serving as “base- 
line” data, OP scale characteristics are 
examined in the older sample with particular 
reference to the favorability and variability of 
the attitude, the internal consistency of the 
scales, the logical consistency in response to 
positively and negatively worded versions of 
the scales, and direction and magnitude of 
relations with other attitudinal variables. 

Specific predictions of differences between 
the younger and older samples were derived 
from two general assumptions. First, it was 
assumed that older people in American society 
have many of the qualities of a minority 
group, a point that has been stressed by a 
number of investigators (Barron, 1953; Drake, 
1958; Linden, 1957a, 1957b). If Lewin’s (1945) 
concept of “‘self-hatred” is applied in the 
present case, the empirical consequences might 
well take the form of a greater polarization 


1 This study was supported in part by the National 
Institute of Mental Health, United States Public Health 
Service, under Research Grant M-1867, and was con- 
ducted under the auspices of the Age Center of New 
England, Inc. Support for machine processing of much 
of the data was provided by Educational Testing Ser- 
vice. 

2 Grateful thanks are due Samuel J. Messick for 
statistical advice and critical comments on the paper. 
The author also wishes to express his appreciation to 
Beverly Scott, Florence C. Shelton, and Judith W. 
Stephens for their assistance in data collection and 

analysis. 


in the attitude; older people would be more 
likely than younger respondents to be found 
at either extreme regarding the issue. Hence, 
older adults should manifest greater inter- 
individual variability. Second, it was assumed 
that there would be a greater “ego involve- 
ment” on the part of older people in the content 
of the OP scale items. As a member of a group 
that comprised the object of the attitude under 
study, an older person might be expected to 
take a closer, more considered look at each 
OP item than would a younger one. Ac- 
cordingly, we should expect older subjects to 
exhibit somewhat greater logical consistency 
in responding to matched positive and negative 
OP items. 

No hypotheses were formulated with respect 
to differences between younger and older 
subjects in the favorability level of attitudes 
toward old people. If older individuals exhibit 
considerable variability in the extent of 
acceptance-disparagement of old people as a 
group, it is difficult to predict whether older 
subjects should be more or less biased than 
younger subjects. 

Similarly, no hypotheses are advanced 
Tespecting the magnitude of relations be- 
tween OP scale responses and responses to 
other attitudinal content in the older and 
younger samples. It will first be necessary to 
examine the extent to which stylistic response 
tendencies can account for relations between 
the variables. 


MertHop 


Subjects 


„Older subjects (89 males and 115 females) were ob- 
tained from the Age Center of New England, Inc., a 
nonprofit gerontological research organization in Bos- 
ton, Massachusetts, These subjects were volunteers, 
noninstitutionalized, and in apparently good health, 
whose membership in the organization involved a com- 
mitment to participate in a variety of research studies. 
Chronological ages of the subjects ranged from 54 to 
92 for the males, with a mean of 70.8 (SD = 7.3), and 
from 49 to 86 for the females, with a mean of 69.4 
(SD = 7.9). 

Approximately 70% of the older subjects held or 
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are now holding professional or managerial positions; 
most of the remaining 30% belong to the clerical or 
skilled craftsman occupational groups. The mean educa- 
tional level in the sample was 13.7 (SR = 2.8) and 12.9 
(SD = 3.1) years of school for males and females, re- 
spectively. Current Population Reports (Bureau of the 
Census, 1960), citing years of school completed for the 
civilian population of the United States in March 1959, 
reports a median figure of 8.7 for males 55-64 years and 
8.2 for males 65 years and over. The corresponding 
figures for females of those ages are 8.9 and 8.4 years 
of school, respectively. By contrast, median school 
years completed is 12.4 for both males and females in 
the 20-24 age group. Hence, the educational level of 
the older subjects could well be considered superior to 
a contemporary college student group. This inference 
is further supported by intelligence data available on a 
random subsample of 52 males and 73 females, who 
were given the vocabulary subtest of Wechsler’s (1944) 
intelligence test for adults. The mean scores for males 
and females were 34.7 (SD = 4.8) and 33.9 (SD = 5.4), 
respectively. Both means are slightly larger than those 
reported by Dana (1957) for college student samples. 

Young adult subjects (87 males and 81 females) were 
obtained from introductory classes in social science at 
Boston University.* 


Measuring Instruments 

A set of 34 “old people” items, 17 expressing negative 
sentiments about old people (OP—) and 17 reversed in 
wording expressing positive sentiments (OP-+), were 
interspersed among statements from other attitude 
domains yielding a 98-item battery for the older sample 
anda 72-item battery for the student subjects. Common 
to both samples, in addition to the OP scales, were 12- 
item F and reversed F scales and 6item “religious 
conventionalism” (RC) and reversed RC scales. The 
particular items used in the OP and F scales are re- 
ported in Kogan (1961). RC items were selected from 
Levinson (1954) and a set of matched items of opposite 
sign was constructed (Table 1). 

The following response categories were provided for 
all of the items: “strongly disagree,” “disagree,” 
“slightly disagree,” “slightly agree,” “agree,” and 
“strongly agree.” These were scored 1, 2, 3, 5, 6, and 
7, respectively, with a score of 4 assigned in the few 
cases where respondents omitted an item. 


RESULTS 

Mean scale scores for young and old subjects 
are shown in Table 2. While no age differences 
emerge for the OP— scale, there is a highly 
significant difference for OP+, with older 
subjects exhibiting a greater tendency to 
endorse such items. The latter finding when 
viewed relative to the direction of the mean 
differences for OP— would suggest that an 
acquiescent response set exerted a stronger 
influence in the older sample. 


2I am grateful to Robert Smart for making subjects 
available. 


TABLE 1 
ORIGINAL AND ReversED RC Scare ITeMs 


1(+). Every explanation of man and the world is in- 
complete unless it takes into account God’s will. 

1(—). God’s will should play no part in explaining 
man and the world; scientific knowledge is all 
that is needed. 

2(+). The chief end of man is nothing other than 
eternal salvation. 

2(—). The promotion of individual personal happiness 
is the chief end of man. 

3(+). Every person should have complete faith in 
some supernatural power whose decisions he 
obeys without question. 

3(—). There is no faith or supernatural power whose 
commandments one should obey without ques- 
tion. 

4(++). In addition to faith we need help from God in 
order to resist temptation. 

4(—). Prayer and faith in God don’t help very much 
in resisting temptation. 

5(++). The most important aim of the churches at the 
present time should be to encourage spiritual 
worship and a sense of communion with 
highest. 

5(—). The most important aim of the churches at the 
resent time should be to encourage man’s 
‘umanity to man. 

6(+). Life could hardly be worth living without the 

romise of immortality and life after death, 

Fhe promise of immortality and life after death 

js aie unnecessary for a worthwhile and happy 

e. 


Such an interpretation, however, is not 
unequivocally supported by the mean dif- 
ferences observed for the F and RC scales. 
With regard to authoritarianism items, it is 
the older men who score significantly higher 
in authoritarianism on F+, while the older 
women are significantly lower in authoritarian- 
ism on F—. While both sets of differences can 
be attributed to a stronger acquiescent re- 
sponse tendency in the older subjects, the fact 
remains that such tendencies were not of 
equal strength for original and reversed items. 
Similar findings may be noted in the domain of 
religious conventionalism. The older males 
achieve significantly higher scores on RC+ 
items, with no difference obtaining for RC—. 
The older women, on the other hand, manifest 
more religious conventionalism on both RC+- 
and RC— items, a finding directly counter to 
a simple acquiescence interpretation. In sum, 
it is evident that mean scale differences be- 
tween samples reflect a combination of set and 
content, with the relative contribution of set 
related to the particular content area of a 


scale. 
Tt will be recalled that we hypothesized 
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TABLE 2 
MEAN DIFFERENCES BETWEEN YOUNG AND OLD SusjecTs on OP, F, AND RC Scares 
wey aia raTa wens 
Scale 
Mu SD i SD t M SD M SD t 
OP- 55.85 11.73 | 57.26 | 11.97 79 53.78 | 10.21 | 55.83 | 12.61 | 1.26 
(3.28) ® (3.36) (3.16) (3.28) 
OP+ 64.06 11.10 91 9.84 | 4.52** +30. | 10.73 | 57.79 | 11.37 | 4.08** 
(3.77) (3.34) (3.78) (3.40) 
F+ 42.24 8.44° | 51.84 | 12.09¢ | 6.12** | 47.29 9.09° | 49.31 | 12.29° | 1.32 
(3.52) (4.31) (3.94) (4.11) 
F- 46.04 6.33b | 44.83 8.01è | 1.11 48.60 6.63 | 42.84 7.30 | 5.74** 
(3.84) (3.74) (4.05) (3.57) 
RC+ 23.80 8.50 | 29.45 8.18 | 4.49** | 24.62 7.64 | 29.96 9.01 | 4.47** 
(3.96) (4.91) (4.10) (4.99) 
RC- 26.83 6.75 | 26.90 6.06 07 25.72 6.00 | 27.94 S92 Be EY bg 
(4.47) (4.48) (4.28) (4.65) 


Note.—Converted means are shown for the OP+-, F—, and RC— scales in order that means for positive and negative scales be com- 
parable, i.e., higher values reflecting more negative attitudes toward old people, more authoritarianism and more religious conventionalism, 


^ Per item means, 
b Variances differ significantly at ,05 level, 
© Variances differ significantly at .01 level. 
* Significant at the .05 level. 
** Significant at the .01 level. 


TABLE 3 


Opp-Even RELIABILITY COEFFICIENTS (Spearman-Brown) AND INTERSCALE 
CORRELATIONS FOR YOUNG AND OLD SUBJECTS 


OP- OP+ F+ F- RC+ RC- 
OP— |.83| .79| .60**| —.02] .46**|  .46**| 17 | —.21* | 30 —.36** 
-75| 84] 414] 05) 16 | .46**] <04 | —.19* on ae 08 mE 
OP+ 74 | 73) 107, | —:35] <11 .34%4| <16 | —.21* | 113 :31** 
-79 | .82) —.30**| —.35**| —.04 | 117 | <00 |=.24*] 117 10 
F+ 61 | .80 | .48**| <15 | 29| ais] ‘o4 | —118 
59 | 81 | 3A230266 ol Aze i03 | =. 18” 
F- -23 .23 | —.02 -04 | —.03 ote 
A e |) 208) |! 43 1318] 114 
RC+ (86 | 182 ‘76%*| .42** 
‘71 | 90 .65**|  .44** 
RC- 71 63 
SST 53 


Note.—The four samples of 87 young men (YM), 81 young women (YW), 89 old men (OM), and 115 old women (OW) are arranged in 


the cells of the table as follows: 


* Significant at the .05 level. 
** Significant at the .01 level, 


greater variability among older subjects in 
their attitudes toward old people. Only for 
females on the OP— scale were the variances 
significantly different in the predicted direc- 
tion. Variances were significantly different 
between age groups, however, for both males 
and females on F+, and for males on F—. 
This may simply reflect the greater range of 


education in the older samples (Christie, 
Havel, & Seidenberg, 1958). 

Reliability coefficients and intercorrelations 
between the various scales are shown in Table 
3. With the exception of the F— scale, reliabili- 
ties for the four samples are of moderate to 
high magnitude. Turning to relationships 
between positively and negatively worded 
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versions of the three scales, it should be noted 
that r’s can theoretically range from +1.00 
(perfect logical consistency) to —1.00 (perfect 
logical inconsistency). It is apparent that r’s are 
uniformly higher in the content direction for 
younger subjects. Indeed, with the exception 
of r’s for females on the F— scale, differences 
between the younger and older samples in 
positive-negative scale correlations are statisti- 
cally significant beyond the .05 level or better 
(two-tailed test).4 Accordingly, we must 
abandon the hypothesis that the greater 
involvement of older subjects in OP item 
content is reflected in a higher correlation 
between oppositely worded versions of the OP 
scale. Particularly puzzling is the fact that 
the absolute level of r between OP— and OP+ 
for older subjects is less than that observed 
for F and RC items. Both of the latter would 
seem to be of less direct relevance to older 
subjects than are items explicitly referring to 
various attributes of old people. 

Consider next the relations between the 
OP, F, and RC scales reported in Table 3. 
In every case, significant (positive) 1’s_be- 
tween scales based on items worded in the 
same direction become nonsignificant Or 
significant in the set (negative) direction when 
r’s are computed between scales of opposite 
wording. In sum, it must be concluded that 
there are no clear content relationships between 
the three attitudinal domains under study for 
either young or old adults. An attempt to 
explain the absence of a relationship between 
OP and F in younger samples may be found 
in Kogan (1961). 


DISCUSSION 


The results reported above generally do not 
support the hypotheses advanced earlier in 


‘Two-tailed tests were employed for OP scale com- 
parisons for two reasons. First, a result in the direction 
opposite to that predicted could not be considered 
psychologically meaningless given the primitive state 
of theory in this area. Hence, the present problem does 
not seem to meet criteria proposed by Kimmel (1957) 
for the use of one-tailed tests. Second, the difference 
between the OP interscale r’s for young adults and 
maximum possible 7’s for older subjects (the product 
of the square roots of the reliabilities) is considerably 
less than the difference between the former and the 
minimum r’s possible for older subjects. In other words, 
it is statistically more difficult for the OP interscale r’s 
of the older subjects to be larger rather than smaller 
than the corresponding r’s for younger subjects. 
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the paper. In but one case out of four did older 
subjects exhibit greater variability in their at- 
titudes toward old people. Correlations between 
positively and negatively worded versions of 
the OP scale not only failed to exceed those 
reported for younger subjects, but actually 
were of approximately zero magnitude. 

Do these results call for a modification of 
the assumptions from which hypotheses were 
derived? It will be recalled that the older 
sample was distinguished from the younger in 
terms of the presence of tendencies toward 
disparagement of one’s age group and “ego 
involvement” in the issue at hand. These 
assumptions appear so very basic in terms of 
logic and previously reported evidence that 
one is prompted to look elsewhere in account- 
ing for the unexpected results of the study. 
Tt is most likely that the difficulty lies in the 
nature of the transition from assumption to hy- 
pothesis. In particular, does the well-known 
phenomenon of assimilation of “majority 
group” values in some “minority group” mem- 
bers necessarily imply greater interindividual 
variability for the minority than the majority 
in responses to “minority group” items? Fur- 
ther, does stronger personal involvement in an 
issue necessarily imply a heightened awareness 
of the logical oppositeness of statements re- 
garding that issue? 

Tt is a generally accepted view (Kuhlen, 
1959; Riegel, 1959) that the role of the older 
person in ‘American society is one likely to 
give rise to feelings of denial and ambivalence 
regarding one’s aging. A number of investiga- 
tors (Kutner, Fanshel, Togo, & Langner, 1956; 
Phillips, 1957; Tuckman & Lorge, 1954) 
report that substantial numbers of persons in 
their sixties and above decline to classify 
themselves as “elderly” or “old.” Given the 
inexorable physical changes accompanying ag- 
ing and their social stimulus value, the atti- 
tudes of many older persons toward their age 
peers should be characterized by ambivalence. 
The operational consequent of such ambiva- 
lence may well be an increased intraindividual 
variability in this attitude domain. The 
comparability of scale reliabilities for young 
and old subjects indicates that such differential 
variability is not manifested within uniformly 
positive or negative item sets. Rather, it is 
expressed in the extent to which younger and 
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older subjects are consistent in responding to 
matched items of reversed wording. 

Are we dealing here with anything more 
than differences in response sets in younger 
and older persons? There is little doubt in 
the present data that older subjects are more 
prone than younger subjects to “set” as 
opposed to “content” consistency in re- 
sponding to attitude items. This does not 
imply, of course, that older individuals 
make more response errors when filling out 
written questionnaires. Rather, the phenom- 
enon of acquiescence can probably best be con- 
sidered an important personality correlate of 
aging or an aspect of an “old age” syndrome. 
This view is consistent with recent theory and 
research regarding response set as a stylistic 
personality variable in its own right (Couch & 
Keniston, 1960; Jackson & Messick, 1958). 
Given the energy loss, passivity and depend- 
ency alleged to be associated with old age, one 
might well expect older individuals to differ 
from their younger counterparts in willingness 
to endorse “authoritative” statements. Indeed, 
Jackson (1959) has recently demonstrated a 
relationship between response acquiescence 
and “cognitive energy” level. 

The foregoing observations do not, however, 

account for the fact that the magnitude of 
the response set effect varied greatly across 
scales for older subjects and was ordered in a 
direction contrary to expectations, i.e., least 
content consistency on the OP scales. This 
finding suggests that the greater emotional 
involvement of old people in the attitude 
domain under consideration may have one of 
two alternative effects. Either the effect is 
one of a basic failure to apprehend the logical 
incompatibility in the paired statements, or, 
more likely, the feelings and experience of the 
older person are such that the matched state- 
ments may be logical, but not psychological, 
opposites. In the latter case, the inference is 
made that the attitude domain is less likely 
to be unidimensional for an older than a 
younger subject. More specifically, the older 
person’s ambivalence regarding aging and his 
direct exposure to individual differences 
among age peers may be contrasted with the 
overgeneralized view, positive or negative, that 
college age subjects are likely to form of a 
group relatively remote from them in age 
status. 
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The older subject confronted with a state- 
ment describing “good” qualities of old 
people, and a matched statement describing 
the absence of such qualities or opposing “bad” 
characteristics, may conceive of the former as 
fitting a particular kind of old people, the 
latter as appropriate to a different kind. In 
most casts, of course, the respondent, if not 
dissociating himself completely from the class 
of old people, very probably considers himself 
as the sort of old person described in the 
positively worded items. 

There is some evidence for this interpreta- 
tion in the OP item statistics’ for the younger 
and older subjects. The item yielding the 
highest absolute mean (most unfavorable 
response) for older subjects and the greatest 
mean discrepancy between younger and older 
subjects (younger subjects more favorable) 
teads as follows: “If old people expect to be 
liked, their first step is to try to get rid of their 
irritating faults.” This evidence further serves 
to clarify the psychological basis for failing to 
endow old people with “logically” consistent 
qualities. For, while an older subject may have 
a generally positive view of old people, a 
sensitivity to “irritating faults” necessarily 
results in his agreeing with statements ex- 
pressing those faults of special annoyance to 
him. Given, in addition, a stronger disposition 
to acquiesce in older persons, the observed 
differences between younger and older subjects 
assume a meaningful pattern. 

The present work may be of relevance to 
the broader area of minority-majority group 
relations. It will be acknowledged that the 
criteria for belonging in the “minority group” 
of old people are considerably more ambiguous 
than those for ethnic or religious group mem- 
bership. Nevertheless, a search for generalities 
might prove of value. 

_ Prenticé (1956) obtained very high correla- 
tions in the content direction between posi- 
tively and negatively worded religious and 


5 A three-page table giving OP item means, standard 
deviations, item-sum correlations and matched item 
pair correlations for the young and old samples has been 
deposited with the American Documentation Institute. 
Order Document No. 6766 from ADI Auxiliary Publica- 
tions Project, Photoduplication Service, Library of 
Congress; Washington 25, D. C., remitting in advance 
$1.25 for microfilm or $1.25 for photocopies. Make 
checks payable to: Chief, Photoduplication Service, 
Library of Congress, 
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racial minority items. Would these high 
correlations be maintained for minority group 
members responding to matched items of 
reversed wording whose conten’ concerns the 
minority in question? Would the magnitude of 
the correlation reflect the extent of “mar- 
ginality” (Glaser, 1958) in the group? Of 
course, it would be necessary in “studies of 
this kind to control for response set effects by 
including matched positive and negative 
items from other attitude domains. With 
response set controlled, it might prove possible 
to obtain item discrepancy scores for the 
individual minority group member, which 
would reflect major areas of ambivalence in 
one’s ethnic or religious identification. 

In a more practical vein, do the low OP 
interscale correlations for older subjects 
essentially rule out attitudes toward old 
people as a metric that can be studied in terms 
of its relations with other variables in older 
samples? This would appear to be an extreme 
view. While the OP interscale correlations 
hovered around zero for the older samples, 7’s 
between specific item pairs exhibited con- 
siderable variation. For five of the OP item 
pairs, these r’s were statistically significant for 
both male and female older subjects. There 
are, in other words, certain aspects of the 
attitude on which there is more consensus and, 
correspondingly, less ambivalence among 
older persons. The addition of other items of 
this kind and deletion of item pairs correlating 
nonsignificantly could well raise the interscale 
r to the levels obtained for younger samples. 
Alternatively, one might consider dividing the 
OP scales into unidimensional subscales, each 
reflecting a particular aspect of the attitude 
domain. The major focus of the present paper, 
however, has been one of accounting for dif- 
ferences between older and younger subjects in 
responses to a common set of items. 


SUMMARY 


The present paper compares the responses of 
college students and noninstitutionalized 
older people of comparable education and in- 
telligence to a set of statements measuring at- 
titudes toward old people (OP scale), author- 
itarian tendencies (F Scale), and religious 
conventionalism (RC scale). For each scale, 
matched positively and negatively worded 
items were used. Hypotheses formulated for 
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the OP scales—that older subjects would show 
greater interindividual variability and higher 
interscale correlations than younger subjects— 
were not confirmed. In fact, r’s between the 
OP-+ and OP— scales for the older subjects 
were approximately zero, and significantly less 
than those obtained for the younger samples. 
In the older samples, interscale 7’s for the F 
and RC scales, while higher in the content 
direction than the interscale 7’s for OP, were 
nevertheless significantly less than the cor- 
responding 7’s obtained in the student samples. 
On the basis of these results (in conjunction 
with the pattern of young-old mean differences 
on the scales), it was apparent that older 
subjects are more disposed than younger in- 
dividuals toward acquiescent response tend- 
encies. 

In accounting for age differences in OP 
scale results, emphasis was placed upon the 
ambivalence of the older person regarding his 
own age status and that of his age peers, with 
the result that logically incompatible state- 
ments were rendered psychologically con- 
sistent. By contrast, younger subjects were im- 
puted to have a more consistently negative or 
positive view of “old people,” with the conse- 
quence of a closer match between logical and 
psychological consistency. 
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series of recent experiments have no phonetic symbolism effect would be ex- 

attempted to determine thegmiversal- pected. It is also clear that not all words in any 

ity of phonetic symbolism in lan- given language are faithful examples of 
guages. In a study by Brown, Black, and phonetic symbolism. It seems apparent, 
Horowitz (1955), subjects were given the therefore, that the demonstration of any such 
task of choosing from a pair of antonyms, ina effect may be crucially conditioned in a biased 
language familiar to them, that word which fashion by precisely the same variables that 
“represented a translation equivalent from a are under investigation. 
“second pair of antonyms in a language un- Perhaps a better beginning point for 
familiar to them. A study by Maltzman, demonstration of universality of phonetic 
“Morrisett, and Brooks (1956), in which a symbolism would be one which attempts to 
single stimulus word was to be matched with determine the actual meanings assigned to 
one of a pair of response words, also employed various phonetic elements in nonsense Com- 
materialsin languages unfamiliar to the subjects binations by subjects from diverse linguistic 
jn both the stimulus and response positions. communities. Similarity of assigned meanings 
Still another variant procedure was employed for subjects using different languages, although 
by Brackbill and Little (1957). In this study not directly attacking the problem of phonetic 
subjects were required to make same-different symbolism jn natural languages, at least 
judgments for pairs of single words; the would assure us that some basis for a linguistic 
members of the pairs being from different symbolic communality is tenable. Accordingly, 
languages, one or both of which were un- the purpose of this study is to determine: (a) 
familiar to the subjects. All procedures, how- if systematically sampled speech sounds 
ever, have aimed at demonstrating greater arranged in nonsense combinations display 
than chance accuracy in the choice of the un- consistently different meanings when judged | 
known translation equivalent. This is not the on certain descriptive (affective) dimensions; 
place for a lengthy statement of the difficulties (b) if the changes in meaning bear any con- 
inherent in attempting to establish a workable sistent, lawful relation to the phonetic proper- 
criterion of translation equivalence, except to ties of the sounds; and (c) if the meaningful 
point out that linguists have generally de- differentiations of speech sounds and their 
spaired of any really satisfactory, analytic relations to phonetic properties correspond 
criterion. Brackbill and Little (1957) have across two contrasting linguistic communities, 
pointed out that the selection procedures for ‘American English versus Japanese. 
the items used by these investigators have 
not been sufficiently rigorous. It is doubtful 
whether any selection procedure, no matter A 
how analytic, can avoid the possibility of Subjects 


_ bias. A translator faced with the problem of The Se a loge were 79 
choosi i i undergraduate ani uate students o! sexes in 
ees eee cguvalgni roni Paane residence at the University of Illinois. All subjects in 


several alternative words of varying meaning this group spoke English as a native language and did 
overlap might well condition his choice On not have any knowledge of Japanese. The Japanese- 


the basis of a similarity in sound or form. If speaking subjects were 41 Japanese foreign students 
the translations are as dissimilar as possible, and staff members with varying competence in English, 

employed by or attending the University of Mlinois. 
1 This paper is based on a doctoral dissertation pre- All subjects in this group spoke the Tokyo dialect of 
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sented to the University of Illinois. Japanese and had resided m REN an ar of 
2 The writ tefull Jedges the supervision 13.9 months at the time of testing. Although all Jap- 
er gratefully acknowledges the supei with English, a total of 17 


and guidance of Charles E. Osgood. anese subjects were familiar 
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of the 41 subjects in this group had entered the country 
less than 3 months before testing. A single subject had 
resided in the United States for a total of 54 months 
and represented the maximum length of time in the 
country for this group. 
Stimulus Materials 
After a careful search of several languages histor- 
ically unrelated to English for correspondence in the 
phonemic inventory, Japanese was finally selected. The 
Phones selected are displayed in Table 1 with the spe- 
cific combinations employed in the experiment. The 
choice of these combinations was conditioned by three 
criteria: meaninglessness of the stimulus in both English 
and Japanese, equal representation of each of the sound 
elements in the total list with equal number of occur- 
rences of each consonant element before and after each 
of the five vowel elements, and adherence to the pho- 
nemic distribution laws of each of the languages. Pre- 
liminary work with Japanese bilingual informants un- 
covered six stimulus items that were identified as real 
words in Japanese. An identical number were found to 
be listed in English source books, Exclusion of these 
teal words found in English and Japanese was made 
impossible by the second and third criteria listed above. 
‘The final stimulus list was recorded on high quality 
equipment as spoken by a trained phonetician,* Because 
of the presence of minor differences between the articu- 
lation of the vowel and consonant phones in the two 
languages, a series of compromises in the production of 
the sounds was attempted by the speaker. These com- 
promises were designed to favor neither of the groups 
with respect to the articulations, After recording the 
items the experimenter and the phonetician selected the 
,one item of several attempts on each item which best 
“approximated the desired effect both with respect to 
articulation and technical recording criteria. These 
items were then cut out of the original tape and spliced 
end-to-end to form a continuous loop that could be 
played repetitively. 


Measuring Instrument 


In order to index the symbolic meaning of the stim- 
ulus materials, a 15-scale form of the semantic differ- 
ential was constructed, The scales were chosen to repre- 
sent, in part, the factors of evaluation, potency, and 
activity previously found in factor analyses of various 
stimuli (see Osgood, Suci, & Tannenbaum, 1957) and 
in part to reflect various sorts of physical dimensions 
deemed appropriate to the evaluation of sounds, 

‘The scales so chosen were combined, for purposes of 
analysis, into three composite scales identified as an 
evaluative composite (E), a potency composite (P), 
and an activity composite (A). The scales chosen to 
comprise the evaluative composite were: good-bad, 
beautiful-ugly, pleasant-unpleasant, high-low, and 
colorful-colorless. Those comprising the Potency com- 
posite were: heavy-light, powerful-powerless, thick- 
thin, large-small, hard-soft, and strong-weak. The actiy- 


* Grateful acknowledgment is extended to Elizabeth 
T. Uldall of Edinburgh University for her expert assist- 
ance in the recording of the stimuli. 
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TABLE 1 


MATRIX OF CONSONANT AND VOWEL COMBINATIONS 
EMPLOYED IN THE EXPERIMENT 


n č š g 


P $ 


OO By 


RK SD 


Note.—The 10 phonemic elements displayed in the table have 
the following orthographic equivalents: /p/ as in pat, /n/ as in 
nap, /č/ as in chap, /3/ as in shop, /g/ as in gap, /i/ as in 
ese Jel as in gape, /a/ as in pot, /o/ as in note, /u/ as in 

100! 


ity composite was derived from the scales: quiet-noisy, 
slow-fast, Passive-active, and cold-hot. One additional 
scale Was included, a familiar sounding-unfamiliar 
sounding scale, which did not enter into any of the 
composite scores. This scale was included to assess the 


The seven-point scales were randomized as to order 


: 
ý 


ion of the adjectival opposites, Booklets of 28 
assembled, each page duplicating the 16 
page for each of 25 stimulus items plus three 
items used in the first of two testing sessions 
ee repeated items used in the secOnd session. 


uctions for the experiment were read to the 
in the language appropriate to each language 
up. In the Japanese groups this was accomplished 
‘a native Japanese experimenter, the author osten- 
esent only to handle the tape recorder. The 
jons essentially paralleled those suggested by 
, Suci, and Tannenbaum (1957, p. 82ff.). 
jach subject served a total of 2 hours, each of the 
separated by at least a week’s interval. Twenty- 
uli were presented at each session. The first 
timuli of the first session were practice items and 
not analyzed. The first three stimuli of the second 
n were repeated items, chosen at random (the 
loops were shuffled in a large box) from the 25 
ental items presented in the previous testing. 
Subjects were tested in subgroups ranging in size from 
‘5 to 20 members. The order of the 25 stimuli was ran- 
domized for each of the groups before presentation. 
he two sets of 25 items were obtained by dividing the 
stimuli at random into two sets of 25 items each. 
‘This initial division of the items remained constant 
for both language groups, the randomization before 
each subgroup presentation within the two major 
anguage groups being used only on the two separate 
‘Subsets of 25 items. 
‘Each of the stimuli was repeated until all members 
the subgroup had completed checking all 16 scales 
that item. In those subgroups containing a few 
h slower subjects, a certain’ amount of speeded 
i ng was performed. No stimulus item, however, was 
tepeated fewer than 15 times. 


RESULTS AND DISCUSSION 


Scale Intercorrelations and Factor Analyses 


‘It will be recalled that the choice of par- 
ticular scales to be grouped into composites 
for the purposes of later analysis was made 
quite independently of the data collected in 
this experiment. The present data were 
analyzed, however, to determine the adequacy 
‘of these a priori decisions. This analysis took 
the form of two sets of factor analyses and 
acillary intercorrelations of the scale means 
for each of the language groups taken across 
all of the stimulus items. That is, each of the 
correlations is based on an N of 50 stimulus 
items for which the basic entry was the mean 
scale position of each of the subject groups 
‘as a whole. These intercorrelations were then 
factor analyzed using Thurstone’s centroid 
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Procedure. A total of 12 factors was extracted 
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from each of the language group scale inter- 
correlation matrices. Three of these factors 
were chosen for subsequent orthogonal rota- 
tion by means of the varimax criterion de- 
vised by Kaiser (1958). These factors were 
found to have approximate latent roots 
greater than 1.00 and were thus considered 
the major factors present in the correlation 
data. No other factors displayed approximate 
latent roots greater than one in either analy- 
sis.4 The rotated factor structures for the two 
language groups are displayed in Tables 2 and 
3. 
It is strikingly clear that, despite some 
shuffled scales in the two analyses (the greater 
number occurring in the analyses of the 
Japanese data), these factors are easily 
identifiable as the well-established first three 
factors obtained from analyses of meaningful 
verbal material by Osgood et al. (1957). 
Although the choice of scales was deliberately 
dictated by the expectation that the same 
basic factors would appear in the present 
situation, the fact that they do maintain this 
structure when nonsense materials are being 
judged suggests that the same system of 
affective meaning can be extended to the new 
type of stimulus ‘materials. In any case, these 
data justify the a priori decisions concerning 
the groupings of scales into composite scales. 
It remains to be seen whether some more 
extensive sampling of scales would have 
resulted in the factor structure identified here. 
‘A comparison of the factor loading profiles 
for the two groups, using rank order correla- 
tions, was carried out to determine the degree 
of similarity of the two groups as to factor 
structure. Since the first two Japanese factors 
ed to correspond most nearly to a 


appear c 
reversal of those factors for the American 


analysis, these correlations were computed 
between Factor 1 of the American analysis, 
identified as the evaluative factor, with Factor 
2 of the Japanese data, similarly identified; 
Factor 2 of the American data, the potency | 


4 Raw factor analyses and analyses of variance tables 
have been deposited with the American Documentation 
Institute. Order Document No. 6767 from ADI Auxili- 
ary Publications Project, Photoduplication Service, 
Library of Congress; Washington 25, D. C., remitting 
in advance $1.25 for microfilm or $1.25 for photocopies. 
Make checks payable to: Chief, Photoduplication 


Service, Library of Congress. 
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TABLE 2 
American Group ROTATED SCALE 
Centrom FACTOR LOADINGS 
Factors 
Scale 

1 2 3 
Good-bad -82°° —,08  —.04 
Beautiful-ugly A = .35 A 
Pleasant-unpleasant -89 —.31 .16 
Colorful-colorless 78 -O1 45 
High-low 30 —.44 75 
nents rh i BY 
Heavy-light ae : ran 
Hard-soft —.41 54 49 
Powerful-powerless 14 -90 Pat 
Thick-thin —.37 -19 —.37 
Large-small —.13 .87 —.20 
Passive-active —.01 —.36 —.80 
Quiet-noisy 54. —.47  —.48 
Slow-fast = .23 .21 —,91 
Cold-hot —.60 —.09 -00 
Familiar-unfamiliar -70 07 16 
Percentage variance 29.5 28.9 19.8 

TABLE 3 
JAPANESE Group ROTATED SCALE 
Cenrrom Factor LOADINGS 
Factors 
Scale 

1 2 3 
Good-bad .27 —.88 22 
Beautiful-ugly 45 —.82 26 
Pleasant-unpleasant 42 — .84 18 
Colorful-colorless 84 —.26 —.32 
High-low 70 —.23 =.55 
Strong-weak —.46 21° —.78 
Heavy-light —.90 .32 —.10 
Hard-soft -20 46 —.72 
Powerful-powerless —.74 +24 —,52 
Thick-thin —.95 140 —.14 
Large-small —.90 05 —.27 
Passive-active 14 14 84 
Quiet-noisy =.11 —.47 oda 
Slow-fast —.87 AS -33 
Cold-hot -81 .03 AE 
Familiar-unfamiliar —.04 —.80 —.12 
Percentage variance 40.1 22:6 29.4 


factor, was compared with its counterpart, 
Factor 1 of the Japanese; and the third or 
activity factors in each group were compared 
with one another. Loadings on the evaluative 
factors of the American and Japanese data 
correlated .89, the potency factor of the two 
groups correlated .74, and the activity factor 
in each group correlated .64. All correlations 
are significant at the .01 level. Thus it may 
be concluded, at least with respect to the 
profiles of factor loadings, that the two 
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language groups use similar semantic dimen- 
sions. 


Analyses of Variance 

In view of the foregoing findings of reason- 
able independence of the three composite 
measures, the original decision to carry out 
independént significance tests on each of 
these composite measures appeared to be 
justified. 

Assessment of the significance of the 
differences obtained in this experiment took 
the form of six separate four-variable classifi- 
cation analyses of variance, one for each of 
the three composite measures in each of the 
two language groups. The experimental 
classifications were those for consonant, 
vowel, position, and subject effects. The 
position variable represents the grand means 
for each of two equal subsets of the 50 stimulus 
items, each subset conforming to the experi- 
mental requirement of equal representation of 
all consonant elements before and after each 
of the vowel elements exercised for the stimulus 
list as a whole. Thus we have two miniature 
subsets of 25 stimulus items, each duplicating 
the experimental conditions imposed upon 
the original 50-item list. All variables with the 
exception of the subject effect were assumed 
to be fixed. The over-all model for the analysis 
is therefore of the mixed type. It was assumed 
that the interaction effects of the observations 
in which fixed and random effects enter are to 
be treated as random sampling variables. 
Tests of the random variable interactions and 
the subjects main effect were made utilizing 
the highest order interaction, or residual, in 
this analysis. 

A summary of the significance levels found 
in all analyses is presented in Table 4. Only 
the .01 and .05 levels are reported, even though 
in most instances the F ratios were significant 
far beyond the .01 level.5 

Inspection of the analyses for American 
subjects, displayed in Table 4, reveals that all 
main effects for all composite scores with the 
exception of the position effect are significant 
at the .01 level. The position effect reaches 
significance at the .05 level in the activity 
score analysis. Since all phoneme elements, as 
well as all consonant-vowel transitions, are 


5 See Footnote 4, 


PHONETIC SYMBOLISM 


TABLE 4 


= 
Japanese Group 


Source 


A 
Consonant “01 | .o1 | 01 | .o1 P -01 | .01 
Vowel 01 | .01 | .01 | .01 | ns 01 
Position ns | ns | .05| ns | ns | ms 
Subject 01 | .01 | .01 | .01 | .01 OL 
VXC ‘01 | .01 | .01 | .01 | .01 01 
WAP 01 | ns | -05| ns | ns .05 
VXS o1 | .01 | .01 | .01 | .01 01 
CXP 01 | .01 | ns 01 | .01 | ns 
cxs 01 | .01 | .01 | .O1 | .01 01 
PXS ns | ns | ns | ns | ns | ms 
VXCXP o1 | .01 | .01 | .01 | .01 01 
VXCXS ns | ns | ns | ns | -05 | ns 
VXPXS ns | ns | ns | ns | ns | ns 
CXPXS ns | ns | ns | -01 | .01| ns 
VE CC PSI |) anal a eae 


represented an equal number of times in both 
sublists, the position effect provides a test of 
the adequacy of the experimental controls in 
randomizing the individual bias effects ac- 
cruing to specific phoneme combinations. The 
activity composite score analysis indicates 
that this division was not adequate and repre- 
sents a caveat that must be borne in mind in 
interpreting the effects of consonant position 
with regard to that score. The vowel X posi- 
tion and subject X position interactions 
provided a similar check on the adequacy of 
counterbalancing with respect to the vowel and 
subject determinations, respectively. The P 
X S interactions are all nonsignificant in these 
analyses. Two of the V X P interactions 
reached significance at or beyond the .05 level 
in the American data analysis. The bias re- 
flected by these significant interactions is 
undoubtedly due to disproportionate effects 
of specific vowel-consonant combinations, the 
total effect of which is made more apparent 
because of the smaller number of items in the 
shortened sublists. i 

The higher order interactions were, as can 
be seen, uniformly nonsignificant in the 
American analyses, with the exception of the 
V X C X P effects. This latter interaction 
provided a test equivalent to a simple subjects 
by items analysis of variance and indicates 
that the stimulus items were reliably dif- 
ferentiated in the aggregate. 

Inspection of the Japanese data analyses 
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reveals that these results are in almost all 
regards parallel to those found for the Ameri- 
can group. The most notable exception is the 
C X P X S interaction. The CVC pattern is 
relatively uncommon in Japanese. This fact 
could well account for the idiosyncratic subject 
judgments of the consonant-position combina- 
tions and hence for the significant variance 
contribution of this interaction. Notice should 
also be taken of the nonsignificant vowel 
effect for the potency composite measure in 
this group. 

With the specific exceptions apparent in 
the table, the two sets of analyses indicate 
reliable differentiation of the phoneme elements, 
consonant positions and unique stimulus items. 
In addition, even though subjects were 
reliably differentiated with respect to judg- 
ments of these phoneme elements, such 
differences were not great enough to obscure 
the obtained differences between the elements 
themselves. Further, the absence of significant 
highest order interactions and stimulus-list 
subset biases, with several notable exceptions, 
lends credence to the utility of the experi- 
mental design. 


Familiarity Judgments 

In order to assess the extent of the influence 
of the subjects’ judgments of the familiarity of 
the stimulus items upon the composite meas- 
ure judgments, the familiarity scale scores 
were correlated with each of the composites. 
These correlations took the form of Pearson 
product-moment coefficients, for which the M 
was the 50 stimuli. The correlations for the 
American data between familiarity judgments 
and the evaluative, potency, and activity com- 
posites were: .59, —.15, and —.12, respec- 
tively. In the Japanese data, correlation co- 
efficients between familiarity judgments and 
evaluative, potency, and activity composites 
were: .50, —.15, and —.02, respectively. In 
addition, the mean familiarity judgments in- 
dicated that the stimuli were in truth rela- 
tively unfamiliar, although significant internal 
variation in this regard was present. If the 
subjects’ judgments of these nonsense stimuli 
had been conditioned by some generalization 
from close real-word associates, the covariation 
of the familiarity and composite measures 
should have been much more extensive. The 
data, however, support a conclusion that 
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attributes the meanings assigned to these 
stimuli on the basis of inherent symbolic con- 
tent other than, or at least beyond, that 
which might accrue to generalization from real 
words more familiar to the subjects. 


Composite Scale Means 


The preceding results indicate that we are 
justified in examining the stimulus configura- 
tions and phoneme element means in some 
detail in order to determine the precise nature 
of the symbolic content obtained. The individ- 
ual phoneme element trends are displayed in 
Figures 1, 2, and 3. These displays are ordered 
so that the stimulus elements follow the or- 
ganogenetic descriptions of the elements with 
regard to articulatory position from “front” 
to “back,” in the classical sense of these terms. 

Inspection of Figures 1 and 2 indicates, in 
general, that the more anterior consonant or 
vowel elements are judged to be more positive 
in evaluation and more negative in potency 
than the more posterior elements. The func- 
tions are hardly monotonic in character, al- 
though the direction of the change as a function 
of the stimulus dimension is surprisingly regu- 
lar in character for the evaluative and potency 
composites. The activity composite means are 
much more erratic in terms of the dimension 
employed. This composite measure, on the 
other hand, accounted for the least amount of 
total judgment variance of the three com- 
posites as determined by means of the factor 
analyses. 

Of the alternative orderings of the phoneme 
elements in terms of physical correlates, the 
organogenetic basis is probably not the most 
desirable. Instead, since the acoustic end- 
product of the articulatory modifications cor- 
responds more closely to the proximal stimulus 
giving rise to the perception of these stimuli, 
it might seem reasonable to turn to this type 
of description as a basis for the stimulus 
ordering. Fairbanks (1960) has dramatically 

demonstrated that the organogenetic descrip- 
tion, however, is almost perfectly correlated 
with the acoustic description in terms of 
formant locations. His vowel plots using Form- 
ant 1 and 2 coordinates indicate that the two 
descriptive systems are highly redundant, 
Although the stimuli were not actually sub- 
mitted to any harmonic analysis, it is possible 
to use data collected by Fairbanks and others 
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Fic. 1. American and Japanese group mean evalua- 
tive composite scores for vowels (upper) and consonants 
(lower). 
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Fic. 2, American and Japanese group mean potency 
composite scores for vowels (upper) and consonants 
(lower). 


in assessing the degree of covariation of these ! 
judgments with other acoustical analyses of the — 
phonemes selected. The upper display of Fig- — 
ure 2 shows the relative values of Formant 
2, after rescaling, as taken from data col- 
lected by Holbrook (1958). The most striking 
difference between the two descriptive systems — 
im terms of the judgmental data collected in — 


G 


MO 


PHONETIC SYMBOLISM 


Eat 
«| 


| 


Meon scale volve {1 low activity) 
a 
è 


à 


a 
è 


TN 


38 


Mor 


Fis. 3. American and Japanese group mean activity 
composite scores for vowels (upper) and consonants 
(lower). 


this experiment involves the return of the 
potency judgments for the vowel /u/ toa 
higher level, (‘“weaker” judgments) giving 
the function a J-shaped character. The Fy 
formant values follow this function at all 
points for the American data. (It will be re- 
called that the Japanese data did not disclose 
a significant vowel effect for this composite 
measure.) 

At least one reasonable interpretation for 
the obtained effect on this composite measure 
derives from the nearly universal correlation 
between high pitches and small objects to be 
found in nature. Thus by metaphorical exten- 
sion, to a correspondence between small ob- 
jects and weakness. A previous experiment 
reported by Solomon (1959), in which passive 
sonar signals were evaluated on a set of seman- 
tic differential scales similar to those used in 
this study, obtained almost identical results. 
Scales that were classified as contributing to a 
potency composite were found to distinguish 
between sonar signals having a preponderance 
of energy in the lower octave bands from those 
with energy concentrated in the higher bands. 
The direction of the correspondence was as 
reported in this study, i.e., the greater the low 
frequency energy in the signal, the “heavier” 
and “stronger” the judgments, and vice versa. 
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Cross-Linguistic Correspondences 


Tn order to assess the over-all correspondence 
of the two language groups in phonetic sym- 
bolism tendencies, product-moment correla- 
tions were computed for each of the composite 
scores utilizing all determinations available 
within each group, i.e., the individual mean 
judgments for each of the stimulus items. 
These correlations were found to be .46, 1905 
and .54 for the evaluative, potency, and ac- 
tivity composites, respectively. All correlations 
are significant at the .01 level with 49 df. The 
summary measures for each of the composite 
scores for each of the five vowel and consonant 
phoneme elements, as displayed in the fore- 
going exhibits, were also compared across the 
two language groups. The evaluative com- 
posite correlation coefficient for the vowel 
elements was computed to be .23 and that for 
consonants .95. The coefficient computed for 
the vowel elements is not significant while the 
consonant correlation reaches significance at 
the .01 level. The comparable correlations for 
the potency composite were found to be .57 
and .91 for the vowel and consonant elements, 
respectively. The activity composite correla- 
tions were .61 for the vowels and .64 for the 
consonants. All correlations for these second 
two composites are significant at the .05 level 
or beyond. As can be seen, the coefficients for 
the consonant correlations are uniformly 
higher than those for the vowels. If we assume 
for a moment that phoneme element judgments 
may be conditioned in part by some imaged 
articulatory posture characteristic of the 
sound elements, the foregoing result suggests 
an interesting conclusion: Since the consonants 
are more easily discriminated with respect to 
articulatory production, i.e., they are charac- 
terized by more easily identifiable muscle fea- 
tures of more diverse kinds, it is possible that 
the higher degree of correspondence obtained 
for these elements reflects the presence of this 
greater number of discriminable cues. 

In conclusion, it seems clear that, although 
this study does not afford direct evidence re- 
garding the existence of phonetic symbolism 
in natural languages—evidence difficult to 
obtain in any study—the magnitude of the 
differences in meanings assigned to these 
stimuli and the similarity of these meanings 
across the two language groups provides at 
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least indirect evidence of its presence and 
generality. 


SUMMARY 


An experiment is reported in which a set of 
50 nonsense stimulus items of CVC shape were 
judged on a set of 16 semantic differential 
scales by two groups of subjects with contrast- 
ing linguistic backgrounds. The subject lan- 
guages used were Japanese and English. Ex- 
perimental stimulus controls took the form of 
an incomplete latin square design in which 
each of five consonant elements preceded and 
followed each of five vowel elements an equal 
number of times with all elements represented 
and equally frequent in the final matrix. 

The mean judgments for all subjects in both 
groups indicated a general trend toward evalu- 
ating the front vowels and consonants as 
“pleasant” and “weak” and the back vowels 
and consonants as more “unpleasant” and 
“strong.” 

Four-way analyses of variance indicated 
that, in the main, the phoneme elements, the 
consonant positions, and the phoneme com- 
binations were reliably differentiated in the 
aggregate by the subjects of both groups. 

Data collected on the familiarity of the 
stimulus items showed significant internal vari- 
ation but was shown to covary with evaluative 
scores only. 

Centroid factor analysis of the scale inter- 
correlations revealed the presence of three 
factors that accounted for a high percentage of 
the total judgment variance for both groups. 
These factors were easily identifiable as corre- 
sponding to previously observed dimensions 
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and were found to have a high degree. of 
similarity across the two language groups. 

It was concluded that the materials had 
expressive symbolic value accruing to their 
inherent phonetic content and not to any 
meanings via real-word associates. These af- 
fective meanings were found to bear consistent, 
lawful relations to the phonetic properties of 
the sounds. The fact that these meaningful 
differentiations and their relations to phonetic 
properties proved to be highly similar across 
two contrasting linguistic groups suggests that 
the laws governing phonetic symbolism may 
have a universal character. 
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osr factor analyses of child training 
practices have been done ©n data 
obtained from parents in one cul- 
“ture, usually USA or Europe (e.g., Takala & 
Takala, 1960). This paper reports a factor 
analysis of the stated socialization practices of 
mothers in six different cultural settings ob- 
< tained through systematic interviews. The 
analysis was done in this form for the purpose 
of reducing a large number of scales in order 
to test antecedent-consequent hypotheses, but 
the results have value in their own right. 
The identification of pancultural factors 
"May suggest ways in which studies of child 
" training practices may be simplified by focus- 
ing the researcher’s attention on major sources 
_ of differences in mothers’ statements regarding 
| their practices (and possibly differences in the 
€ actual practices themselves). To the extent that 
the factors extracted are clear and are deter- 
mined by differences between mothers rather 
than cultural differences alone, the factors can 
be used as dimensions in the direct comparison 
| of practices in different societies or cultural 
| contexts. Finally, the factors extracted may 
assist in formulating more relevant theoriescon- 
cerning the functional prerequisites for any 
society, or, more modestly, concerning sets of 
antecedent or consequent variables of personal- 
ity or social structure. 


1 This paper is a report on a portion of work done 
under grants from the Social Science Research Council, 
the Ford Foundation (Behavioral Science Division), 
and NIMH (Project M2331) to the Cornell-Harvard- 
Yale Study of Socialization in Five Cultures, directed 
by John W. M. Whiting, Irvin Child, and William w. 
Lambert. Other aspects of work reported to date can 
be found in Whiting, Child, Lambert, Fisher, Fisher, 
Minturn, Maretski, Maretski, Nydegger, Nydegger, 
Romney, and Romney (1955), Lambert (1960), and 
Whiting and Whiting (1960). 
2 The authors wish to express their gratitude to Jack 
_and Ann Fisher, Thomas and Hatsumi Maretski, Wil- 
liam and Corinne Nydegger, Kimbal and) Romaine 
Romney, and Robert and Barbara Levine for the use 
of their data. These investigators gathered the data in 
the Northeastern United States, Okinawa, the Philip- 
pines, Mexico, and Africa, respectively. The Indian 
data were gathered by Leigh Minturn Triandis. 
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PANCULTURAL FACTOR ANALYSIS OF REPORTED 
SOCIALIZATION PRACTICES! 


WILLIAM W. LAMBERT 
Cornell University 


The problems involved here are many, and 
not all of them will be dealt with in the pres- 
ent paper. There are issues of the degree to 
which what mothers report as their practices 
are in fact their practices; there are issues con- 
cerning the relationship of intracultural factor 
structures to the pancultural factors reported 
here; there are problems arising from transla- 
tion and back translation of the questionnaire 
and responses, etc. Many of these issues are 
to be dealt with in subsequent reports; some 
are not resolvable in the present state of our 
methods. Our present aim is to report what 
was done and what emerged from the analysis, 
along with a preliminary evaluation of results. 


FIELD PROCEDURE 


Field teams (usually a husband and wife) 
were trained and sent out to relatively homo- 
geneous communities in North India, Oki- 
nawa, Mexico (Indian), the Philippines, the 
Northeastern United States, and to an African 
tribe in Kenya, in order to conduct intensive 
studies in techniques of socialization and the 
behavior of children. An attempt was made 
to use several different methods in each cul- 
tural setting, but also to study the handling 
of several behavior systems that could be 
viewed as common to all the societies. One of 
the instruments used was a standard interview 
(with probes) which was prepared in the 
United States, and then translated into the 
native language. In some cases the questions 
were altered in order to make the questions 
fit the concepts and practices of the several 
cultural settings. The answers of the mothers 
(or mother-surrogates) of a carefully sampled 
group of children were recorded verbatim to 
the 21 classes of questions, with an average of 
four subquestions per class. The total number 
of mothers interviewed was 133. 

The samples of mothers are reflected in 
Table 1, in terms of the distribution of age 
and sex of the child that each mother talked 
about. The ages ranged from 3 to 10 years, 
each child came from a different family, and 
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an attempt was made to counterbalance birth 
order and to ensure that each sample child 
was fairly normal physically. 

All of the interviews were translated into 
English with the aid of bilingual informants 
where necessary. They were all sent to Cornell 
and analyzed by trained coders and raters. 


ANALYSIS PROCEDURE AND RELIABILITY 


Since it was impossible to use all of the in- 
formation obtained from the mother inter- 
views in the factor analysis, scales were se- 
lected on the following bases: (a) a minimum 
of “no answers” in all six subsamples; (6) rela- 
tively high (estimated) coder reliability; (c) 
theoretical relevance, including relevance to 
past factor analyses. Not all of these considera- 
tions could be maximized, of course. The scales 
finally retained for the factor analysis are 
listed in Table 2, which tap a number of as- 
pects of socialization practices: the handling 
by socializing agents (particularly the mother) 
of a number of different behavior systems, 
aspects of caretaking, forms of discipline, 
some kinds of rules, and the apparent or re- 
ported communication of these rules, and the 
sex and age of the child. 

Three types of scales are included in Table 
2. Scales 1 through 3 represent a placement of 
the mother’s verbatim answer into a category 
implied by the following anchor points of a 
nine-point scale: (1) Mother (or surrogate) 
physically punishes child or deprives child of 
privileges; (3) Mother reprimands or teaches; 
(5) Mother withholds approval or ignores; (7) 
Mother gives approval or gives emotional 
support; (9) Mother gives reward or gives 
active help. The answers of the mothers were 
placed on intermediate points of this scale 
when necessary, and if the mother’s answer 
indicated that her response encompassed 
items on two points of the scale (e.g., “Some- 
times I scold him; sometimes I just ignore 

it”) the answer would be an average of the 
two: the example would be coded 4 on the 
nine-point scale. 

The second type of scale in Table 2 involves 
the seven-point scales numbered 4 through 18, 
which the coders rated on the basis of reading 
the entire verbatim protocol of the mother’s 
interview. Point 7 represents the high rating 
and Point 1 the low rating. 


TABLE 1 ` 
THE NUMBER OF CHILDREN 
Girl: irl: Bi 
Country „ seers 740% oe Sonn ros, J 

USA 6 6 6 eG 
India 6 6 6 6 
Okinawa 6 6 6 6 
Philippines 6 6 5 6 
Mexico 6 5 6 5 
Africa 4 4 4 4 

TABLE 2 


SCALES USED IN THE FACTOR ANALYSIS 


10. Degree that mother’s hostility is contingent upon | 


15. Frequency and intensity of physical punishment i 
16. Degree to which privileges and gifts are contingent 


17. Reward for retaliatory aggression to peers, 
18. Degree to which mother is aggressive when child is 


19. Amount of time that mother cares for child 
20. Amount of time that the mother cared for child A 


: es 
21. Amount of time adult other than the mother cared | 


22. Age at which child began to dress self fr 
23. Age at which child began to play away from the 


24. Total frequency of chores 
25. Total number of chores ME 
26. Degree to which mother expects immediate obedi- 


27. Sex of child 
28. Age of child 


19 through 28, which includes a variety of” 
scales (age, time, frequency, etc.) used to place 
the answers of the mothers to individual ques- 
tions. The nature of the scale used in each case 
is fairly obvious from the name given it in 
Table 2. 


1, Degree to which mother, when busy, responds to 
child’s requests for help 4 

2. Degree to which mother is positive when child 
fights with other children ‘ad 

3. Degree to which mother is positive or nonpunish: 
ing when child becomes angry while being scolded _ 

4. Proportion of caretaking done by mother 

5. General warmth of mother 

6. Mood variation of mother’s warmth 

7. General hostility of mother 

8. Mood variation of hostility 

9. Degree that mother’s warmth is contingent ul 
child’s actions 


child’s actions 


11. Amount of communication of rules f 
12. Consistency of aggression rules 
13. Amount of praise 

14. Consistency of mother’s follow-through oñ non- 


routine demands for obedience 


upon child’s behavior 


angry or aggressive 


when child was a baby 
for child when child was a baby 


house 


ence 


Not all mothers answered all questions i 
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such a way that their answers could be placed 
on all of the scales, and this raised problems 
for the computing machine, which cannot con- 
veniently handle such omissions. The problem 
was overcome by entering the average response 
given by all the other mothers in the given 
culture in the place of the “no answer.” 

The reliability of the codings and ratings is 
reported in Table 3. A fairly random sample 
of protocols (from different societies) was 
coded by two coders, the number of the sam- 
ple involved being given in the second column 
of Table 3 opposite the relevant scale number. 
The next column reports the number of times 
only one coder entered a judgment (a form of 
error). The next to last column contains the 
percentages of disagreement exceeding one 
point on the relevant scale. The final column 
contains the Pearson correlation coefficients 
between the two raters’ judgments. These 
coefficients reflect the errors due both to the 
fact that one rater made a judgment and the 
other did not, and to disagreements in judg- 
ments. The first source of error is quite im- 
portant, since the reliability of Scale 23 rises 
to 1.00 when only cases where both raters 
made a judgment are used. The coefficients 
are not corrected for attenuation. On the basis 
of these considerations the coder reliability 
was regarded as adequate for the task. 

The factor analysis was conducted with the 
Iliac computer, with the data from all six 
cultures put together to produce a correlation 
matrix of the variables listed in Table 2. The 
matrix was then analyzed, using a centroid 
analysis with iterated communalities. The 
factor scores were then rotated for orthogonal 
simple structure with the Iliac varimax rota- 
tion program. Ten factors were extracted, 
seven of which appear to present few problems 
of interpretation. 


RESULTS AND INTERPRETATION 


The reference vector factors that resulted 
from the varimax rotation are presented in 
Table 4. For the purposes of factor definition, 
we will use an arbitrary cutting point of .40, 
but will discuss the relation of scales with 
lower loadings to the factor definition, particu- 
larly loadings from .30 to .40, since such load- 
ings may help to clarify the factor. 

Table 5 presents the amount of both the ex- 
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TABLE 3 
Data REGARDING SCALE RELIABILITY 


% Disagree- Correlation 


Scale Total Number Ment greater coefficient 
Number numberof of single RT 
entries answers petir pomeni 
1 19 1 0 Se 
2 6 2 50 +53 
3 9 0 A -68 
4 17 0 24 49 
5 17 1 19 300. 
6 16 3 14 +59 
7 16 4 13 74 
8 17 5 17 67 
9 17 2 30 49 
10 16 1 18 AT 
11 17 6 3 81 
12 17 1 6 +54 
13 17 1 19 +45 
14 17 2 13 67 
15 17 0 8 49 
16 17 0 13 -53 
17 17 0 5 soL 
18 17 4 7 61 
19 23 0 26 .13 
20 23 0 39 7 
21 23 0 35 85 
22 22 0 0 1.00 
23 21 2 0 298 
24 23 0 0 -96 
25 23 0 0 -98 
26 23 0 56 65 
27 23 0 0 1.00 
28 23 0 0 1.00 


8 If computed using double entries only, 1.00, 


tracted variance and of the total variance ac- 
counted for by each of the 10 factors. Column 
1 shows the amount of variance accounted for 
by all 10 factors that is attributable to each 
particular factor. The second column shows: 
how much of the total variance of the original 
correlation matrix is accounted for by each 
factor. All 10 of the factors extracted account 
for 48% of the total matrix variance. Although 
we would expect that a good deal of the re- 
maining variance is due to error of one form or 
another, some of it may well be involved in 
factors unique to the cultural settings studied. 
This problem is to be discussed in later reports. 

Before discussing the factors of Table 5 in 
more detail some readers will wish to peruse 
Table 6 which presents the results of an analy- 
sis of variance of the degree to which the fac- 
tor variance (in the first seven of the factors) 
is due to variance within societies (that is, 
to differences in mothers or children) as com- 
pared to variance between societies (due, 
thence, to differences in the cultural means). 
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TABLE 4 
Varmax ROTATION on Ten CEenTRom FACTORS USING ITERATED COMMUNALITIES 
(Overall Analysis) 
Scale Factor Number 
Number 1 2 3 4 5 6 7 8 9 10 

1 14 +.06 —.06 +.04 —.09 —.02 +.09 —.08 —.08 —.52 
2 HT +.00 —.05 +.07 +.06 +.03 3 +.14 +.02 +.48. +.15 
3 —.15 —.16 +.08 +.02 —.06 —.76 —.17 +.05 +.11 —.27 
4 +.06 +.03 —.02 +.04 +.74 —.02 +.06 +.03 +.07 +.11 
5 —.18 +.6L —.09 —.07 +.03 +.09 +.04 —.02 +.32 —.05 
6 -02 —.05 +.02 —.04 +.27 +.01 +.74 —.10 +.08 = 17 
7 SAN —.69 —.09 —.02 —.01 +.05 +.34 —.02 +.14 +,10 
8 —.08 —.20 —.07 +.03 +.10 +.10 +.91 —.01 +.01 — .02 
9 —.18 —.21 +.25 —.03 —.14 +.17 +.26 —.50 +.12 + alg 
10 +.07 —.08 +.16 —.22 =—,29 —.09 =.21 —.29 +.07 +.08 
11 +.04 +.03 +.24 —.29 +.21 +.22 —.05 +.03 +.28 —.02 
12 +.04 +.02 +.70 —.21 —.08 —.17 —.09 —.09 —.07 +.04 
13 i +.52 —.03 —.31 —.10 +.09 +.05 —.13 +.15 +.12 
14 —.05 +.00 +.49 +.09 +.03 +.36 +.00 —.04 +.23 —.09 
15 —.04 —.58 +.18 +.04 —.11 +.33 +.19 —.29 +.05 +.10 
16 4.11 —.08 +.14 —.09 ' —.22 +.39 —.11 —.17 +.05 +.28 
17 —.10 +.03 —.68 +.16 +.24 —.03 —.01 +.03 +.16 —.09 
18 +.03 —.28 +.11 —.28 +.05 +.64 +.09 —.02 —.14 —.20 
19 —.02 —.02 —.09 +.12 +.54 +.04 +.15 —.10 +.01 —.02 
20 +.01 —.07 —.03 +.72 +.08 —.02 —.10 +.03 +.07 —.12 
21 +.01 +.05 +.03 —.46 —.08 +.04 — .06 +.02 +.10 —.05 
22 +.03 —.04 +.11 —.02 —.13 —.01 —.02 +.53 —.00 +.14 
23 +.29 +.02 +.01 +.15 +.04 +.36 +.00 +.09 +.23 —.12 
24 +.64 —.05 +.02 +.14 +.05 +.24 —.02 —.22 —.04 —.12 
25 +.69 =.05 —,0L —.16 +.00 +.08 —.09 +.06 +.01 +.06 
26 — .02 =,03 +.37 Far +.03 +.06 —.01 +.07 +.02 +.01 
27 +.01 +.06 —.01 =;10 -00 —.04 —.03 —.03 +.35 —.01 
28 +.56 +.05 +.12 +.10  +.01 =.12 +.04 +.22 +-21 —.17 


The variance is preponderantly determined 
by within-society variance in all cases. In 
only the case of Factor 6 does the between- 
society variance approach the within-society 
variance. This is evidence that we are dealing 
with factors that have pancultural meaning 
and may be useful in more than one societal 
context for the study of socialization processes. 
Factor 1: Responsibility demands on the 
child. This first factor is easily interpreted, 
since it has to do with responsibility demands 
on the child. The ratings include not only the 
chores assigned to the child by the mother, 
but also those assigned by others. Therefore 
this factor deals with the general pressures on 
children for doing things to help the family 
get its work done, and some students of groups 
would not be surprised that this task orienta- 
tion should emerge at once from a factor analy- 
sis of socialization processes. The factor may 
have relevance, then, to group processes as 
well as to personality development, as is true 
of most of the factors below. 
We can characterize the children who have 
high scores on this factor as follows: they do a 


relatively large number of different chores 
(Scale 25, +.694); they do chores relatively 
more frequently (Scale 24, +.64) and, as one 
might expect, they are to some extent rela- 
tively older (Scale 28, +.56). 

These are the only scales with loadings of 
greater than .30, but it is interesting that 
Scale 23, which relates to the age at which 
children begin to play away from home, has a 
loading of +.29 in Factor 1. This may be be- 
cause the chores take a child away from home 
early, thus giving him (to some extent) greater 
freedom in play as well. 

Factor 2: Use of positive affect by mother. 
Factor 2 is also easy to interpret, having to do 
with what is often referred to as “maternal 
warmth.” We have chosen to label this factor 
“ase of positive affect by mother,” however, 
since instrumental as well as expressive aspects 
seem to be involved. Mothers at the warm 
end of this dimension are characterized by 
low general hostility (Scale 7, —.69), high 
general warmth (Scale 5, +.61), relatively 
high use of praise (Scale 13, +.52), and rela- 
tively low frequency and intensity of physical 
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TABLE 5 


PERCENTAGE OF EXTRACTED AND OF TOTAL VARIANCE 
REFERABLE TO EACH OF THE DEFINED FACTORS 


P É 
mte Seene aae 
1 11.1 5.2 
2 12.7 6.0 
3 12.1 e5-7 
4 9.5 4.5 
5 9.4 4.5 
6 13.1 6.2 
th 13.3 6.3 
8 6.9 3.2 
9 6.3 3.0 
10 5.4 2.6 
Total 100.0 48.2 
TABLE 6 


DETERMINANTS OF VARIANCE OF FACTOR SCORES 


Percentage of within Percentage of between 


Factor A ty 
society variance society variance 


al 94.5 5.5 
2 83.7 16.3 
3 67.2 32.8 
4 92.8 7.2 
5 73.5 26.5 
6 52.9 47.1 
7 76.3 23.7 


punishment directed toward the child (Scale 
15, —.58). 

These scales are consistent in meaning that 
mothers with high loadings are warm rather 
than hostile with their children, and that they 
control them through praise rather than 
through punishment and threat of punishment. 
The only other scale that approaches an in- 
teresting level of loading is Scale 18, which has 
to do with the degree to which the mother is 
aggressive when the child is angry or aggres- 
sive (—.28), which is certainly consistent in 
tone, 

Factor 3: Control of the child’s peer aggression 
and obedience. This factor clearly deals with an 
important aspect of aggression control, focus- 
ing on the peer relationship, but also dealing 
with expectations and family rules regarding 
Obedience. Mothers who have high scores on 
this factor may be characterized as having 
consistent rules regarding aggression (Scale 
12, +.70), they tend to punish the child’s 
retaliatory aggression directed toward the 
instigating peers, and the mother tends to be 
relatively consistent in following up on her 
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nonroutine demands for obedience (Scale 14, 
+.49). It is consistent that these mothers 
also tend to expect immediate obedience (Scale 
26, +.37) from their child. 

There is no inconsistency in finding two low 
positive loadings in Scale 11 (+.24), reflecting 
the amount of communication of rules in 
general by the mother (certainly aggression 
rules are a component here, though the per- 
sistent articulateness of the mother may not 
be high), and in Scale 9 (+.25), contingency 
of mother’s warmth. A mother who had rules 
in the areas of aggression and obedience and 
followed up on them would be expected to 
make her warmth at least somewhat depend- 
ent upon the child’s obedience and his non- 
retaliatory behavior to peers. 

In defining this factor, in which the loading 
on reward for retaliation (toward peers) looms 
so large, it is interesting that Scale 2, “Degree 
to which mother is positive when child fights 
with other children,” has essentially a zero 
loading on this factor, but defines (if anything 
does) Factor 9. This may suggest a distinction 
between retaliating when one has been picked 
on as compared to starting a fight (or, more 
vaguely, just “being in a fight”). One might 
expect these to differ, but the surprise arises 
in their orthogonality. It is also interesting 
that the mother’s reaction to her child’s retali- 
ation to peers is not correlated with her own 
reaction to the child’s aggressiveness while 
being scolded (Scale 3), nor to the “degree to 
which she herself uses aggression as a reaction 
to the child’s aggressiveness (Scale 18). These 
last two scales define Factor 6, and are there- 
fore orthogonal to Factor 3. Panculturally 
viewed, then, aggression toward peers is a 
different thing, and is socialized differently, 
from aggression toward parents or other social- 
izing agents. 

Factor 4: Extent of mother’s caretaking of 
the baby. This factor is clearly involved with 
caretaking. Mothers who score high on this 
factor can be characterized by the fact that 
the mother had a higher proportion of the re- 
sponsibility for her child when he (or she) was a 
baby (Scale 20, +.72). and that other adults 
were relatively responsible for at this activity 
(Scale 21, —.46). 

There are a number of other scales that have 
low, but suggestive, loading on this factor. 
There seems to be some tendency for mothers 
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high on Factor 4 to show “flat” affect to their 
children, since they are less aggressive when 
the child is angry or aggressive toward them 
(Scale 18, —.28), and they use praise less 
(Scale 13, —.31). It is also possible that these 
mothers, as a result of long contact with their 
babies, can assume more about their child’s 
behavior, since they do less communicating 
of rules (Scale 11, —.29), they have less con- 
sistent aggressive rules (Scale 12, —.21), and 
their own hostility is somewhat less contingent 
upon the behavior of the child (Scale 10, 
—.22). Perhaps all these low loadings can be 
interpreted as displaying the fact that if the 
mother has had long association with her 
baby, she needs to give less overt information 
to the baby when it has become a child. The 
interesting thing here, then, is that this holds 
for affectional as well as instrumental informa- 
tion. 

Note that the scales which reflect the pro- 
portion of the caretaking done by the mother 
(Scale 19 in Factor 5, as well as Scales 20 and 
21 in Factor 4) reflect answers to the ques- 
tions, “Who usually cares for your child?” 
and “Who did this when he was a baby?” The 
answers were scaled from “never” to “always.” 
‘These scales do not reflect, or only reflect im- 
perfectly, the total amount of time the mother 
spends with the child. They do reflect the de- 
gree of responsibility for care of the child. 
‘Otherwise we would expect higher correlations 
with scalés that reflect the personality of the 
mother more directly. 

Factor 5: Extent of mother’s caretaking of the 
child. This factor clearly represents the pro- 
portion of the caretaking of the child that is 
done by the mother at the time of interviewing 
(Scale 4, +.74; Scale 19, +.54). It is interest- 
ing that this matter is orthogonal to Factor 

4, which deals with the caretaking of the baby. 

There are only two low loadings with this 
factor, which suggest ways in which caretaking 
has harassing effects upon at least some 
mothers: the heavy caretakers tend to have 
more variation in warmth due to mood (Scale 
6, +.27) and to have hostility that is not 
contingent upon the child’s performance 
(Scale 10, —.29). Perhaps much care of a 
child gives mother license for self-expression 

(though the evidence here is meagre). 
Factor 6: Control of mother directed aggression 
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and disobedience. This factor has to do with the 
control of negative feelings in the child that 
are directed toward the parent. The defining 
scales suggest? that mothers who have high 
scores on this factor tend mot to be positive or 
nonpunitive when the child becomes angry 
when scolded (Scale 3, —.76), and that they 
tend to be more aggressive when the child is 
angry or aggressive toward the mother (Scale 
18, +.64). 

There are four scales with low loadings (in 
the .30 to .40 range) which help to define the 
factor. These loadings suggest that mothers 
who are high on this factor tend to make con- 
sistent demands for nonroutine obedience 
(Scale 14, +-.36), to use physical punishment 
more frequently and intensely (Scale 15, 
+.33), to control the child in part through 
making gifts and privileges contingent upon 
his good behavior (Scale 16, +.39), and (for 
some reason) to permit play away from home 
at a relatively later age (Scale 23, +.36). 

The emergence of this factor certainly sug- 
gests the existence of a number of important 
dimensions of aggression that must be par- 
tialed out in cross-cultural analyses. If such 
is in fact the case, and to the degree that one 
views personality development from the per- 
spective of learning theory, then these dis- 
tinctions suggest that later peer aggression 
may be somewhat orthogonal to aggression 
directed toward those in authority. 

Factor 7: Emotional instability of the mother. 
This factor is strongly defined by Scale 8: 
“mood variation of hostility” (-++.91), but there 
is also a strong loading in the Scale 6 of +.74, 
reflecting the mood variation of the mother’s 
warmth as well. The lower loading of +.34 on 
Scale 7, “general hostility of the mother” 
somewhat broadens the meaning of the factor. 
This kind of factor may be very important in 
socialization, since a mother high in it may 
provide a difficult “target” for identification 
and an aggressive role model, and the study 
of such persons may provide clues to familial 
and even cultural sources of stress. 

The remaining three factors are less well de- 
fined than are the seven just discussed. Since 
the loadings are relatively low, and they ac- 
count for rather little of the variance, the fac- 
tors should be viewed as very tentative indeed. 

Factor 8. This factor is defined by two 
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scales, which suggest that mothers who score 
high tend to give warmth to the child which 
is not contingent upon his good performance 
(Scale 9, —.50), and that the chifdren of these 
mothers are able to dress themselves at a rela- 
tively later age (Scale 22, +.53)., We may 
thus be dealing with a weak factor suggestive 
of “mothering” or of “noncontingerit nurtur- 
ance.” The weak loadings are consistent with 
such an interpretation, suggesting that the 
mother keeps the child from responsibility 
(Scale 24, —.22, dealing with total frequency 
of chores) and that she does not try to use 
punishment to control behavior (Scale 10, 
—.29, suggesting low contingency of mother’s 
hostility; and Scale 15, —.29, showing some- 
what less tendency to use physical punish- 
ment in socialization). 

Factor 9. This relatively weak factor may 
be an artifact, since, though it deals with 
parental attitudes toward peer aggression it is 
not loaded on Scale 17, “reward for retalia- 
tion” directed toward peers. There may, how- 
ever, be some sense to this factor if it is viewed 
as “peer aggression permissiveness” (Scale 2, 
+.48, reflecting the “degree to which mother 
is positive when child fights other children”), 
tied somewhat to boys (Scale 27, +.35). It is 
interesting that the only other loading above 
.30 is the “general warmth of the mother” 
(Scale 5, +.32), which gives a picture of an 
expressive, positive mother, who permits her 
boy to be expressive as well. This suggestion 
can be checked from other data to be reported 
elsewhere. 

Factor 10. This weak factor has only one de- 
fining scale, and should probably be ignored 
(Scale 1, —.52, reflecting the fact that this 
mother does not give help to the child when 
she is busy). As a hunch, however, we may 
be dealing with a weakly adumbrated factor 
of “mother irritability,” since there is a weak 
loading on Scale 3 (—.27) suggesting that the 
mother tends also to be punitive when the 
child is angry when scolded, and on Scale 16 
(+.28), showing that she gives gifts and 
privileges contingent on the child’s good be- 
havior. 


Discussion 


We have already suggested possible con- 
texts in which the list of factors we have pre- 
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sented may prove useful. We have also sug- 
gested some of the additional problems regard- 
ing these data which await further analysis. 

For a number of reasons, it is difficult to 
relate our pancultural findings to previous re- 
search on parent behavior. First, all of the 
analyses available are intracultural rather than 
cross-cultural. Second, researchers in this area 
have not been uniform in their choice of the 
types of scales. Resemblance of factors, then, 
sometimes rests on one or two scales in one 
factor, which seem similar in meaning to scales 
in another. Third, there is diversity in the 
types of analysis. Baldwin, Kalhorn, and 
Breese (1949) for instance, analyzed their re- 
sults by means of cluster analysis; Sewell, 
Mussen, and Harris (1955) used the multiple 
group method in extracting their factors and 
extracted correlated factors by this method; 
Schaefer and Bell (1957) used the centroid 
factor analytic procedure, but oblique: rota- 
tion; Zuckerman, Ribback, Monashkin, and 
Norton (1958) describe unrotated factors and 
compare their results to unrotated factors ob- 
tained by Schaefer and Bell (1958); Milton 
(1958) used a procedure most comparable to 
ours, that is, centroid analyses and orthogonal 
rotation. Finally, comparison of factors ob- 
tained on different samples must be to some ex- 
tent subjective, despite the mammoth en- 
deavors of Ahmavaara (1957) and others, since 
there is as yet no definitively precise method of 
comparing factoral results obtained orf different 
samples. 

However, since there seems to be some 
similarity of factors across various American 
samples, it seems reasonable, despite the diffi- 
culties involved, to consider which of these 
factors have some pancultural basis. 

The factors that have commonly emerged 
in studies of American samples have been 
versions of warmth factors and permissive- 
restrictive factors. It is not surprising, then, 
that our warmth factor (Factor 2) and restric- 
tion factors (Factor 3, Factor 6) are the ones 
which have the most comparability with the 
results of other studies. 

Factor 2 (Use of positive affect by the 
mother) in the present study has a positive 
loading on maternal warmth and a negative 
loading on maternal hostility. Mothers high 
on this factor also praise their children and 
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tend not to use physical punishment. Milton 
(1958) also distinguishes a warmth factor 
(Factor C) which includes, among other scales, 
affectionate demonstrativeness and praise for 
table manners. Like our warmth factor, Mil- 
ton’s is orthogonal to his permissiveness factor. 
Zuckerman et al. (1958) and Schaefer and Bell 
(1958) find a hostile rejection factor in their 
unrotated area, which might be considered the 
opposite end of a warmth factor. Baldwin 
et al. (1949) also find a warmth factor in their 
cluster analysis. 

In our study we have isolated two (possibly 

three) factors concerned with aggression train- 
ing. One (Factor 3) is concerned with restric- 
tions on aggression to peers and obedience 
training, the other (Factor 6) is concerned 
with restrictions on aggression directed at the 
mother. Milton (1958) finds one aggression 
factor (Factor E) which includes treatment of 
aggression directed at peers and at parents. He 
finds that the factor loading for permissive- 
ness toward peer aggression is correlated with 
punishment of aggression to parents. We find 
them to be independent, appearing, as they 
do, on separate factors. The comparison with 
Milton’s data is further complicated by the 
fact that he also finds a permissiveness factor 
(Factor A) broader in scope than aggression 
training and obedience, but including these 
scales. Thus in Milton’s analysis, the scales 
measuring restrictions on aggressive behavior 
appear in two factors, but always together, 
whereas in our data they appear in two separate 
factors. Schaefer and Bell (1957) find a factor 
which they call Over-Possessiveness (Factor 
IV) and Zuckerman et al. (1958) find one 
called Authoritarian control (Factor T), both 
of which include a scale on restrictions on ag- 
gression. Baldwin et al. (1949) find a general 
restrictiveness cluster and Sewell et al. (1955) 
find a rather diverse factor, which they leave 
unnamed (Factor 7) but includes strict obedi- 
ence training. While these findings have only 
tangential resemblance to our aggression 
training factors, they do seem to be generally 
concerned with dimensions of strictness of dis- 
cipline. 

Our emotional instability factor (Factor 7) 
also has some parallels in other studies. 
Schaefer and Bell (1957) find a factor called 
Hostile Rejection of the Homemaking Role, 
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and Zuckerman et al. (1958) find a Hostility- 
Rejection factor, both of which include a scale 
of irritability. Baldwin et al. (1949) also find 
an Emotionaiity cluster. Milton’s (1958) Gen- 
eral Family Adjustment Factor, which includes 
hostility of the father and mother’s child rear- 
ing anxiety, may represent a related pattern. 

It remains to mention that Sewell et al. 
(1955) find a factor (Factor 4) which they 
call Non-Punitive Treatment that includes 
ignoring the child’s neglect of jobs. This might 
be similar to our first factor, which is con- 
cerned with responsibility training. 

Finally, none of the American studies we 
have reviewed include scales on the proportion 
of caretaking that is done by the mother. 
Therefore we find no factors comparable to 
our Factors 4 and 5 that deal with this area. 
These scales differ from most scales in our 
study and the other studies here reviewed in 
that they do not purely represent either atti- 
tudes towards child rearing practices, or per- 
sonality variables. However, such factors may 
certainly be related both to attitudes of the 
parents and to behavior of the child. The 
variable of time spent with the child has been 
extensively investigated with institutional 
children and the children of working mothers. 
It seems rather odd, in view of the great im- 
portance attributed to maternal presence, that 
this topic has not been more frequently in- 
ee in the type of analyses we have cited 

ere. 


SUMMARY 


We have presented the results of a factor 
analysis of child training practices. The data 
were gathered from mothers in six societies, 
North India, Mexico, Okinawa, the Philip- 
pines, Kenya, and a New England community. 
A small carefully selected sample of 21-24 
mothers was studied in each of these societies. 
In order to minimize intrasociety variation, 
the families studied in each country came from 
the same neighborhood and interacted with 
each other. The mothers of these samples 
should not be considered representative sam- 
ples of their societies, nor are the societies 
necessarily representative of types of world 
societies. The factors which were extracted 
are, however, preponderantly based on differ- 
ences between mothers (or their practices) 
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rather’ than differences in culture-context 
means. 

The correlation matrix of variables was 
factored by the centroid method and the fac- 
tors rotated orthogonally. Ten factors were 
extracted, seven of which were named and 
interpreted. They are: Responsibility demands 
on the child, Use of positive affect by mother, 
Peer aggression and obedience control, Pro- 
portion of caretaking of baby done by mother, 
Proportion of caretaking of the child done by 
mother, Control of aggression and disobedi- 
ence toward parents, and Emotional instabil- 
ity of the mother. Three other factors were 
discussed tentatively. 

Our findings are compared with those of 
other investigators. 
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SOURCES OF FRUSTRATION AND TARGETS OF AGGRESSION: 
A CROSS-CULTURAL STUDY? 
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persons may have more or less charac- 

teristic modes of response to frustration 
they may either direct the majority of their 
aggressive responses towards others or towards 
themselves. In a parallel manner it has also 
long been considered possible to characterize 
a person by his habits of attributing the causes 
of his frustrations to his own actions or to the 
actions of others. A purpose of the present 
research is to explore the possibility that 
small, relatively homogeneous societies may 
be fruitfully characterized in these ways. 
Another aim is to investigate the relationships 
among various types of culturally defined 
frustrating agents and various types of cul- 
turally defined targets of aggressive responses. 
The results are based on ratings of existing 
ethnographic reports from a world-wide “‘sam- 
ple” of nonliterate societies. 


i has long been considered possible that 
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In attempting to classify societies in terms 
of their methods of aggressive expression we 
have borrowed from Rosenzweig’s (1944) clas- 
sification of intrapunitive, extrapunitive, and 
impunitive responses, and have elaborated on 
this classification, particularly in distinguish- 
ing between culturally defined sources of frus- 
tration and culturally defined targets of aggres- 
sion. Rosenzweig defines three types of aggres- 
sive response: 


a. Extrapunitive responses are those in which the 
individual aggressively attributes frustration to exter- 
nal persons or things. The associated emotions are anger 
and resentment. The cognate psychological mechanism 
is projection. It should, however, be noted that the 
extrapunitive response is not always projective. When 
it proceeds directly from anger it has a very different 
character from that found when aggression is first in- 
hibited and then only indirectly finds expression in 
extrapunitive projection. ... 

b. Intrapunitive responses are those in which the 


1 Research for this paper was facilitated by funds 
from the Social Science Research Council and the Ford 
Foundation. Its completion was facilitated by funds 
from the National Institute of Mental Health. 
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individual aggressively attributes the frustration to 
himself. THe inturning of aggression is perhaps a con- 
sequence of its outward expression. Associated emotions 
are guilt and remorse. The related psychoanalytic 
mechanisms are displacement and isolation. ... 

c. Impunitive responses differ from both the ex- 
trapunitive and intrapunitive in that aggression does 
not apparently supply the motivating force; more 
socially directed or erotic drives are at work. Here the 
attempt is to avoid blame altogether, whether for others 
or for oneself, and to gloss over the frustrating situation 
as though with conciliatory objective. The cognate 
psychoanalytic mechanism is repression .. . (p. 383)? 

It should be noticed that Rosenzweig has 
packed many things into these definitions. 
For our purposes we have sought in addition: 
(a) to distinguish different classes of apper- 
ceived (or culturally defined) frustrators, (b) 
to distinguish between apperceived (or cul- 
turally defined) frustrators and apperceived 
targets of aggression, (c) to distinguish among 
some different classes of apperceived targets 
of aggression. We have thereby elaborated 
upon his classification by considering three 
types of potential external agents who might 
be culturally defined as frustrators, and three 
parallel types of agents who might be cul- 
turally defined as targets of aggression. The 
distinctions are made in terms of the “‘other’s” 
status in relation to “ego,” ego being defined 
as the average or modal adult of a society. 
The model therefore involves, besides the self, 
three types of individuals: superiors, peers, 
and inferiors, who might be either culturally 
defined agents of frustration or culturally de- 
fined targets of aggression in any combina- 
tion. Egos who characteristically define the 
self as a frustrating agent may be classified 
as having an intrapunitive outlook. Egos who 
believe that superiors, peers, or subordinates 
are frustrating agents have an extrapunitive 
outlook. Egos who apparently gloss over frus- 
trations, attributing them to none of the 
classes above, are defined as having an outlook 
of the impunitive variety. 

*Quoted from Saul Rosenzweig, “An Outline of 
Frustration Theory” in Personality and the Behavior 


Disorders, edited by J. McV. Hunt. Copyright 1944 
The Ronald Press Company. ee 
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In parallel fashion, egos who typically direct 
some form of aggression against themselves 
may be said to have intrapunitive responses. 
Egos who typically direct aggression at su- 
periors, peers, subordinates or outgroups (war- 
fare) have extrapunitive responses. Egos who 
are not aggressive to anyone of the four possi- 
ble targets are defined as representing impuni- 
tive responsiveness. 

By stating that the subject “aggressively 
attributes frustration to” another, himself, 
or no one, Rosenzweig (1944) assumes that the 
subject’s aggressive response is directed at the 
apperceived source of frustration. While this 
is a reasonable hypothesis in many situations, 
the existence of the phenomena of displaced 
aggression indicates that it is not always the 
case that the apperceived source is also the 
target. A primary purpose of this paper is to 
explore this very issue: whether or not people 
of a particular society characteristically direct 
aggression at those individuals whom they 
believe to be responsible for frustration—and 
whether people of societies with an impunitive 
outlook also exhibit infrequent aggressive re- 
sponses. 

In stating the problem thus we recognize 
that it can be argued that the aggression, al- 
though accompanied by a belief that justifies 
it, has always actually been engendered by 
agents other than the ones attacked. Indeed 
Whiting and Child (1953) assume that dis- 
placed aggression is usually rationalized by a 
justifying belief. The culturally defined beliefs 
and practices that we are discussing probably 
have at least partial antecedents in childhood 
experience, the source of displaced aggres- 
sion of interest to Whiting and Child. How- 
ever, the presence of a justifying belief for a 
type of response makes it difficult to classify 
the response as displaced in the context of the 
present analysis. For the purposes of this paper 
we have considered aggression as direct if it is 
supported by a belief that the object of aggres- 
sion is an agent of frustration and as displaced 
if such a belief does not mediate the aggressive 
response. Whether or not the aggression is in 
fact also partially or totally displaced from 
some other type of aggression rooted in child- 
hood experience is a problem for another anal- 
ysis. 

We also investigate here the relationships 
that exist in our “sample” among the various 
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types of frustrating agents and among the 
targets of aggression. Choices of frustrating 
agents or aggression targets may be mutually 
exclusive, such that “egos” who “assign” 
frustration to a particular agent tend not to 
regard other types of agents as frustrating, 
and groups who are aggressive to a particular 
type of individual tend not to be aggressive 
to other types. If this is the case we would 
expect to find negative correlations among 
ratings of categories of agents.and targets. 
If, on the other hand, the choices are inde- 
pendent of each other, we would expect zero 
correlations among these ratings. 

We have used ratings in our analysis that 
were developed at Cornell, with the exception 
of the warfare scale. Unfortunately, no meas- 
ures are available of the belief that subordi- 
nates are apperceived as frustrating agents. 
This aspect of the model must therefore re- 
main untested. 


METHOD 


Sample 

The data of this study consist of ratings made from 
ethnographic descriptions of nonliterate societies. The 
total number of societies on which the ratings were 
made is 60. However, the ethnographies often do not 
include enough information to enable the raters to make 
judgments on all the scales, so the actual sample size 
varies from one portion of the analysis to another. The 
sample is the same as that described by Lambert, 
Triandis, and Wolf (1959). 

e 


Ratings of Frustrating Agents 


All societies face a number of diferent kinds of 
frustration. We chose to make ratings about agents 
believed to be responsible for crop failure or for poor 
hunting. Most of these ratings were made by graduate 
students in a research seminar. Ratings on additional 
tribes were added by scholars employed for that pur- 
pose. All ratings were made on four-point scales, “four” 
representing the belief that a particular agent was 
always responsible for famine, and “one” representing 
absence of the belief. Each society was rated by two 
students, working independently, who subsequently 
discussed their ratings. If they failed to reach agreement 
on a particular scale, no rating was entered for that 
society. The specific ratings follow. t 

Exirapunitive Outlook—Superiors. It is assumed that 
“spirits” represent agents who are always viewed as 
superior to ego. Ratings were made on the importance 
of the belief that spirits cause damage to the food 
supply. Separate ratings were made for human male 


$The authors are grateful to Margery Wolf and 
Florence Rosenberg for their assistance in rating both 
frustrating agents and targets of aggression. 
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spirits, human female spirits, animal spirits, and spirits 
of indeterminate nature. The highest of these four rat- 
ings was used as the measure of extrapunitive outlook 
towards superiors. The average (independent) inter- 
rater reliability for these scales is .87.4 

Exirapunitive Outlook—Peers. Extrapunitive outlook 
towards peers was measured by the belief that an adult 
member of the tribe causes damage to the food supply. 
Separate ratings were made for women and men and 
the highest of these ratings was used in the analysis. 
The average (independent) interrater reliability of 
these scales is .86. 

A second measure of this type of belief was obtained 
from the ratings on the belief that some outgroup mem- 
ber, usually from some neighboring tribe, is responsible 
for crop or hunting failure. The independent reliability 
of this scale is .90. 

Extrapunitive Outlook—Overall Measure. An overall 
measure of extrapunitive outlook was computed by 
adding, for each society, the scores used in the three 
preceding scales: the high score for the belief that spirits 
and tribal members cause famine and the belief that 
outgroup members cause famine. 

Intrapunitive Outlook—“Self-Blame.”’ Self-blame was 
indexed by the belief that the entire tribe is collectively 
responsible for damage to the food supply. Since this 
belief is relatively rare, the scale as used in the analysis 
represents a measure of presence or absence of the be- 
lief. The independent reliability of this scale is .83, 

Impunitive Outlook. A measure of impunitive out- 
look was obtained by adding, for each society, their 
tating on overall extrapunitiveness and their rating on 
intrapunitiveness. A low score on this combined measure 
indicates that members of the society do not blame any 
of the types of sources considered for failure of the food 
supply and are therefore defined as impunitive in out- 
look, 


Measurement of Targets of Aggression 


These fatings® were also made by the students of a 
research seminar and by paid scholars. The ratings are 
all on seven-point scales. Two students read the mate- 
rial for each of the tribes, After making independent 
ratings the students met and discussed those ratings on 
which there was more than a two point discrepancy, 
in an effort to reach agreement. If the disagreement 
between raters was two points or one point, the judg- 
ments were averaged and this average score was used 
for analysis. If the judges could not agree on their 
ratings to within two points the rating was omitted. 
A similar procedure was used by the personnel who 
later made ratings on additional tribes. 

Extrapunitive Response—Superiors. This was meas- 
ured through ratings on the frequency of coercion of 
gods or spirits. “Coercion” is defined for present pur- 
poses as insulting a god or spirit or beating an image 
of a god or spirit. A rating of seven indicates that such 


4 All reliability coefficients are Pearson product- 
moment coefficients. 

5 Although Margery Wolf and Florence Rosenberg 
tated societies on both agents of frustration and targets 
of aggression, the students who did the majority of the 
ratings on frustrating agents were not the same stu- 
dents as those who rated targets of aggression. 
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practices occur every day or oftener, and a rating of 
one indicates that coercion never occurs. The median 
for this scale falls between—no coercion (1), and the 
lowest intensity of coercion (2). Therefore the scale as 
used in the analysis amounts to a measurement of 
presence or absence of coercion of supernaturals, The 
independent reliability of this scale is .82. 

Extrapunitive Response—Peers. This was measured 
by ratings on the frequency of verbal aggression among 
tribal aduits. Verbal aggression is defined as noncere- 
monial quarreling, insulting, and gossiping. Insulting 
of a nontribal member is not included in this rating. 
The reliability of this scale is .78. 

Extrapunitive Response—Outgroup, This was meas- 
sured by the frequency with which the tribe engages in 
warfare. In using these ratings, it must be borne in 
mind that colonial administrations have put an end to 
war in most primitive societies. Therefore the warfare 
ratings often represent the tribal life at a precolonial 
period of time, whereas the other aggression ratings 
usually represent the behavior as seen by the ethnog- 
rapher at the time of the field study. While one may 
assume that the current aggression of the “ethnographic 
present” is in some degree representative of the aggres- 
sive beliefs and behavior of the society during the period 
when they were engaged in warfare, one must not ignore 
the possibility of changes over time in such beliefs and 
behaviors. Indeed, the suppression of warfare may, in 
itself, alter other forms of aggressive behavior, The — 
frequency of warfare ratings were collected by Jerome 
Bruner. 

Since Bruner’s sample of tribes was smaller than 
ours and somewhat different, the N for the warfare 
data is smaller than that involving other scales. The ` 
total number of tribes common to both samples is 47. 
The frequency of warfare was rated on an 11-point 
scale. In order to make this scale comparable to the 
seven-point scales of the other ratings on targets of 
aggression, this scale was arbitrarily condensed to 7 
points, by making 0, 1 = 1; 2,3 = 2; 4 = 3;5 = 4} 
6 = 5;7,8 = 6,9, 10 = 7. j 

Extrapunitive Response—Inferior. This was meas- 
ured by the amount of aggression displayed to initiates 
in the initiation ceremony. This frequency measure, 
unlike the two preceding scales, is not a measure of - 
aggression over a time period, since initiation cere- 
monies are seldom held more often than once a year. 
Instead, the aggression to initiates is measured in terms 
of the proportion of the total ceremony that is devoted 
to initiate-directed aggression. The highest rating on 
this scale means that the entire ceremony is judged as 
being aggressive. The lowest rating is “no aggression.” 

In using this variable the question arises as to 
whether to include tribes that do not have initiation 
ceremonies and give them a rating of 1 (e.g., “no ag- 
gression”), or whether one should limit the analysis to 
only those tribes where such a ceremony does exist. In 
the data of this analysis the tribes without ceremonies — 
are included with ratings of 1. Several of the correla- 
tions between this and other variables have been com- 
puted with the no ceremony tribes omitted; the results 
remain unchanged with respect to direction or signifi- 
cance. The independent reliability of this scale is .91. 

Extrapunitive Response—Overall Measure. A general 
measure of extrapunitive responsiveness was obtained 3 
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by adding, for each society, the scores on frequency of 
coercion of the gods, frequency of verbal aggression, 
frequency of warfare, and proportion of time devoted 
to aggression in initiation ceremonies. 

Intrapunitive Response. This was meafured by ratings 
on the frequency of sacrifice. Sacrifice is defined as 
giving up anything of real to symbolic value to the 
deity or deities. The independent reliability of this 
scale is .86. 

Impunitive Response. An index of impunifive respon- 
siveness was obtained by adding, for each society, the 
score on overall extrapunitive responsiveness and the 
score on intrapunitive responsiveness. A low score in 
this index indicates that members of the society are 
generally low on expression of aggression, at least ac- 
cording to the categories entertained in the present 
conceptual scheme. 

These scales, of course, do not represent all possible 
types of aggression. In the area of agents of frustration 
we have measured only one type of frustration, threat 
to the food supply. Our ratings of targets of aggression 
do not include, for instance, measures of brawls, murder, 
or suicide, nor do they include ceremonial aggression 
against any of the societies’ members other than initi- 
ates. Ratings on frequency of brawls have been made 
but the variability of the ratings is too low for them 
to be useful in this analysis. Ratings on frequency of 
murder and suicide have been made but not for enough 
of the societies in the sample. In any case, since these 
are exceptional events involving individuals rather 
than the group as a whole (ego as defined above), there 
is some argument in favor of not including them in the 
present analysis. 

We recognize that incomplete coverage of types of 
aggressive response undoubtedly leads to some error in 
the classification of societies with respect to overall 
extrapunitiveness and impunitiveness. However, the 
assessment of the total amount of aggression in a society 
is a virtually impossible task. Therefore we must simply 
hope that we have a reasonably representative sample 
¢ ie saeiteave beliefs and responses of the societies 
studied. 


RESULTS 


Inierrelationships among the Apperceived Frus- 
trating Agents 


Table 1 presents the relationships among 
the various ratings of types of agents of crop 
and hunting failure. Each case in the contin- 
gency tables represents a society. For each 
analysis the distribution is dichotomized as 
close to the median as possible. The main 
finding in Table 1 is that none of the chi 
square values in this table approaches signifi- 
cance, and most are virtually zero. Evidently 
beliefs concerning types of famine-producing 
agents are independent of one another. Un- 
fortunately, our measure of intrapunitive out- 
look (i.e, the belief that the entire tribe is 
responsible for failure of the food supply) is 
tather rare. Although this finding is itself in- 
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TABLE 1 
RELATIONSHIPS AMONG APPERCEIVED 
AGENTS OF FAMINE 
anibal | Outgroup | Spirits 
k #3 ’ F E|? 
elS|" leis Š 
Whole Tribe 
High Oj 6|ns| 4| 1) ms| S| 2] ns 
Low 8 |16 6 | 15 13 | it 
Tribal Member 
High 3| S| ms| 4| 6] ns 
Low 10 | 12 14 |17 
Outgroup 
High 8| 6) ns 
Low 9| 9 


a p is based on x? values, All probabilities are based on two- 
tailed tests. 


teresting, the infrequency of this belief limits 
the usefulness of the scale for the purposes of 
the present analysis. 


Interrelationships among Targets of Aggression 


Table 2 presents the relationships among the 
different targets of aggression. The most strik- 
ing pattern that emerges from this table is the 
consistent presence of a fairly high negative 
relationship between frequency of verbal ag- 
gression to peers and the other measures of 
aggressive response. The probability of the 
negative association between verbal aggres- 
sion and frequency of sacrifice or o$ coercion 
of the gods is less than 5% and the probabil- 
ity of the relationship with aggression in initia- 
tion ceremonies is less than 10%. 

While the relation between verbal aggres- 
sion and frequency of warfare is not signifi- 
cant when societies without any warfare are 
included in the analysis, the relationship be- 
comes negative and significant at the 5% 
level, if only societies with some token of 
warfare are considered. Since total absence of 
warfare may represent either lack of aggres- 
sive intent or lack of opportunity due to isola- 
tion, lack of resources to engage in warfare, 
or colonial control, it seems reasonable to 
consider only those tribes with at least some 
frequency of warfare greater than zero in 
making this comparison. ; 

The other significant relationship among 
targets of aggression is the positive associa- 
tion between frequency of sacrifice and amount 
of aggression to initiates. 
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TABLE 2 
RELATIONSHIPS AMONG RECIPIENTS OF AGGRESSION 
Amount of Frequency of F £ F A 
Verbal'Aggitscion| Aggression to | Coercionof | “Warfare o eae 
[High | Low flies ae ata een Jen nen baad | Hee | Low 2 
Frequency of Sacrifice 7 
sian 7| 12| .02| 16} 7] 02] 11|"9! ns | 11] 7] ns 
Low 17| 6 6} 13 10 | 14 7 | 13 
Frequency of Verbal Ag- 
gression 
High 7| 13] .10} 3] 12] .01| 4] 10] ns 4 | 10 
Low . 14| 9 13) 7 9| 9 9 4 
Amount of Aggression to 
Initiates 
High 10 | 12| ns |13| 6] .10 
w 15| 9 7 |12 
Frequency of Coercion of 
Gods ns 
High 6 | 12 
Low 9| 6 


a Analysis includes only those societies having token warfare or more, 
b p is based on exact x? values, 


Relationships between Apperceived Frustrating of extrapunitive response is not highly associ- 
Agents and the Target of Aggression ated with the index of extrapunitive outlook 


Table 3 shows the relationships among as- 
sumed agents of famine and targets of aggres- 


and that the measures of impunitive outlook 
and impunitive response are not related to 


sion. The intrapunitive belief that the entire each other. We also find that the measure of 


tribe is responsible for famine is not, as might 
be expected, related to the intrapunitive re- 
sponse of sacrifice, but is most positively as- 
sociated with frequency of coercion of the 


frequency of warfare does not seem to be a 
particularly sensitive measure of extrapuni- 
tive responsiveness. In view of the archaic 
nature of the frequency of warfare ratings, we 


gods (p < .10). re-computed the indices of extrapunitive and 


In the, realm of extrapunitiveness, we find impunitive responsiveness, omitting this meas- 
that the belief that some tribal member is W°, and including in the analysis only those 
responsible for failure of the food supply is societies where some outside authority, such 


positively related to the extrapunitive response 25 


a colonial administration, had passed laws 


of aggression to initiates (p < 05), but not 8ainst warfare. For such societies one may 
to verbal aggression to peers. It is negatively sume that the extrapunitive outlet of war- 
associated with frequency of warfare (p = fare is impossible, or extremely limited. The 
.02). The belief that some outgroup member results of analyses with these indices are pre- 
is the culprit is not, as might be expected, re- sented in Table 4. Also presented are the rela- 
lated to frequency of warfare, but is positively tionships between frequency of warfare and 
associated with coercion of the gods (p < 10). agents of famine for those societies where at 
The overall index of extrapunitive outlook is least token warfare is present. 
also positively associated with frequency of The elimination of tribes without warfare 
coercion of the gods, while the index of impuni- does not substantially alter the relationships 
tive outlook is negatively associated with this between frequency of warfare and agents of 
variable. famine. Frequency of warfare for this re- 
Aside from the associations just mentioned, *Forty-three of the societies in our sample have 
the indices of overall extrapunitive response  ¢xtemally imposed laws against war, 10 societies have 


and of impunitive response do not relate to °° 


any other measures of frustrating agents. It 


such laws, and 4 were rated as “don’t knows.” No 


tatings on this matter have been made for the remaining 
societies. These ratings were made in a manner similar 


is particularly disappointing that the index to the other ratings, 


; 
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TABLE 3 
RELATIONSHIPS AMONG ÅSSUMED AGENTS OF FRUSTRATION AND TARGETS OF AGGRESSION 
Targets of Aggression 
Frequency of Freguefcy of Amount of 
ived Sacrifice Aggression to of |. Overall a) 
pret Sant eames” | es! Reman Mat 
e 
= E E k E 
JERPARERPAHPHERHEAGHE 
Whole Tribe 
High 4| 2 3] 3\ns| 1| Sins} 2| 1\ns| 5111-10310 jns] 0] 3 [ns 
Low 11 | 13 |ms| 8 | 11 13 | 10 7 | 10 7 | 12 517 8/5 
Tribal Member 
High s| sias| 7| 8|ns| 6| 4).05| 3 |10 ].02| 6 5 \ns|4|4\ns| 4 | 4\ms 
Low 17 | 16 10 | 12 12 | 20 13| 7 13 | 16 7\8 TAN: 
Outgroup 
High sl olasi 7| z\ns| 9| 6las| 6| 4\ms| 8| 5 |.101 7] 4 \ns) 3 | 6 ims 
Low 7 |10 8| 7 7| 8 SPO 4| 11 3|6 7\4 
Spirits 
High 11/12 ns! 9} 7\ns| 9| 10 \ns| 7] Tinel 8] 9 |ms| 5] 5 |ms| 4) 6 ims 
Low 10| 9 9) 9 13 | 10 6| 7 10 | 10 5|6 6\4 
Overall Extra- 
punitive 
Outlook? 
High 10110 |ns| 8| olas) 9} 10|ns| 5| 9|ns| 9| 8110616 |ns} Ge) 2 jms 
Low 3| 8 ses 4| 7 7| 4 3 | 10 3|4 4\2 
Impunitive Out- 
look® 
High al slas al dlas 2| Sime] 4| 7 ]ms| £f 12 1:051 714 fns) 7) 2m 
Low 9| 6 6| ó 9| 7 4| 4 | 5| 2 213 1|4 


a Verbal Aggression + Aggression to Initiates + Coercion of Gods + Warfare. 

Verbal Aggression + Aggression to Initiates + Coercion of Gods + Warfare + Sacrifice. 
© p is based on exact x? values. 

4 Tribal Member + Outgroup + Spirits. 

© Tribal Member + Outgroup + Spirits + Whole Tribe. 


TABLE 4 
RELATIONSHIPS AMONG AGENTS OF FAMINE AND TARGETS OF AGGRESSION FOR SOME SUBSAMPLES OF THE Tora N 
High Low b High Low $ High Low b 
Mein r A aia 
Whole Tribe 
High 1 6 ns 3 2 ns 3 1 ns 
Low 1 6 6 7 5 9 
ae pe 02 6 3 ns 5 4 ns 
13 4 
s Eo 3 6 10 il 5 
utgroup 
A 5 ns 7 4 ns 5 3 ns 
bas g 7 3 6 4 5 
Spirits 
High 5 6 ns 6 7 ns 6 T ns 
a 5 4 6 7 4 5 
Ba Extrapunitive Outlook i 7 i 4 5 20 6 8 a 
Low 5 3 1 3 3 2 
moit 
Oe Outlook ss 3 je 4 6 eg 6 3 10 
Low i 3 4 2 1 4 


i Analysis includes only those societies having token warfare or more. z 
Index computed only for those societies with externally imposed laws against warfare. 
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stricted sample is still negatively related to 
the belief that some tribal member is responsi- 
ble, but is still not related to the index of the 
belief that the outgroup is a frustrating agent. 

The elimination of the warfare ratings from 
the indices of extrapunitive and impunitive 
response does alter the picture somewhat for 
these indices. For this sample, impunitive re- 
sponsiveness is positively related to the index 
of impunitive outlook (p < .10). There is also 
a tendency for overall extrapunitive outlook 
to be positively associated with overall extra- 
punitive responsiveness, although this trend 
does not reach any conventional level of sig- 
nificance. 


Discussion 
Interrelationships among Agents of Frustration 


Beliefs concerning the type of agents re- 
sponsible for famine are evidently relatively 
independent of one another, since no signifi- 
cant relationships appear in Table 1. Societies 
may be measured with respect to the preva- 
lence of beliefs in various frustrating agents in 
this area, but they usually cannot be typed 
as favoring exclusively one or another form of 
belief. 


Interrelationships among Targets of Aggression 


As we have said, the most consistent pattern 

in Table 2 is the consistency of negative 

relationships between the extrapunitive re- 

sponse of verbal aggression and all other meas- 

ures of aggressive response. This can perhaps 

be best explained as an inverse relationship 

between spontaneous and structured aggression. 

Verbal aggression is the only measure of ag- 

gression that takes place outside of a struc- 
tured context: it may occur anywhere, at any 
time. In contrast, our measures of sacrifice, 
coercion of the gods, and aggression to initiates 
all occur in the course of some ceremony, the 
timing of which may not be decided by the 
aggressor. The timing of attacks in war is 
similarly beyond individual control, depending 
upon strategic considerations or the attack 
of the enemy. This difference, note, is not a 
distinction between direct and displaced ag- 
gression, since any of the four directions of 
aggressive response might be instigated either 
by frustration from the agent against whom 
the aggression is directed or by frustration 
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from some other agent. Nor can it be assumed 
that verbal aggression necessarily follows im- 
mediately upon the frustrating event, since 
both gossip and direct insults may well be de- 
layed actions, But delayed or not, the timing 
and form of the verbal aggression may be 
chosen by the aggressor and presumably occurs 
in the presence of hostile emotions, whereas 
the ceremonial aggression and warfare usually 
occur at a predetermined time when, at least 
in the case of ceremonial aggression, the ag- 
gressor may or may not experience hostile 
emotions. It would seem that one can distin- 
guish two kinds of societies, one in which 
people express their aggression spontaneously 
in unstructured situations and another in 
which they devise more structured means for 
aggressive outlet. These data provide some 
justification for Rosenzweig’s (1944) conten- 
tion, quoted above, that: “when it (aggression) 
proceeds directly from anger it has a very 
different character from that found when 
aggression is first inhibited and then only in- 
directly finds expression in extrapunitive pro- 
jection” (p. 383). 

The other significant relationship among 
recipients of aggression is the positive associa- 
tion between frequency of sacrifice and amount 
of aggression toward initiates. The reason for 
this relationship is not clear. It is possible, of 
course, that aggression directed toward initi- 
ates is aggression directed toward the children 
of ego, and, therefore, toward “his” extended 
ego. Further research on the antecedents or 
concomitants of both practices is necessary 
to determine the connection between them. 


Interrelationships between Agents of Frustration 
and Targets of Aggression 


The relationships presented in Table 3 are 
designed to answer the primary question of 
this research: whether or not societies typically 
direct their aggression to those agents whom 
they believe to be frustrating or whether they 
displace aggression to (possibly) innocent in- 
dividuals. 

Looking at the measure of intrapunitive 
outlook one sees that the intrapunitive belief 
that the entire tribe is responsible for famine 
1s not related to the intrapunitive response of 
sacrifice. One cannot regard this test as con- 
clusive because of the low frequency with 


i 


) 


" which our index of intrapunitive outlook is 
counted as present. However, these data fail 


to demonstrate a relationship between intra- 

itive outlook and intrapunitive response. 
f There is, however, some evidence for a posi- 
tive association between extrapunitive outlook 
and extrapunitive response. None of the meas- 
ures of extrapunitive outlook (i.e., beliefs that 
tribal members, outgroup members, or spirits 
are responsible for famine) are related to either 
the intrapunitive response of sacrifice or to 
the index of impunitive responsiveness. If such 
aggression takes the form of fairly direct re- 
taliation, we would expect that the belief that 
a tribal member is responsible for famine 
would be positively associated with aggression 
toward members of the tribe. We have two 
‘ratings of such aggression, verbal aggression 
and aggression to initiates. Belief in the guilt 
of tribal members is positively related to one 
of these responses, aggression against initiates 
(p < .05). This is, however, the only relation- 
ship in which aggression is directed against 
the kind of agent believed to be responsible 
for the frustration. The belief that the out- 
group causes famine is not significantly re- 
lated to frequency of warfare, nor is belief 
that the spirits are frustrating agents related 
to coercion of the gods and spirits. 

We do find, however, that the extrapunitive 
belief that the outgroup is responsible for 
famine, and the index of overall extrapunitive 
outlook, are positively associated with the 
Coercion of the gods and spirits. This might 
indicate that an extrapunitive belief may re- 
sult in an extrapunitive response, but one 
which is directed against some target other 
than the one believed to be frustrating. The 
deities seem to be a popular target of aggres- 
sion that is engendered from other sources. 
This being the case, it is somewhat surprising 
that coercion of the deities is not related to the 
belief that the spirits themselves are responsi- 
ble. The measure of spirits as agents of famine 
seems to be a somewhat insensitive index, 
since it is not significantly associated with any 
other measure. In order to check on the possi- 
bility that it is this particular measure of ag- 
gressiveness of the deities which is responsible 
for the lack of relationship, we plotted the re- 
lationship between coercion of the gods and a 
general rating of the frequency with which 
the gods are aggressive in all areas of their 
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behavior.” The probability of this positive rela- 
tionship is less than 2%. Frustration by the: 
deities is, then, associated with aggression 
against them, when this broader measure is. 
used. 

The relationship between aggression by the: 
gods and aggression to them may be the ex- 
planation for the positive correlation between 
the belief that the whole tribe is responsible: 
for famine and frequency of coercion of the 
gods (p < .10). This is the only example of a 
response that is not accompanied by a belief 
falling at least within that category of re- 
sponse, according to Rosenzweig’s (1944) clas- 
sification. This association can perhaps be ex- 
plained by the nature of the deviation that 
causes the failure of the food supply. Fre- 
quently the crime committed by the tribe is 
an offense against the gods. The group fails, 
in some way, to pay homage to the gods, either 
by some sacrilegious act or through failure to 
perform some sacrifice or ceremoney. In retali- 
ation the gods send famine. Therefore, al- 
though the group is responsible, the gods are 
also responsible for the misfortune. Viewed 
in this fashion, the relationship is an extra- 
punitive-extrapunitive one. The society retali- 
ates against their hostile gods by being in turn 
hostile to them. 

There is one negative relationship between 
an extrapunitive belief and an extrapunitive 
response, that between the belief that a mem- 
ber of the society causes famine ang frequency 
of warfare. It seems reasonable that if warfare 
is a direct response to frustration, it would be 
unlikely to occur when some member of the 
ingroup is perceived as the frustrating agent. 
This argument would carry more weight, how- 
ever, if warfare were positively related to the 
belief that the outgroup causes famine. 

The general positive association between 
the indices of impunitive outlook and impuni- 
tive response presented in Table 4 is perhaps 
the most interesting finding of this research. 
Certain societies are apparently nonaggressive 
both in outlook and response. In future work 
we hope to examine other characteristics of 
these impunitive societies and their aggressive 


7 This rating was developed in the same manner as 
those ratings presented in this text. The frequency of 
aggression of the gods was measured against the base- 
line of their total behavior (see Lambert, Triandis, & 


Wolf, 1959). 
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counterparts that may account for this varia- 
tion in aggressive tendencies. 

It would seem, in summary, that the types 
of aggressiveness of Rosenzweig’s (1944) classi- 
fication at the societal level, at least, are better 
viewed as relatively independent variables 
rather than mutually exclusive categories. Be- 
liefs about apperceived agents of frustration 
are independent of each other, and the nega- 
tive associations among types of aggressive 
responses are generally between spontaneous 
vs, ceremonialized aggression rather than one 
type of aggressive target and another. Fur- 
thermore, for the most part, intrapunitive, 
extrapunitive, and impunitive beliefs are in- 
dependent of responses in other categories 
(e.g., extrapunitive beliefs are unrelated to 
either impunitive or intrapunitive responses). 
Rosenzweig’s implied contention that the ag- 
gressive response is typically directed at the 
apperceived, agent of frustration is upheld for 
extrapunitiveness and impunitiveness. Intra- 
punitive outlook was found to be unrelated to 
intrapunitive responsiveness. This lack of rela- 
tionship may, however, be due to the relative 
rarity in the cultures studied of the bases for 
our measure of intrapunitive belief. 


SUMMARY 

This research represents a cross-cultural in- 
vestigation of relationships between beliefs 
about people thought to be responsible for 
damage to.the food supply and types of people 
against whom the members of these societies 
direct their aggressive responses. Rosenzweig’s 
(1944) classification of intrapunitive, extra- 
punitive, and impunitive types of aggressive- 
ness was employed and elaborated upon. 
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Ratings were made on the basis of ethno- 
graphic material on a world-wide sample of 
60 primitive societies. The ratings included 
judgments of intrapunitive, extrapunitive, and 
impunitive beliefs about famine-producing 
agents and intrapunitive, extrapunitive, and 
impunitive types of response. Extrapunitive 
beliefs ard responses were further distin- 
guished in terms of whether the frustrating 
agents or targets of aggression were superior, 
equal to, or inferior in status to the typical 
adult male of the society. 

No evidence was found for intrapunitive re- 
sponsiveness as a reaction to intrapunitive 
outlook. Instead, intrapunitive outlook was 
found to be positively associated with the 
extrapunitive response of coercion of the gods. 

Extrapunitive outlook was generally found 
to be associated with extrapunitive responsive- 
ness. Impunitive outlook was found to be as- 
sociated with impunitive responsiveness. 

The data are discussed in terms of displace- 
ment, projection, and Rosenzweig’s classifica- 
tion. 
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THE PERPETUATION OF AN ARBITRARY TRADITION 
THROUGH SEVERAL GENERATIONS OF A 
LABORATORY MICROCULTURE 


ROBERT C. JACOBS! ann DONALD T. CAMPBELL 


Northwestern 


ccoRDING to Sumner (1906) “the mores 
A can make anything right,” and he 
went on to illustrate with a chapter on 
sacral harlotry and child sacrifice. According 
to Tarde (1903) “imitation is the key to the 
social mystery,” “society is imitation,” “social 
man is a veritable somnambulist.” Such quo- 
tations indicate the late 19th century aware- 
ness of the power of culture to perpetuate 
arbitrary beliefs. Less drastically expressed, 
some such perspective permeates present day 
social science, although tempered somewhat 
by functionalism in anthropology and soci- 
ology. In social psychology, for example, 
cultural tradition tends to be invoked pri- 
marily as an “explanation” for social evils, 
prejudice, resistance to ameliorative social 
change, and the like. The current emphasis 
upon conformity reinforces this view, in spite 
of Asch’s (1952) demurrer. Yet there are, no 
doubt, restraints upon a complete arbitrariness 
to culture, particularly with regard to beliefs 
that lie within people’s direct range of observa- 
tion. Thus arbitrary and erroneous supersti- 
tions about the shape of fishhooks probably 
would not long survive the systematic drift 
pressures resulting from continual minor varia- 
tions some of which are in the direction of 
noticeably more effective form. Here lies the 
problem in the theory of culture for which a 
laboratory analogue was sought. 

Since Sherif’s (1936) classic studies on the 
formation of “social norms” in laboratory 
groups, there have been sporadic efforts to 
bring the process of cultural transmission into 
the laboratory. While much small group re- 
search might be so interpreted, the present 
study was in particular inspired by Sherif’s 
studies, by Rose and Felton’s (1955) “experi- 
mental histories of culture,” and by the 
preliminary report of experiments on the 
evolution of ‘microcultures” by Gerard, 
Kluckhohn, and Rapoport .(1956). In the 


1 Now at the University of Colorado Medical School. 
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latter two studies and in the present one, there 
is an effort to demonstrate a perpetuation of 
“cultural” characteristics that transcends the 
replacement of individual persons. In the 
present study confederates have been employed 
to establish an extreme cultural norm, the 
inculcation and survival of which is then 
studied as the confederates are one by one 
taken out of the group, naive new members 
gradually introduced, who then unwittingly 
become the further transmitters of the belief 
to still newer entrants. 

The specific culture trait and the ecology of 
the group relative to such traits should obvi- 
ously affect the success in the indoctrination 
of each new generation, and the rate of erosion 
of the tradition by innovation. In the Rose and 
Felton (1955) study, group interpretations of 
Rorschach cards were employed, a material 
offering considerable latitude for cultural arbi- 
trariness and idiosyncrasy. In the Gerard, 
Kluckhohn, and Rapoport (1956) study, the 
groups were continually faced with new in- 
stances of a very difficult puzzle series, and 
were continually confronted with evidences of 
the lack of perfection of their ¢raditional 
solutions, a situation encouraging the evolu- 
tion of more adaptive norms, and unfavorable 
to the perpetuation of pure superstition. For 
the present study, a task was sought which 
would allow as much latitude for cultural arbi- 
trariness as the Rose and Felton situation, 
and which would, in addition, make possible 
the ready quantification of effects. The original 
Sherif (1936) autokinetic movement situation 
was judged to provide this: if a person views a 
pinpoint of light in an otherwise totally 
blacked-out room, the light soon appears to 
move. The illusion is a strong one for most 
persons, and is apparent even to those who 
know the light to be stationary. As Sherif has 
shown, naive respondents are very suggestible 
as to the degree of movement seen, and labora- 
tory group norms on the amount of movement 
are rapidly established without the respondents 
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being aware of the fictitiousness of their 
judgments. 

So labile is the autokinetic experience or at 
least the translation of it into judgments of 
linear extent, that one reading the reports of 
studies employing it might expect that an 
arbitrary group norm once established would 
be passed on indefinitely without diminution; 
that once well indoctrinated, the naive group 
members would become as rigid and reliable 
spokesmen for the norm as were the confeder- 
ates who preceded them; that each new genera- 
tion would unwittingly become a part of a 
self-perpetuating cultural conspiracy propa- 
gating superstition and falsehood. 

But the autokinetic experience is not com- 
pletely labile. The very fact that sophisticated 
observers under strong “suggestion” to see the 
light remain fixed still perceive movement, 
shows this. Haggard and Rose (1944), in at- 
tempting to condition perceived movement on 
a right-left dimension, found for most respond- 
ents a strong rightward bias. In preliminaries to 
the present experiment, we learned that for 
our setting, respondents reporting alone pro- 
duced mean individual judgments over 30 
trials of from .64 to 6.67 inches, and with 
individual judgments rarely if ever approach- 
ing the arbitrary cultural norm of 15 to 16 
inches which the confederates in our study 
provide. It is the presence of this potential 
natural norm that makes our inculcated cul- 
tural norne truly “arbitrary,” and provides a 
counterpressure to the cultural tradition in its 
transmission across the experimental genera- 
tions. 


METHOD 


Respondents 


The 175 respondents who took part in this experi- 
ment were students, enrolled in introductory social 
sciences courses. They were unsophisticated concerning 
the autokinetic phenomenon and were unadvised as to 
the nature of the experiment previous to the time of 
serving in it. The respondents were assigned randomly 
to the six conditions involved. The number of respond- 
ents participating per group is shown in Table 1. 


Materials, Apparatus, and Common Procedure 


The experiment took place in a completely darkened, 
windowless, fan-ventilated room. The respondents were 
seated in a row 8 feet from a box designed to emit a 
small pinpoint of light. A 1 rpm induction motor was 
attached in series with the lamp, and a single switch 
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controlled both. The motor was used solely as an audi- 
tory effect supporting the illusion of movement. 

All respondents were blindfolded when brought into 
the experimental situation and were asked to put their 
blindfolds on wHenever the door was opened to allow 
old respondents to leave and new ones to enter. This 
gave the respondents little if any knowledge of the 
size and arrangement of the experimental room. The 
individual respondents received the following instruc- 
tions: s 

This is a movement experiment. It is designed to 
test the students’ visual perceptions with respect to 
space. In the next room is an apparatus that will 
project a small light. A few seconds after we begin, 
the light will move. It may move in a wavy motion 
or pattern or it may follow a recognizable course. 
However, your specific task will pertain merely to 
judging the distance that the light moves from the 
time that it appears until it is turned off. All judg- 
ments should be made in inches along a straight line 
connecting the starting point and the ending point 
on each trial. 

Respondents going into group conditions received these 
additional instructions: 

You are going to join a student (or number of 
students) who are already serving in this experi- 
ment. ... You will be seated in the seat on the far 
left. To simplify recording procedure I will ask you 
to report your judgments beginning with the person 
on the far right and ending with you on the far left. 
When I stop you the person on the far right will 
leave, each of you inside will move right one seat, and 
a new person will be brought in. 

Each trial was 5 seconds in length and after each 
block of 30 trials a ventilation fan was turned on by 
the experimenter. During this time in the group condi- 
tions, the “oldest” member of the group was taken 
out, other members moved to the right one seat, and a 
new member introduced. In the solitary conditions, 
respondents were told that ventilation took place as a 
health requirement. The ventilation period was timed 
to take approximately the same amount of time that 
was necessary to give instructions to a new respondent 
and bring him in in the other conditions. 


Experimental Conditions 


Table 1 shows the main feature of the six treatment 
conditions. (These have been given designations sum- 
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Fic. 1. Mean judgments of the control groups. 


marizing the group’s construction. C and X stand for 
control and experimental. The first digit indicates the 


_ size of the group, while the second digit indicates the 


number of confederates present in the first generation.) 
‘There are two control conditions. In C-1-0 each re- 
spondent judged the movement of the light in solitude 
for four periods (called generations for the group con- 
ditions) of 30 judgments each. In C-3-0 respondents 
were run in groups of three, replacing one each genera- 
tion (or 30 trials), for a total of nine generations. 

A major dimension of experimental variation was 
the number of “culture-bearing” confederates with 
which the first naive respondent found himself placed. 
In experimental groups X-2-1, X-3-2, and X-43, the 
initial generation consisted of a solitary naive respond- 
ent sitting to the left of one, two, or three confederates 
who gave their judgments before he did, and who had 
been instructed to give judgments between 15 and 16 
inches. The experimental variation in number of con- 
federates was expected to produce cultural traditions of 
increasing strength. Confederates were recruited from 
the same subject pool as were the naive respondents. 
All were pledged to secrecy, with apparent success. 

Another mode of manipulating cultural strength was 
explored by adding X-3-1, a group of two naive re- 
spondents led by one confederate. Thus C-3-0, X-3-1, 
and X-3-2 provide a set of three-person groups of 0, 
1, and 2 confederates in the first generation. 

‘The confederates who were present in the starting 
condition were removed one at a time, after each round 
of 30 judgments. Thus the groups with more confed- 


erates present in the first generation likewise had some 
stooging present for more generations. The shifting of 
members one chair to the right each generation, plus 
the rule of responding in turn from right to left, insured 
that on each trial a confederate or the eldest member 
of the group spoke first. 

Each of the groups was replicated thre times, while 
24 respondents were run in the solitary control condi- 
tion. The conditions were given in a counterbalanced 


order insuring that the differences in respondent selec- 
tion from one part of the term to the other were kept 
independent of treatments. 


RESULTS 


Control Treatments 

Figure 1 shows the results for the two con- 
trol conditions. It will be noted that for C-1-0 
the mean of the 24 respondents starts at 3.80 
and decreases steadily generation by generation 
to a value of 2.90. This decrease is highly 
significant statistically, being found in the 
individual records of 21 of the 24 respondents, 
for which, by a two-tailed sign test, p < .0002. 

The three-person group control, C-3-0, shows 
very comparable mean levels, and likewise a 
general decline. The effect is not so clearly 
significant and is beclouded by the mutual 
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influence that the group situation introduces. 
In the three replications, two groups declined, 
while one started abnormally low and in- 
creased. Comparing Generation 1 with the 
last generation with three persons present, 
Number 8, average judgment changes were 
3.93-2.90, 6.72-2.03, and 1.72-2.49. Notice 
that in spite of the idiosyncratic norms of 
Generation 1, the three replications became 
more similar as the process of successive re- 
placement went on. Of the 27 respondents 
present in more than one generation, 16 showed 
declines between their first and last genera- 
tions. 

In graphing C-3-0, the data have been 
averaged in a different fashion from the fore- 
going. To show the successive replacement and 
overlapping generational character, the careers 
of each successive replacement have been 
plotted separately. The plotted points repre- 
sent the average of the three replications; that 
is, each plotted point represents the average 
judgment of three individual respondents, each 
taking the corresponding role in the three 
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separate groups replicating this control treat- 
ment. 

The two types of control established such 
similar reference standards that they cannot 
both be used conveniently as visual base lines 
in the graphing of the results of the experi- 
mental treatments. For this reason, in Figures 
2, 3, and 4, only C-1-0 has been plotted. This 
plotting has been done in such a manner as to 
provide a parallel for each new starting genera- 
tion in the experimental group. Thus in Figure 
2, where each respondent stays only two gen- 
erations, the first two generations of data from 
C-1-0 have been repeatedly plotted. In Figure 
4, all four generations of C-1-0 are needed as a 
comparison base. 


Transmission of Arbitrary Norms over Total 
Replacement of Indoctrinator 


Figures 2, 3, 4, and 5 present the transmis- 
sion results from X-2-1, X-3-2, X-43, and 
X-3-1. The first question that we ask of the 
data is whether or not the naive respondents, 
once indoctrinated by the confederates, have 
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Fic, 5, Transmission of arbitrary norm in a three-person group initiated by one confederate and two naive 


respondents, 


themselves transmitted any vestiges of the 
arbitrary cultural norm once the original in- 
doctrinators have passed on. To test this, one 
can examine the judgments of the first genera- 
tion of respondents to judge without any 
confederates present. For X-2-1, X-3-2, and 
X-4-3 these values are 12.07, 9.79, and 9.82. 
Each of these differs significantly (p < .01) 
from the C-1-0 control group by # test, when 
the mean of the 3 experimental respondents is 
compared with the mean of the 24 control 
respondents. For X-3-1, the mean is 5.34, not 
significantly different from the C-1-0 value of 
3.80. 


By pooling groups X-2-1, X-3-2, and X-4.3, 
we can compare to the 24 C-1-0 respondents 
9 experimental respondents newly introduced 
at each of several generations beyond the final 
confederate. For the first such generation, M = 
10.57, £ = 7.25, p < .0001; for the second, 
M = 7.80, t = 3.74, p < .0001; for the third, 
M = 6.06, t = 4.02, p < .0001; for the fourth, 
M = 5.08, ¢ = 2.04, p < .03; for the fifth, 
M = 4.37, t = 1.31, p < .10. Thus the arbi- 


trary norm is transmitted in some degree up 
through the fourth and perhaps the fifth 
generation beyond the last confederate. By the 
sixth it has entirely disappeared, the mean of 
3.60 being slightly below the control value of 
3.80. 


Differences in Strength of Induced Traditions 


The experimental comparisons were intro- 
duced in the expectation that induced cultures 
of differing strengths and persistence would be 
produced. In fact, there was some expectation 
that the strongest conditions might produce 
an arbitrary culture which would persist with- 
out apparent diminution. No condition pro- 
duced a culture of any such strength, and 
among the three experimental groups starting 
with one naive respondent, the number of 
confederates in the groups had little if any 
clear-cut effect. In the first generation, with 
all confederates Present, the means for the 
naive respondents were, for X-2-1, X-3-2, and 
X-43, respectively, 12.23, 14.37, and 14.13. 
X-2-1 was expected to be lower than the others. 


A LABORATORY MICROCULTURE 


The difference between 12.23 and 14.37 is 
significant (t = 2.16, df = 4, p < 05, one- 
tailed test). The absence of a difference be- 
tween X-3-2 and X-4-3 is against our expecta- 
tion. The literature on this point is confusing. 
Asch (1951) finds almost no effect from a 
single confederate, whereas we, in X-2-1, and 
Goldberg (1954) and Kidd (1958) find strong 
effects from a single other. Asch finds the 
effect to increase strikingly when the confeder- 
ates are increased to 2 and again to 3, whereas 
4, 8, and 16 others have little or no greater 
influence than 3. Goldberg and Kidd find no 
significant gains between 1 and 3 (that is, 
between 2- and 4-person groups) in spite of 
large numbers of cases. 

A comparison of the overall strength of the 
induced cultures is difficult because of the fact 
that in the larger groups not only are the 
culture-inducing confederates more numerous, 
but they are also retained for more generations. 
Thus it would seem unfair to compare all 
groups at the fourth generation from the be- 
ginning, since X-4-3 would have had a con- 
federate in the immediately preceding genera- 
tion, while X-2-1 would have been without a 
confederate for three generations. Perhaps 
more appropriate is a comparison of strength 
in the first generation subsequent to the re- 
moval of the last confederate. The values are, 
respectively, 12.09, 9.79, and 9.82. The smallest 
group, X-2-1, seems strongest at this stage 
although the differences do not approach 
significance. But this comparison is not ex- 
actly fair, either, because for X-2-1 the con- 
federate, when present, dominated the culture, 
while for X-4-3 the final confederate had his 
influence diluted by the vocal presence of two 
not completely loyal naive respondents. By the 
fifth postconfederate generation, these three 
values are 4.69, 3.90, and 4.55. Lacking both 
striking differences and an optimal compari- 
son, it seems fair to conclude that no differences 
in effective cultural strength have been 
achieved by thus manipulating group size. 

One manipulation did indeed provide indoc- 
trinations of varying magnitude. The naive 
respondents of X-3-1 were significantly differ- 
ent from C-1-0 only in the first generation, 
that is, with the confederate present (t = 3.88, 
p < 01). But even at this stage the naive 
respondents of X-3-1 were significantly less 
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indoctrinated than those of X-3-2 at Genera- 
tion 2, when it likewise consisted of but one 
confederate and two naive persons (t = 4.69, 
p < .001, 10 df). Thus manipulating the ratio 
of confederates to naive judges has little effect 
when achieved through adding more confeder- 
ates, but has great effect when achieved 
through manipulating the number of naive 
respondents. As Asch (1951) has noted, this 
adding of a “true partner” seems to have an 
effect disproportionally greater than would be 
expected from the simple fact of diluting the 
majority. This dimension has not been sys- 
temically explored, however. In the present 
setting it would be interesting to compare 
groups of varying size in which the naive-to- 
confederate ratio was held constant, as through 
a group consisting of two confederates and two 
naive respondents, to be compared with X-2-1, 
etc. 


Judgments as a Compromise between Own and 
Other’s Observations 


The deterioration of the cultural norm oc- 
curs, in part, because each naive respondent 
makes judgments somewhat lower, somewhat 
closer to the natural norm, than do the con- 
federates or elder citizens serving as his 
mentors. Of 36 naive respondents serving for 
the first time in X-2-1, X-3-2, and X-4-3 during 
the first four generations (where some signifi- 
cant strength to the cultural norm was still 
present) 34 made mean judgments lower than 
the average of their mentors (for which, p < 
0000001). This clear-cut fact we take as sup- 
porting the interpretation that each judgment 
represents a pooling of the person’s own 
observations (the value for which we infer 
from the control group) with the reported 
observations of others. In the resulting com- 
promise the person’s own observations are not 
given much weight, but they are given some. 

In the first generation of the experimental 
groups, the conditions are simple enough so 
that we can make some estimates of this rela- 
tive weighting of own observations and other’s 
reports. For the first naive respondent in X-2-1, 
the other’s (confederate’s) report averages 
45.52. The own observation is inferred from 
C-1 to be 3.80. If the naive respondent 
weighted his own observation equally with the 
report of the confederate, his judgment would 
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fall halfway between these values, at 9.66. 
Instead, the judgment is 12.23, much closer to 
the confederate’s. Rather than regarding the 
confederate as an equally good observer as 
himself, the respondent has weighed the con- 
federate’s opinion 2.6 times as heavily as his 
own. The formula for the weighting ratio is 
(12.23-3.80) /(15.52-12.23). Assuming each con- 
federate to be equally weighted, and solving 
for the weight given a single confederate’s 
opinion, we get values of 4.6 for X-3-2 and 
2.5 for X-4-3. Since these ratios are based upon 
the performances of but three naive respond- 
ents each, they are quite unstable, and are 
provided here primarily for illustrative pur- 
poses. Similar computations for the later 
generations have been omitted because of the 
difficulty of taking into account the reciprocal 
effect of new respondents upon the older naive 
respondents acting as indoctrinators. The first 
generation situation is computable just because 
the confederates were unresponsive to the 
judgments of their fellows. 

Whatever the weighting ratio, as long as 
some weight is given own observations, and as 
long as own observations deviate in a consistent 
direction from the cultural norm, the arbitrary 
culture is doomed to deterioration eventually, 
at least in the limited sanction system of this 
laboratory. While the question has not been 
stated in this form, the relative respect for own 
observations over the reports of others is much 
higher in less ambiguous situations, as Asch 
(1951) has shown. It is easy to understand how 
the ambiguities of the autokinetic situation 
lead to lack of confidence in own opinion, but 
it is perhaps an indication of human trust that 
this same ambiguity does not lead to a pro- 
portional lack of confidence in the reports of 
others. 


Forgetting and Reciprocal Influence 


If the previous factor of compromise weight- 
ing were all that was involved, the deteriora- 
tion in culture as graphed in Figures 2, 3, and 
4 might have been of a stair-step fashion, with 
each new inductee starting lower than his in- 
doctrinator, but once started, holding to his 
initial level through the several generations of 
his life in the group. As can be seen from the 
graphs, this is not the case. Instead, the 
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respondent is less loyal to the old culture the 
longer he stays in the group. Of the 36 naive 
respondents introduced in the first four genera- 
tions of X-2-J, X-3-2, and X-4-3, 31 made 
lower mean judgments on their last session 
than on their first (for which, p < .00001). 

In the present design, three factors are con- 
founded in generating this effect. While we 
have not extricated them, it seems well to 
mention them. The first factor is the autono- 
mous decline noted previously in the control 
groups. In terms of absolute magnitude, this 
would seem to account for little of the total 
effect. 

A second factor is forgetting. Were a re- 
spondent to judge with the leadership of a 
culture-bearing confederate, and then to judge 
in isolation for several sessions, he would in 
these later sessions be pooling his immediate 
visual impression of the stimulus with the 
remembered reports of the other in a previous 
session, As these memories dimmed, their 
weighting could be expected to diminish, and 
the relative weighting of own observation could 
be expected to increase. No doubt in the 
present situation forgetting of indoctrination 
provides some of the decline within the judg- 
ment series of a single respondent. 

A third possible source of this within-re- 
spondents decline in loyalty to the arbitrary 
culture is the effect of the new group member 
upon his elders. Since judging was always from 
eldest to youngest on any given stimulus 
presentation, this would have to involve an 
influence carrying over from one turn to the 
next. We feel sure that some such influence 
was present, but its demonstration would 
require a separate experiment placing con- 
federates in the final answering position. 


Discussion 


In presenting these materials to our col- 
leagues, we have found them most fascinated 
by the fact of the cultural transmission sur- 
viving total replacement of specific individuals, 
and by the fact of naive respondents becoming 
unwitting conspirators in perpetuating a 
cultural fraud. This demonstration was of 
course our main intent, and this outcome was 
inevitable if the conformity research upon 
which we built had any transfer validity. But 


these results are highly significant statis- 
ly, they fall short of our expectations. 
inculcated arbitrary norms turn out to be 
“eroded by innovation at a much more rapid 
than we had expected. We have hardly 
_ provided a laboratory paradigm for the ex- 
“amples of tenacious adherence to incredible 
"superstition with which anthropologists and 
students of comparative religion provide us. 

_ Of course, a relative weakness in strength is 

to be expected in a laboratory example. 

Mature individuals are undergoing transient 
indoctrination by unknown age-mates, as 
opposed to the natural setting of the teaching 
of the very young by tribal adults over a 
riod of years. Fewer confederates are in- 
volved than in the tribal situation, although 
the effect of number upon degree of conformity 
is unclear in the laboratory. More important, 
there are no sanctions rewarding conformity 
nor punishing innovation. Furthermore, it is 
even possible that the subject of judgment in 
our experiment is less ambiguous than the 
subject material of religious belief, for example. 
There is, after all, a fairly clear consensus 
among our naive solitary observers. In our 
situation, the spontaneous deviations from 
| orthodoxy were all in the same direction, 
= whereas for many arbitrary items of culture 
o the deviations may lack such concerted direc- 
tion. 

Nonetheless, the outcome may well warn us 
against the assumption that a purely arbitrary 
cultural norm could be perpetuated indefinitely 
without other sources of support. Even if 
people weigh the opinions of their elders many 
times that of their own direct observations, the 
collective effect of their own observations 
probably soon erodes a functionless arbitrary 
belief, Where we observed tenacious bizarre 
cultural belief we must look to more than mere 
tradition, suggestibility, or conformity, to ex- 
plain its retention. Latent functions (e.8. 
Merton, 1949) at the personal or societal level 
must be present to counteract the pressures 
from continuous spontaneous innovation ina 
more natural direction. For example, Moore 
(1957) and Aubert (1959) have recently sug- 
gested such latent functions to superstitious 
magical lotteries used in the selection of 
hunting and fishing sites. As our understanding 
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of the requisites of individual and social life 
further increases, we may expect to discover 
such latent functions for many if not all of 
those “meaningless” superstitions which stub- 
bornly persist. 


SUMMARY 


In an autokinetic judgment situation in 
which solitary judgments of movement aver- 
aged 3.8 inches, confederates were used to 
establish arbitrary “cultural norms” of 15.5 
inches. The transmission of this norm was 
studied as one at a time confederates and old 
members were removed from the group while 
new members were added. Significant rem- 
nants of the culture persisted for four or five 
generations beyond the last confederate. 
Gradually in each of the 12 experimental 
groups the arbitrary norm decayed and the 
group judgments drifted away from it back to 
the natural norm found in the control groups. 
The size of group had no clear effect upon the 
endurance of the norm established, although 
a group beginning with one confederate and 
two naive respondents was markedly weaker 
than the others and transmitted no culture be- 
yond one generation. 

Major effects are interpreted in terms of a 
judgment process in which each respondent 
weighs the reports of others and his own ob- 
servations, achieving a compromise between 
them. Even with the respondents giving the 
reports of others from 2.5 to 4. times the 
weight of their own observations, slight “in- 
novations” result which rapidly cumulate to 
erode the original arbitrary cultural norm. 
Details of the process and analogues to the 


cultural transmission of bizarre beliefs are dis- 
cussed. 
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number of responsible peeple have 
Az much time and effort in the 

last decade or so attempting to con- 
vey the notion that alcoholism is a disease. 
Similar attempts have been made over a 
longer period of time to educate the public 
about the nature of mental illness. In both 
cases, the benefit has been a shift in public 
opinion toward regarding alcoholism and 
mental illness as preventable and treatable, 
rather than simply deplorable. 

It is possible, however, that the effects of 
this indoctrination are not entirely salutary. 
Tf alcoholism really is a disease, then the direc- 
tion to be taken in theorizing and experiment- 
ing in this area would be to search for the 
unique set of events—cause, symptoms, 
course, and sequelae—which, when found, 
would define it. Most of the research in alco- 
holism has followed this line of thought. 
Whether the cause has been sought from 
genetic, biochemical, dietary, or psychiatric 
sources, no disease entity has been isolated, 
and about the only “facts” that have been 
established concern survey information, symp- 
tomatic relief, or fixed chronological stages of 
drinking behavior. Actually, in patient man- 
agement, the medical profession has frequently 
found it expedient to fall back upon the quasi- 
religious controls of the Alcoholics Anonymous 
Program as its major resource in later stages 
of treatment. 

An alternative to the conception of alco- 
holism as a disease entity is to consider prob- 
blem drinking as a consequence of a number of 
complex patterns of learned needs, attitudes, 

1 Formerly at the University of Houston. 


,* Employed part-time at the Houston Veterans Ad- 
ministration Hospital, where part of this study was 


« conducted. 


The investigators express their appreciation to N. 
J. Bradley, Superintendent, H. Davis, Chief Psycholo- 
gist, M. E. Brandes, Alcoholics Anonymous Counselor, 
and other counselors and staff members of the Willmar 
State Hospital, Willmar, Minnesota, for their coopera- 
tion and many helpful suggestions during this study. 
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habits, etc. acquired under varying causal 
conditions. This general assumption underlies 
this investigation: Alcoholism is not the name 
of a specific disease, the nature of which has 
yet to be fully discovered and understood; 
rather, it is a loose, inexact sociological or 
legalistic term used to cover a number of quite 
different cause-effect sequences. 

In a clinical paper written some 20 years 
ago, Knight (1937) proposed that a clear dis- 
tinction be made between two classes of prob- 
lem drinkers—“essential” and “reactive” alco- 
holics. Knight’s clinical description of the 
essential alcoholic is summarized as follows: 
He is emotionally dependent, egocentric, pleas- 
ure seeking, irresponsible, insincere, and un- 
reliable; he is unable to establish or to main- 
tain long term goals, or intimate, lasting rela- 
tions with others. Knight’s description of the 
reactive alcoholic is less specifically drawn; 
but the implication is that the reactive alco- 
holic, to one degree or another, possesses traits 
opposite to those listed for the essential group, 
and in addition is strongly compulsive. Knight 
assumes that these more generalized differ- 
ences in behavioral characteristics result from 
differences in psychosexual development in 
the early, formative years. But he also charac- 
terizes the differences between these two 
groups in terms of more concrete behavioral 
implications of the typological distinction. 
Thus, he refers to likely differences between 
the groups in terms of educational and occupa- 
tional experiences, kinds of drinking and eat- 
ing habits, time of onset of drinking, progno- 
sis, etc. 

Although Knight (1937) did not specifically 
extend his clinical distinction to the following 
additional points, the investigators judged 
that they were not inconsistent with his de- 
scriptive distinctions. It was inferred that the 
reactive alcoholic has assimilated cultural 
values (predominantly middle class) through 
the process of identification and shows a rea- 
sonable degree of the behavior controlling 
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effects of anxiety and guilt. Contrarily, it was 
assumed that the essential alcoholic shows a 
general culture-combativeness, both in his 
family background and in his own personal 
history, and is relatively free of the controlling 
effects of anxiety and guilt. 


GENERAL HYPOTHESES 


The purpose of this study was to attempt 
to provide a broader empirical base (or con- 
struct validity) for this clinically derived dis- 
tinction between essential and reactive alco- 
holics. With the descriptive and inferential 
material cited as a rough guide, the investiga- 
tors entered into the choice of instruments and 
the subsequent analysis of the data on the 
basis of some general hypotheses.’ 

Family background and marital relations. The 
essential alcoholic’s family (orientation and 
procreation) background is one of strife, con- 
flict, disloyalty, disunity, and disharmony 
within the family, and of nonconformity with 
society and its institutional controls, All of 
this is less true of the family background of 
the reactive alcoholics. 

General personality characteristics. The es- 
sential alcoholic’s general personality charac- 
teristics are those frequently subsumed under 
the terms “psychopathic” or “sociopathic” 
adjustment. This implies more specific traits 
such as “hedonism” (direct, immediate pleas- 
ure seeking), avoidance of social responsibility, 
and unmopitored expression of feeling or emo- 
tion. 

Defensive behavior. As contrasted with the 
essential group, reactive alcoholics employ a 
more pervasive defensive system. The reactive 
group’s defensive system is relatively complex, 
both in terms of range of separate defensive 
tactics for different areas of conflict or threat, 

and of a relatively greater tendency toward 
the more exacting tactic—intellectualization. 


METHOD 
Subjects 


Subjects were drawn from a group of male patients 
undergoing treatment for alcoholism in a large state 


* Two additional hypotheses set forth in this study 
concerned group differences in patterns of acquired 
needs and in traits presumably associated with differ- 
ences in experience during early psychosexual develop- 
ment. Because of the uncertain validities of the instru- 
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institution. A total of 34 patients served in prelimi 
studies to develop the criterion instrument to be d 
scribed below. Subjects in the main study were d 
from a random sample of 188 male patients to w 
the criterion instrument had been routinely admin 
tered. 

The selection requirements were as follows: Ai 
percentile score above the 43rd percentile and sc 
at least one sigma above or below the mean on th 
criterion measure (mean of 19.17, SD of 5.89) 
final group of patients meeting these requirem 
numbered 36; this was the W used for most of the d 
analysis. At a later time in the study, 22 more sub 
were added to facilitate the statistical analysis of dat: 
from one instrument—a life history question 
These additional subjects were selected from the o1 
inal larger group of 188 by resetting the cutting 
on the criterion measure to one-half a sigma above 
below the mean. : 

The correlation values (V = 188) between scores 
the criterion instrument and age and education did 
differ significantly from zero. In the smaller sai 
(N = 36), there were no significant differences betw 
the essential (V = 20) and reactive alcoholic (N 
groups on the variables of age, education, intel 
or socioeconomic status of the family of orientati 
However, in the enlarged sample (W = 58), the reacti 
alcoholic group (N = 27), was significantly (p = 0 
higher than the essential alcoholic group (N = 31) 
educational level, h 


Criterion Instrument 


Knight’s (1937) description of the specific beha: o 

and situational distinctions between the two types 

used to construct an interview form of questionnaire 
select the experimental groups. Questions and 
criteria were developed gradually in two preli 
studies. In final form, the structured inquiry con 

of 14 filler items and 55 scorable items concerning 
or present experiences and behavioral tendenci 
eight areas (e.g., age at onset of drinking, drinki 
habits, types of friendship relations, history of gastri 
intestinal symptomatology).‘ For each subject, a 

terion “score” was obtained by summing the numb 
of interview items for which the response was jut 
to reflect istics of essential alcoholism as 
scribed by Knight. 


ments used to test these hypotheses, the comp! 
eon results that were obtained in this study i 
learly interpreted. Consequently, this part 
study will not be covered in this report. s 
‘The following supplementary materials have b 
deposited with the American Documentation Institut 
a 69-item questionnaire for the selection of esseni 
and reactive alcoholic patients, and a manual for scor 
defensiveness with sentence completion data. O 
1 t No. 6768 from ADI Auxiliary Publicati 
Project, Photoduplication Service, Library of Cong 
Washington 25, D. C., remitting in advance $1.75 
microfilm or $2.50 for photocopies. Make checks 
able to: Chief, Photoduplication Service, Library of 
Congress, ý 
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About a year after the original criterion instrument 
data were obtained, the questionnaire form was mailed 
to a stratified subsample of the original 188 subjects 
with the request to respond again to each of the ques- 
tions. Nineteen of these 69 subjects retumed the com- 
pleted form; these records were rescored blindly. A 
rough check on the consistency of the questionnaire 
data was made by correlating the respective pairs of 
scores for these subjects; the obtained product-moment 
value was .66 (p = .01). Considering the rather marked 
differences in the response situations (i.e, responses 
obtained by interview vs. by mail), this value is re- 
garded as indicative of a satisfactory degree of stability 
in the measurement of the criterion variable. 


Other Instruments and Techniques 


The Rorschach test. Rorschach records were judged 
for type of defensive behavior according to a system 
proposed by Schafer (1954) and developed as a research 
technique by Weiss (1956) and Selzer (1956). In this 
system, the subject’s use of form, the content of his 
responses, and his behavior and attitudes expressed 
during the test situation, are analyzed to distinguish 
the most pervasive defensive style among four major 
modes of defensive operations—repression, intellec- 
tualization, projection, and reaction formation.’ The 
relatively high interjudge agreement concerning these 
ratings of defenses from Rorschach data has been pre- 
viously demonstrated (Eskenazi, 1957; Weiss, 1956). 
In the present study, one of the raters® with previous 
experience made the judgments of predominant de- 
fensive style. 

The sentence completion task. Constructed by Wiener, 
Carpenter, and Carpenter (1956) to distinguish between 
“repressive” and “sensitizing” defenses as evidenced 
in verbal behavior, a sentence completion task was used, 
consisting of 20 items, 5 of which are fillers. The re- 
maining 15 items are designed to elicit five responses 
in each of three potential conflict areas—sex, hostility, 
and self-adequacy. 

_ For the purposes of this study, the present inves- 
tigators developed a new system of criteria for cate- 
gorizing defensiveness. In this system, responses to the 
sentence stems were classified under two main head- 
ings, “defense” and “no defense.” The first major class, 
defense, subsumes three separate subclasses—intellec- 
tualization, repression, and projection.” Under the 


®The type of defensive behavior termed “reaction 
formation” here has been called “denial” in the pre- 
vious studies cited. To the present investigators, the 
term “reaction formation” more accurately conveys 
behavioral characteristics associated with this form of 
defensive maneuver. 

8 The investigators are grateful to Bertram A. Weiss 
for carrying out these judgments. 

7 In the opinion of the investigators, the pervasive- 
ness of the defense of reaction formation cannot be 
measured by summing separate responses in a sentence 
completion task. An essential characteristic of a reaction 
formation defensive system is a relatively high degree 
of consistency across a broad range of behavioral trans- 
actions. This does not seem to be necessarily as true for 
other forms of defensiveness. 
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second major class, no defense, are two subclasses— 
conflict and no conflict, Thus, there are, in all, five 
separate scoring categories; for each of these, a number 
of scoring criteria were developed and built into a 
scoring manual.® 

In the development of this system of criteria, all of 
the responses for each item were assembled, and the 
scoring criteria for each item were established sepa- 
rately, with information identifying the subjects masked 
out. This procedure was carried out item by item until 
all responses had been surveyed. Each response, again 
item by item, was then reconsidered and formally 
categorized as meeting one of the developed criteria. 
About 2 months after the formal scoring was completed, 
a total of 144 responses was rescored. Of these, 134 (or 
93%) were judged as meeting the same criterion applied 
to them in the initial scoring. 

The Minnesola-Briggs History Questionnaire. A his- 
tory questionnaire developed by Briggs (1955) was 
used to obtain routine case history material. Briggs 
estimated that this procedure provides about three 
times as much reliable information as the usual case 
history interview. Among the areas sampled are family 
background and relations, home atmosphere, educa- 
tional and occupational history, dating and sexual 
history, trouble with authority, and various personality 
characteristics. Briggs used eight clinical judges to 
cluster these items into a number of diagnostic, 
havioral, and experiential categories, Of these, the 
following nine clusters were chosen for use here: con- 
flict with parents, unstable-unrewarding home, broken 
parental home, sibling conflict, psychopathic adjust- 
ment, achievement attitudes, educational accomplish- 
ment, occupational accomplishment, and parental 
socioeconomic status. The last category was used as a 

tial control variable. 

A number of relatively minor alterations of the ques- 
tionnaire were introduced by the present investigators 
to construct a research instrument meeting the require- 
ments of this study (e.g., revising the items for direct 
response by the patient). In final form, =the revised 
questionnaire contained 171 items, with the number of 
alternative responses ranging from three to eight. Five 
additional item clusters were developed to form the 
following categories: patient’s socioeconomic status, 
marital difficulty, marital irresponsibility, patient’s un- 
monitored expression of feeling-emotion, and patient’s 
hedonistic orientation. ( 

In the development of a scoring system to yield a 
total score for each subject for each cluster, separate 
item-alternative values were checked (and, if necessary, 
altered) in terms of expected directionality, and then 

to standard values on & zero-to-nine point 
scale to equalize the effect of varying numbers of al- 
ternatives among the items. For each subject, standard 
values for each item within a cluster were then summed 
to obtain the total score for each cluster. 


RESULTS 


Defensive Behavior 
Rorschach data. The distribution of judg- 
ments of predominant defensive style based 


8 See Footnote 4. 
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TABLE 1 
DISTRIBUTION OF JUDGMENTS OF PREDOMINANT 
DEFENSIVE SysTEM BASED UPON AN ANALYSIS 
or RORSCHACH PROTOCOLS 


Mechanism of Defense 


a 
§ 
Grup |x| 38 | 3| 8 | £ | Total 
S8 | $E 8 3 
MES ea) |) os 
S gm S 2 
4 4 4 À 
Essential Alco- | 20 | — — 6 14 20 
holics 
Reactive Alco- | 16 1 6 $ 8 16 
holics 


upon an analysis of Rorschach protocols is 
shown in Table 1. Group differences in the 
relative strength of each of the separate defen- 
sive tactics cannot be tested at present because 
a satisfactory system of estimating these 
scalar values for each defense has not yet 
been developed for the analysis of Rorschach 
protocols. Although an over-all difference be- 
tween the groups in distribution of predomi- 
nant defensive style seems apparent, it is not 
possible to assess the appropriate level of confi- 
dence for this over-all difference because of 
low expected frequencies in the distribution. 
The significance of the group difference in 
choice of reaction formation as the predomi- 
nant defense can be estimated (p = .004, 
Fisher’s exact tables) and, of course, if the 
cell frequenceis for intellectualization and reac- 
tion formation were combined as representa- 
tive of more complex forms of defensiveness, 
the difference would be slightly greater. The 
higher incidence of repression for the essential 
alcoholics does not reach significance (p = 
.115, Fisher’s exact tables). 

Sentence completion data. During the devel- 
opment of scoring criteria for defensive tactics 
shown in this task, one item of the five in the 
conflict area of self-adequacy was omitted as 
not fitting into the scoring schema. Responses 
to each of the remaining 14 items for each sub- 
ject were classified as fitting one of the five 
categories of defensive behavior described 
above. The results of an analysis of group 
differences for these data by means of the ¢ 
test are shown in Table 2. 

The results clearly indicate that the reactive 
alcoholics are significantly (.001) more defen- 
sive than the essential alcoholics on total de- 
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fensiveness as measured by the summation 

the defense totals of intellectualization, repres- 
sion, and projection. The reactive alcoholics 
were higher, on each of the separate defense 
totals, but only significantly (.01) so for in- 
tellectualization. Under “no defense,” the es- 
sential alcoholics were significantly higher than 
the reactive alcoholics on both “total no con- 
flict” (.05), and “total conflict” (.001). Possi- 
bly, it should be observed that these tests of 
significance are not completely independent 
of each other since the “defense” and “no de- 
fense” total scores are reciprocally related. 

Essentially the same results were obtained 
when responses for each of the three conflic 
areas were analyzed separately. 

The question of relative simplicity-complex 
ity of defensive systems, in terms of differences 
in range of available tactics between the two 
groups, was approached from two angles. 
First, within each of the two groups, 
the separate defense scores (intellectual 
ization, repression, and projection) were 
correlated against each other. Only one of the 
six coefficients was significant; this was be- 
tween intellectualizing and repressing within 
the essential alcoholic group (r = —.58; p = 
01). That is, within this group, the over-all 
tendency to use intellectualizing as a defense 
is negatively related to the over-all use of re- 
pression. 

The second form of analysis was carried out 
by deriving a score for each subject based upon 
the proportion of the score for the most fre- 
quently used defense to the summed score for 
total defensiveness. Group differences in th 
distribution of these proportions were then 
evaluated by computing a chi square. This 
value was 5.67 (p = .01), the essential alco- 
holic group showing the greater frequency of 
high proportions. There was some likelihood i 
of a confounding factor in this result because 
the essential group had lower over-all fre- 
quencies in defensive responses and, thus, 
might be expected to show higher proportions: 
This possibility was investigated by computing © 
a rank order correlation for all subjects be- 
tween the proportion value discussed above 
and the summed score for total defensiveness: 


Thus, from two sources, there is the sugges- 


ee 
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TABLE 2 


COMPARISON OF DEFENSIVE OPERATIONS BETWEEN THE ESSENTIAL AND REACTIVE 
ALCOHOLICS ON THE SENTENCE COMPLETION TASK 


e Essential Group Reactive Group 
Scoring Category GRN (Ra t ’ 
M SD M SD 
Defense 
Intellectualization Total e 3.00 1.71 4.62 1.58 2.89 01 
Repression Total 2.85 1.80 3.94 1.85 1.72 — 
Projection Total 1.60 1.46 1.94 1.08 18 zs 
Defense Total 745) 2.27 10.50 1.70 4.48 001 
No Defense 
No Conflict Total 3.00 1.30 2.00 1.17 2.35 05 
Conflict Total 3:55 1.83 1.50 94 4.24 001 
No Defense Total 6.55 2.27 3.50 1.70 4.48 001 
TABLE 3 


SIGNIFICANCE OF DIFFERENCES BETWEEN EssENTIAL AND REACTIVE ALCOHOLICS ON ITEM 


CLUSTER CONVERTED 


SCORES FROM THE MoınnesoTA-BRIGGS HISTORY QUESTIONNAIRE 


i Chis 
Title of Cluster Ttems FEEN et quare ? 
Family of Orientation 
Conflict with parents 22 83.80 77.21 6.08 .02 
Unstable-unrewarding home 18 58.88 51.41 9.43 „01 
Broken parental home 9 30.29 28.72 .39 — 
a Sibling conflict 6 24.53 21.55 4,52 +05 
‘amily of Procreation* 
Marital difficulty 22 64.13 57.90 14.70 001 
Marital irresponsibility 4 31.68 26.09 7.04 01 
K EE level 3 14.12 17.12 8.71 01 
ersonality Characteristics 
Unmonitored feeling-emotion 14 64.23 58.76 5.81 02 
Hedonism 13 57.44 49.15 8.98 .01 
Psychopathic adjustment 23 87.79 73.94 15.59 001 
Unfavorable achievement attitudes 17 71.93 62.47 15.59 001 
Educational accomplishment 9 42.16 47,22 6.42 02 
Occupational accomplishment 5 25.84 27.84 4.97 .05 
e 


* For items concerning family of procreation, N = 27 in each group. 


tion that the essential alcoholic group employs 
a relatively simple, generalized defensiveness, 
and the reactive alcoholic group a relatively 
complex, more specialized system of defenses. 


History Questionnaire Data 


Data concerning the subject’s family back- 
ground, marital relations, and general person- 
ality characteristics were obtained from the 13 
item clusters of the history questionnaire. The 
results of the analysis of these data by chi 
squares are shown in Table 3. With all 13 
clusters, the group differences are in the ex- 
pected direction, and in 12 of these compari- 
sons, the differences are significant at $ levels 
of .05 or beyond. These results are not com- 
pletely independent because there is a certain 
amount of item overlap among some of the 


clusters; however, this overlap is not exten- 
sive. 


Discussion 


Although previous experimental and clinical 
findings provided no basis for specifically an- 
ticipating a higher incidence of the defense of 
reaction formation among the reactive alco- 
holics, this finding is not too surprising, in 
view of the relative complexity of this defen- 
sive technique. 

One finding involving the Rorschach data, 
however, was clearly not anticipated. This was 
the lack of comparability in the results of the 
Rorschach and sentence completion data 
analyses of defensive behavior. There is evi- 


dence of reasonable interjudge agreement for 
each of the two general systems of judgment 
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of defensive behavior. Further, each of these 
two systems has led to successful predictions 
of behavior in other studies. Thus, some net- 
work of meaning exists for the concepts of 
defensiveness tied into each set of operations. 
Assuming that the two techniques for inferring 
kinds of defensive behavior are reasonably 
reliable, one is led to the conclusion that the 
Rorschach and sentence completion situations 
represent actually different sets of response 
requirements. 

The most obvious difference in the two 
testing situations concerns the degree of struc- 
ture. It follows that differences in degree of 
structure in the testing situations would be 
associated with differences in degrees of aware- 
ness in the subjects of the implications of their 
test behavior to the examiner in the two situa- 
tions. One of the clearest differences in the 
Rorschach and sentence completion records 
in this study was that of a high incidence of 
judgments of projection on the Rorschach 
and a relatively low frequency of responses 
reflecting this defense in the completion task. 
Technically, it was quite possible for the sub- 
jects to complete each of the items in the 
latter task by employing a projective defense. 
The fact is, they usually did not. Similarly, 
it was quite possible to reply to the sex-con- 
flict items by openly acknowledging a conflict 
in this area. Again, this type of response sim- 
ply was not given. 

Under srdinary social conditions, the use of 
projection as a defense is a luxury few can 
afford. Its cost, usually, is social punishment 
and a consequent loss of “face” if detected, 
or some degree of loss of reality, if not. It is 
one of the earliest temptations for a young 
child and, frequently, one of the first response 
tendencies to evoke strong negative sanction. 
In Cameron and Magaret’s (1951) terms, pa- 
rental disapproval of the child for falsely im- 
puting blame to others would frequently be 
one of the earliest conditions for the develop- 
ment in the child of “muting of communica- 
tive speech” and, later, “behavioral duplicity.” 

The instructions in the completion task are 

“to express your real feelings.” It appears 
likely that, to varying degrees, subjects rein- 
terpret this instruction, when threatened, to 
the personal meaning of “suppress your real 
feelings, if you can do so and still appear to 
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‘ 
comply.” Thus, the avoidance of too-obvious 
projection, the avoidance of acknowledgment 
of sexual conflict, and the apparent but not 
actual compliance with test instructions to | 
express “real feelings” are all tactics of “be- | 
havioral duplicity” that are relatively “suc- 
cessful.” They are segments of well-learned 
behavior, sequences which permit the individ- 
ual to “play his roles”—to escape threat while 
still maintaining self-integrity in the eyes of 
the observer. 

But, as has been frequently observed, the 
“rules of the game,” or the cultural expect- 
ancies, are much less clear in the less struc- 
tured Rorschach situation. If this interpersonal 
situation is sufficiently threatening to the 
individual, he has fewer guide lines to lead 
him toward the safer defensive tactics and he 
may, in desperation, resort to the more dan- 
gerous projection—to “attack” or impute 
blame to others, or to acknowledge openly his 
fear of “attack” from others. 

One of the tentative findings of this study — 
was that the specificity-generality of a defen- 
sive system may be a function of the com- 
plexity-simplicity of organization of the in- 
dividual personality. Here, the possibility is 
also suggested that choice of defense may 
vary with the individual’s degree of awareness 
of the implications of his behavior. There may 
be other determinants of defensive choice 
(e.g., degree of threat, amount of skill and 
experience the specific defensive technique” 
might require to be successfully used, or type. 
of conflict) that could be introduced systemati- 
cally in future research. 

Most of the discussion has dealt with results 
obtained in this study that were essentially 
unanticipated. A good proportion of the re- 
sults were anticipated in terms of Knight’s 
(1937) clinical observations concerning the 
distinctiveness of essential and reactive alco- 
holism. The classes of expectancies that were, 1 
good part, confirmed concerned group differ- 
ences in defensive behavior, developmen 
experiences, and a number of aspects of person- 
ality organization. Thus, Knight’s distinction 
between essential afd reactive alcoholics aP- | 
Pears to be an essentially valid and useful on 
which can serve to reduce variance in future” 
clinical research in much the same way that 
such other clinical distinctions as process-rea* 
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tive, good and poor prognosis in schizophrenia, 
and satellizing-nonsatellizing in personality 
development have already done. 


SUMMARY 


On the basis of clinical experience, Knight 
(1937) has proposed a distinction between two 
classes of alcoholism—‘essential” and “reac- 
tive.” The purpose of this study was to at- 
tempt to provide a broader empirical base for 
this distinction in terms of differences in de- 
velopmental experiences, defensiveness, and 
personality organization. 

A pretested selector questionnaire developed 
to establish the criterion groups contains 55 
scorable items in eight behavioral areas em- 
phasized by Knight (e.g., drinking habits, 
types of personal and institutional depend- 
ency, history of gastrointestinal symptomatol- 
ogy). Subjects were males hospitalized for 
alcoholism. There were no significant group 
differences in age, intelligence, or parental 
socioeconomic status. 

As expected, the reactive alcoholics showed 
a higher degree of over-all defensiveness, their 
defense system was more complex (less gen- 
eralized), and they placed greater reliance 
upon the more exacting tactic—intellectualiza- 
tion. Also as expected, the essential alcoholics 
“scored” significantly higher on psychopathic 
adjustment, conflict with parents, unstable- 
unrewarding home, sibling conflict, hedonism, 
unmonitored feeling-emotion, marital diffi- 
culty, and marital irresponsibility. The reac- 
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tive alcoholics “scored” significantly higher 
on over-all achievement, educational accom- 
plishment, occupational accomplishment, and 
socioeconomic level of the conjugal family. 
Knight’s distinction between these classes 
of alcoholism appears to be an essentially 
useful and valid one that can serve to reduce 
variance in future research with alcoholics. 
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f HE effects of anxiety on performance 
have been subjected to fairly intensive 
experimental study in past years (Child, 

1954). A good deal of this research has shed 
light on the type of performance that is dis- 
rupted by anxiety. For instance, it has been 
shown that anxiety interferes with the accom- 
plishment of complex tasks but facilitates the 
performance of simple tasks (Farber & Spence, 
1953). Considerably less attention has been 
given to a theoretical analysis of anxiety it- 
self, the conditions of its arousal, and its de- 
fining properties. Experimenters do not always 
agree with each other or with theoreticians 
and clinicians as to the exact meaning of anx- 
iety as a conceptual term. The first part of 
the present paper is a review of the psycho- 
analytic theory of anxiety. This framework is 
selected because many authors have operated 
within it. Following the presentation of the 
theoretical outline, some shortcomings of con- 
ventional experimental approaches to anxiety 
will be pointed out. Finally, an experiment 
will be reported, the procedures of which ap- 
proximate the theoretical specifications more 
closely. 

The first of the two conventional procedures 
to be discussed segregates the subjects with 
respect to anxiety level on the basis of a ques- 
tionnaire to which the subject responds with 
an introspective report of his characteristic 


1 An abbreviated version of this paper was prepared 
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anxiety level and stress experiences (Mon- 
tdgue, 1953; Taylor & Spence, 1952). The 
second procedure produces a stress in the sub- 
ject through fictitious reports of his inadequate 
performance on tasks which are presented as 
tests of intelligence or as other measures of 
adequacy (Mandler & Sarason, 1952; Sarason, 
Mandler, & Craighill, 1952). Sometimes these 
two procedures are employed jointly. They 
are subject to limitations which will become 
clear after the theoretical outline is presented. 

The psychoanalytic theory of anxiety 
(Freud, 1936) begins with the observation 
that for some people the intensification of a 
particular drive constitutes a psychic danger. 
The nature and origin of this danger are to be 
sought in the individual histories of such per- 
sons. Unconsciously fantasied physical dam- 
age, helplessness, or loss of love are frequently 
encountered examples of such danger. In an- 
ticipation of this danger, such people charac- 
teristically distort the recognition and expres- 
sion of the drive in question by means of de- 
fense mechanisms. These mechanisms avert 
the expected danger to the extent that they 
block or distort the drive successfully. Anxiety 
is defined as the psychological mechanism 
whereby the current intensification of a dan- 
gerous drive results in the elicitation of de- 
fenses. The term “signal anxiety” refers to 
this function specifically. Anxiety is therefore 
a theoretical construct which is anchored on 
the antecedent side to the intensification of a 
dangerous drive and on tlre consequent side 
to the rise of defensive behavior? It should be 
noted that this strict definition of anxiety 

* Anxiety as a mechanism should not be confused 
with anxiety as a character trait. To say “John is 
anxious about aggression,” does not necessarily mean 
that John is aggressively stimulated at the present 
time and is counteracting this stimulation with defenses. 
Rather, it means that John would become defensive if 
he were aggressively stimulated. This is another way of 
saying that aggression is a dangerous drive for John oF 
that John has conflicts about aggression. 
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makes no phenomenological references to 
feelings of distress, heart palpitations, etc. 

Defenses are therefore a central concept in 
the definition of anxiety. They afe recognized 
when the individual behaves in ways which 
interfere with the experience or expression of 
his needs. For example, when it is observed 
that a person responds to extreme aggressive 
provocation with inexhaustible forbearance 
and forgiveness for the offender, it may ‘be 
said that he is defending himself against ag- 
gression. In clinical practice and experimental 
studies (Eriksen, 1954) it has been found that 
people who are defensive characteristically 
distort their perception of stimuli which are 
associated with the arousal of dangerous 
drives. Moreover, it is found that the greater 
the danger, the greater is the distortion 
(Schafer, 1954). 

It is well known that anxiety is in certain 
instances further characterized by conscious 
experiences of distress and by physiological 
stress symptoms. However, these additional 
conditions are of secondary importance in the 
definition of anxiety and may be completely 
lacking. Thus the term anxiety may be applied 
regardless of whether subjective experiential 
factors and physiological stress symptoms 
play a minimal role or assume spectacular im- 
portance, as they do, for instance, in anxiety 
attacks. Only its traceable origin in past ex- 
perience, which has rendered the drive dan- 
gerous, and the automatic elicitation of de- 
fenses are central to the definition of anxiety. 
In fact, the concomitance of anxiety, whatever 
its experiential salience, and of defensiveness 
is so constant that it is difficult to distinguish 
conceptually or experimentally between the 
behavioral consequences of the one and the 
other. Under what conditions do the secondary 
experiential and physiological factors appear 
at all? The theory*provides an explanation in 
terms of the effectiveness of the defense. If 
the defense mechanism does not sufficiently 
block or distort the dangerous drive, and the 
drive grows in intensity, then the experiential 
and physiological aspects of anxiety will 
gradually become more conspicuous. In other 
words, in the case of a person with ineffective 
defenses, these aspects will be added to signal 
anxiety when the drive is intensified. They 
subside as soon as the drive is sufficiently dis- 
torted. 
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It is an important part of the theory that 
anxiety operates automatically in many cases. 
That is to say, the elicitation of defenses does. 
not depend on the experiential and physiologi- 
cal aspects of anxiety. This is especially char- 
acteristic of functioning normal adults. In 
fact, people who are frequently subject to 
anxiety experiences are often in need of treat- 
ment. Anxiety is not an ever-present condition 
or mood in well functioning individuals. Even 
in patients, the intensity and effects of anxiety 
and defensiveness vary with the intensity of 
the drive that gives rise to them. Only an 
acute intensification of dangerous drives will 
result in anxiety—experiential or otherwise. 
For instance, a person for whom aggressive 
stimulation is dangerous will try not to expose 
himself to social situations in which he may 
be aggressively stimulated. Thus anxiety and 
defense arousing situations may only rarely 
occur in his life. 

The view outlined here reserves the con- 
cepts of “anxiety” and “defense” for situa- 
tions in which the danger has an internal 
source, specifically a drive, and the term 
“fear” for those in which the threat originates 
in the external world. Although borderline 
instances and cases of overlapping may be 
cited, the paradigm itself is clear in this re- 
spect. 

In the light of this theory, both the ques- 
tionnaire and the stress procedures mentioned 
initially have serious shortcomings. ‘As for the 
questionnaire, its outstanding limitation is its. 
reliance on the subject’s introspective report 
of anxiety. By theory, subjects who rate them- 
selves high on anxiety by reporting experiences 
of psychic distress and physiological stress are 
not only anxious in the primary sense of the 
word—they not only become defensive in the 
presence of the drive—but in addition, they 
utilize ineffective defenses which do not dis- 
tort the drive sufficiently. Furthermore, they 
may not be successful in avoiding situations 
in which they are likely to be dangerously 
stimulated. While these individuals represent 
a challenging clinical problem, it is incorrect 
to designate others, who do not report experi- 
ences of anxiety, as not anxious in the strict 
sense. The only admissible distinction between 
subjects who report and subjects who do not 
report anxiety experiences is on the basis of 
the effectiveness of their defenses, not on the 
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basis of whether or not defenses are being 
elicited. Subjects who do not report anxiety 
experiences are either not anxious about the 
drive, or they avoid anxiety arousing situa- 
tions; or, if they do not avoid them, then 
anxiety never becomes more than a signal be- 
cause their defenses are effective. It should be 
noted that people who resolve underlying con- 
flict with relatively flexible defenses are not 
detectable by means of a questionnaire. 

As for the stress procedure, it seems to ap- 
proximate the specifications of fear arousal 
more than those of anxiety arousal. True, the 
prospects of failing an intelligence test or of 
showing up poorly on a measure of mascu- 
linity are dangerous both internally and ex- 
ternally; such failures would involve a com- 
promise in the eyes of others as well as a drop 
in self-esteem. Yet it seems preferable to in- 
vestigate these effects separately rather than 
under confounded conditions. If it were possi- 
ble to bring about the intensification of a dan- 
gerous drive without simultaneously produc- 
ing an immediate external threat, the condi- 
tions of anxiety arousal specified in the theory 
would seem to be most nearly met. 

In view of these considerations, the follow- 
ing method was proposed. Aggression was 
selected as a drive about which people are 
anxious to varying degrees.’ First, the degree 
of anxiety about aggression was to be deter- 
mined for each subject by noting his extremity 
of defensive distortion in response to an ag- 
gressive assessment stimulus. Then, inde- 
pendently of this determination, subjects were 
to be exposed to an acute aggressive provoca- 
tion. It was assumed that anxiety and defen- 
siveness would automatically be aroused again, 
just as they were in response to the assess- 
ment stimulus, and that the effects of anxiety 
and defensiveness on behavior could then be 
measured. In other words, the subject’s own 
“built-in” predisposition to become anxious 
as a result of drive arousal was to be utilized 
rather than a fictitious external threat. Nor 
was any use to be made of the subjects’ aware- 

ness of their anxiety. 

An anagram solving task was selected as a 

measure of the effects of anxiety on perform- 


3 The choice of aggression is not crucial to the theo- 
retical formulation of this paper. Anxiety about de- 
pendency or sexuality could have been utilized as easily. 
This will be discussed later on. 
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ance. Findings indicate that the effect of 
anxiety upon performance varies with the 
complexity of the task (Farber & Spence, 
1953). A task is considered complex if the 
correct response is either weak in the initial 
response hierarchy or is in competition with 
other equally strong responses, whereas a 
simple tazk, like that in classical conditioning, 
is one in which the correct response is strong 
in’the hierarchy. In the former tasks, anxiety 
has been found to have an impairing but in the 
latter a facilitating effect on performance. 
Anagram solution is a complex task which is 
known to be vulnerable to the effects of anx- 
iety as measured by questionnaire (Maltzman, 
Fox, & Morrisett, 1953). 

The prediction tested by the present experi- 
ment is the following: In the case of subjects 
who are observed to become defensive in reac- 
tion to an aggressive assessment stimulus, an 
experimental intervention designed to arouse 
aggression results in impaired performance on 
a complex problem solving task. Subjects who 
do not become defensive in response to the 
assessment stimulus are not so affected by the 
experimental intervention. 


METHOD 


The experimental subjects were 62 high school 
students who were tested individually during two 
separate sessions. During the first session, a Story 
Completion test, an Anagrams test, and an abbreviated 
Intelligence test were administered (Rosenwald, 1958). 
Thirty-nine of the subjects constituted the experimental 
group. At the beginning of their second session, they 
were subjected to an experimental intervention designed 
to arouse aggression, For the remaining 23 control 
subjects, this intervention was omitted. Following this, 
alternate forms of the three tests were administered to 
all subjects. Differences between the forms of the tests 
were controlled for by the use of a counterbalanced 
design. 

The Story Completion test consisted of five incom- 
plete stories which the subject was asked to complete. 
Each contained an aggressive provocation of the main 
figure. This thematic test was selected as the assess- 
ment stimulus because projective devices are traditional 
for detecting anxiety and defensiveness in the sense 
defined, and because responses to such tests are or- 
dinarily less subject to influence by socially conditioned 
attitudes. Such tests may identify individuals who 
characteristically respond to increases in drive in @ 
defensive manner but are not necessarily aware of theit 
anxiety or of the entailed defensive drive distortion. 
Because projective test responses reflect anxiety an 
defensiveness concerning which the subject has little of 
no control or awareness, they seem particularly suited 
to the present experimental paradigm. 
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The Anagrams test consisted of 10 scrambled four- 
and five-letter words which the subject was asked to 
convert into meaningful words by rearranging the 
letters in his mind. Half the anagram solutions were 
“loaded”—that is, they had aggression related conno- 
tations; the other half was neutral. According to the 
conception of the present study and the findings of 
past research (Maltzman et al., 1953), the anagrams 
task should be subject to the effects of anxiety regard- 
less of the content of the anagrams, The*rationale of 
this prediction is entirely in terms of task complexity, 
and in this respect it differs from studies of perceptual 
defense. “Loaded” stimuli were included out of interest 
in the effects which content may perhaps have beyond 
those postulated. Just as incomplete stories with ag- 
gressive themes are more likely to point up a subject’s 
anxiety about aggression than stories with other themes, 
so the aggressive content of the anagrams may add to 
the anxiety which a subject already experiences as a 
result of the experimental drive arousal. 

The Information subtest of the Wechsler Adult 
Intelligence Scale was administered as a measure of 
intelligence (Wechsler, 1955). 

The experimental intervention consisted of two 
parts. First, the subject was accused of and reprimanded 
for having discussed the experiment with his friends 
after having been asked at the end of the first session 
to refrain from doing so, He was told that this was 
childish, disobedient, and irresponsible. Second, he was 
asked to take a mirror tracing test as part of the study. 
His performance in this was characterized as inade- 
quate, and the experimenter demanded that the subject 
return after school to continue practicing mirror trac- 
ing. The intervention was delivered in a sarcastic, 
belittling manner, and no replies were tolerated. (Mirror 
tracing, a measure external to the experiment proper, 
was chosen for the intervention because of possible 
changes in task set which might have affected perform- 
ance on the Anagrams test had if been used for this 
purpose.) All of the subject’s verbalizations at this 
time were recorded by the experimenter. The demand 
for after school practice was retracted at the end of the 
experiment, and the overall purpose of the study was 
explained. 


RESULTS 


The five story completions obtained in the 
first session were used to obtain an index of 
defensiveness. Each completion was ranked 
with respect to defensiveness in relation to all 
others obtained in response to the same in- 
complete story. A completion was judged to 
be defensive in proportion to the extent 
to which aggression was mot expressed, the 
extent to which the main figure of the story 
was not seen as active in bringing about a 
favorable solution to the presented conflict, 
the extent to which he was mot seen as capable 
of acknowledging the discrepancy of interest 
between himself and the instigator of the ag- 
gressive provocation, and the extent to which 
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TABLE 1 
Exzenpep Mep1an Test (Siegel, 1956) AnD Witson’s 
Distrrsution Test (Wilson, 1956) FOR THE 
DIFFERENCE BETWEEN SOLUTION TIMES ON 
THE First AND SECOND ADMINISTRATION OF 
THE ANAGRAMS TEST 
(High scores indicate relative impairment) 


Experimental Control 
Rae regi ake = 
Daneive Defensive pendii peony 
Above 
Overall Median 14 4 6 7 
Below 5 16 5 5 
x? Total 11.90 ay $ <.001 
x? Defensiveness 6.46 1 p <.02 
x? Treatment 0.62 idf = 
x? Defensiveness 4.82 idf $ <.05 
X Treatment 


Note.—A nonparametric test was employed to permit the pool- 
ing of difference scores obtained from two replications differing in 
the order of administration of parallel test forms and therefore 
not comparable in absolute values of solution time. 


he was not seen as blaming the instigator but 
rather as blaming himself. No attempt was 
made to categorize the completions according 
to specific defense mechanisms exemplified by 
them. Rather, the degree of defensiveness was 
determined by the magnitude of the distor- 
tion. By using an objective scoring manual 
(Rosenwald, 1958), an alternate judge arrived 
at reliably similar rankings, with rho coeff- 
cients ranging from .71 to .92. The ranks ob- 
tained for each of the subject’s five stories 
were summed, yielding his dé&fensiveness 
score. The subjects were then divided at the 
median into a high defensive and a low defen- 
sive group. } 

In comparing the times required for solu- 
tion on the first administration of the Ana- 
grams test, it was found that high defensive 
and low defensive subjects performed com- 
parably. 

An analysis of performance changes from 
the first to the second administration of the 
Anagrams test is shown in Table 1. The break- 
down of the chi square into the components 
attributable to Defensiveness, Treatment, and 
their interaction was undertaken, using Wil- 
son’s (1956) test. It was concluded that the per- 
formance changes of the high defensive and low 
defensive groups were significantly different, the 
former showing impairment and the latter im- 
provement of performance when compared to 
the control subjects. Furthermore, these 
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changes could be attributed to the effects of the 
experimental intervention since the interaction 
of Defensiveness and Treatment was found to 
be significant. 

In addition, there is suggestive but statisti- 
cally unreliable evidence that high defensive 
subjects required longer solution times for 
“loaded” than for neutral anagrams on the 
first administration of the test and that this 
was not so for the low defensive subjects. 
However, the reported effects of the experi- 
mental intervention were not limited to the 
loaded items. If anything, neutral items 
seemed to be affected somewhat more noticea- 
bly by the intervention than were loaded ones. 

The high defensive and low defensive sub- 
jects did not differ in their performance on the 
intelligence scale either before or after the in- 
tervention. 


Discussion 


The above findings may be briefly summa- 
rized as follows: High defensive and low de- 
fensive subjects do not at the outset differ 
from each other in regard to their performance 
on the Anagrams test; they are differentially 
affected by the experimental intervention, 
with the high defensive subjects showing im- 
pairment and the low defensive subjects im- 
provement as measured by solution times, 
and the content of the items does not seem to 
influence their susceptibility to the facilitating 
or impairing effects of the experimental inter- 
vention. 

In connection with the first of these three 
findings, it should be noted that investigators 
assessing anxiety with questionnaires have 
found that subjects differ in performance 
even in the absence of any experimental 
arousal (Farber & Spence, 1953; Taylor & 
Spence, 1952). The failure of the present study 
to corroborate this observation may be taken 
as indirect evidence that Defensiveness rank- 
ings are not interchangeable with scores on an 

anxiety questionnaire, as indeed they were not 
intended to be. The present findings can be 
accounted for without assuming that any of 
the subjects ever became consciously distressed 
or experienced physiological stress symptoms, 
although such a contingency is certainly not 
ruled out. Since the effectiveness of defenses 
is theoretically related to the presence or ab- 
sence of the experiential aspects of anxiety, 
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it is possible that differences in performance 
which have been found in past studies between 
subjects high and low on an anxiety question- 
naire are due tò the degree of defense effective- 
ness rather than to the degree to which the 
subject becomes anxious in the theoretically 
strict sense. The present study is only a first 
attempt tœelucidate experimentally the differ- 
ences between the behavioral consequences of 
oveft anxiety, introspectively observed by the 
subject and reported on a questionnaire, and 
of the disposition to become defensive, meas- 
ured by projective devices and not necessarily 
accompanied by conscious experiences of 
distress. 

An inspection of the items of the Taylor 
(1953) scale suggests that a high score is 
likely to indicate a more or less permanent 
drive or affective state. The rationale and 
findings of the present study, however, ex- 
plicitly provide for and give positive evidence 
of an acute rise in anxiety conditional on an 
acute rise in a dangerous drive. Once again, 
it appears that the present procedure for con- 
trolling anxiety could be applied to people 
who are not characteristically subject to the 
experiential accompaniments of anxiety, even 
though they may be judged anxious and de- 
fensive about a particular drive. The method 
described here might, therefore, facilitate the 
conceptual disentanglement of the effects of 
anxiety and defense. 

The first and second findings, taken to- 
gether, suggest strongly that the defensive- 
ness rankings measure what it was hoped they 
would measure: not the person’s usual or 
typical level of anxiety, but his disposition to 
become anxious and to defend when danger- 
ously aroused. In accordance with the ration- 
ale of this study, it should be possible to repli- 
cate the reported findings by raising the level 
of any other drive in subjecte»for whom that 
drive is known to be dangerous. 

i With regard to the second finding, the ques- 
tion arises as to what psychological mecha- 
nisms were responsible for the observed per- 
formance changes. It may be argued that a 
high defensive subject could so distort the 
meaning of the experimental intervention and 
of his reaction to it that he might ignore its 
Provocative implications. After all, the inter- 
vention is a stimulus associated with drive 
arousal just as the incomplete stories were- 
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In that case, should the intervention not fail 
to affect him? This is certainly possible, but 
it must be remembered that anxiety always 
occurs in combination with défensive efforts 
of varying kinds and degrees of success. At 
the present state of knowledge, the effects of 
anxiety and defense on cognitive behavior 
cannot yet be distinguished eithersexperimen- 
tally or even conceptually. In other words, 
the high defensive subjects’ test performance 
was impaired by anxiety or the defensive dis- 
tortion of the experimental intervention or 
both. Parallel with this, it is unclear whether 
investigators of anxiety who have not been 
systematically concerned with the effects of 
defense were justified in attributing their 
findings only to the effects of anxiety. Perhaps 
their data should be viewed in part from the 
standpoint of the defenses which are automati- 
cally aroused by the anxiety stimulus. 

The separation of the effects of anxiety from 
those of defense would have more than se- 
mantic importance. It is conceivable that 
anxiety affects performance through the med- 
iation of drive increments and irrelevant 
responses (Child, 1954), whereas defense may 
exert its influence through the mediation of 
cognitive rigidity and the narrowing of atten- 
tion and associative processes.‘ Whether or 
not a particular task performance is affected 
by anxiety and defensiveness would then de- 
pend on the effectiveness of an individual’s 
defenses and on the vulnerability of the se- 
lected task to one or the other type of mediat- 
ing mechanism. For instance, if performance of 
a task is most vulnerable to cognitive rigidity, 
then a subject who is only moderately defen- 
sive and experiences a good deal of distress 
will not be so severely penalized in his per- 
formance as another subject whose defenses 
are intensely and pervasively constricting and 
who is, thereforezfree from experiences of overt 
distress and physiological stress. The analysis 
of intellectual functioning in different diag- 
Nostic groups tends to bear this out (Rapa- 
Port, Gill, & Schafer, 1945). 

Little is known at the present time that 
would easily account for the improvement of 
the low defensive subjects. It is possible that 
unanxious, and therefore undistorted, experi- 

“The effect of rigid defenses and controls on one 


phase of creative and inventive behavior has been dis- 
cussed by Schafer (1958). 
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ence of increased aggression provided them 
with greater energy, perhaps even with vin- 
dictive overtones. (“I'll show him who is im- 
mature!”) Although the finding is not sur- 
prising in the light of clinical experience, there 
is little if any research support for the assump- 
tion that the aggressive drive can at times 
be utilized in this constructive way. Another 
possible explanation is that this added drive 
had merely a distracting effect on the low 
defensive subjects and that they overcame it 
by intensifying their efforts to attend only 
to the task. Whether or not any drive arousal 
will have such a facilitating effect on subjects 
who experience little anxiety about it is not 
known. It is quite possible that aggression, 
because of its nature, is more likely to have 
such energizing effects than is, say, an arousal 
of dependency needs. Furthermore, one of 
the criteria used for categorizing subjects as 
high defensive or low defensive was whether 
or not they made reference to active problem 
solving behavior on the part of the main 
figure in their story completions. The low de- 
fensive subjects’ superiority on the Anagrams 
test may be looked upon as another facet of 
their greater concern with or investment in 
problem solving. Just as these subjects in- 
volved their story figures in active problem 
solving attempts, so they became themselves 
involved in active attempts to restructure the 
anagram stimuli. 

The findings of the experimentél groups are 
interesting also in that they are consistent 
with economic considerations in psychoana- 
lytic theory. It is assumed that the ego per- 
forms its adaptive functions, of which prob- 
lem solving is one, with energy which is of 
instinctual origin but has undergone varying 
degrees of neutralization. The capacity to 
neutralize instinctual energy is thought to 
depend on the person’s relative freedom from 
anxiety concerning the instinct in question. 
That is, defensiveness tends to preclude neu- 
tralization. The present findings can be ex- 
plained as follows. The low defensive group 
had a greater quantity of neutralized, dis- 
chargeable energy available for problem 
solving and therefore showed an improvement, 
whereas the high defensive group had to 
block the aroused aggressive energy and there- 
fore pre-empted it from adaptive utilization, 
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These economic considerations seem the more 
applicable because the groups performed com- 
parably in the absence of energic arousal and 
because the stimulus content was noncontribu- 
tory to the observed effects. 

An incidental observation needs to be men- 
tioned in connection with the second finding. 
An analysis of the experimental subjects’ 
verbal reactions to the experimental interven- 
tion revealed a tendency for high defensive 
subjects to object to the experimenter’s accu- 
sation and arbitrariness more frequently than 
was true of low defensive subjects. Although 
this finding falls far short of statistical signifi- 
cance, failure to find a significant difference in 
the opposite direction serves as a reminder 
that one may not freely predict diffident and 
meek behavior from the presence of defensive- 
ness in projective test responses. In many 
everyday situations, such defenses are over- 
laid with socially conditioned counterdefenses 
against passivity and timidity. In neurotic 
patients, considerable therapeutic work may 
often be required to bring to the surface the 
submissiveness which hides behind impressive 
swagger and assertiveness. However, this rela- 
tionship between fantasy defensiveness and 
action assertiveness stands in need of further 
exploration. 

The third finding suggests that theories 
derived from the phenomena of perceptual de- 
fense may not provide adequate explanations 
of the present observations. Apparently, psy- 
chological functions like those involved in the 
solution of anagrams can be subject to whole- 
sale interference which transcends particular 
areas of tabu content. It is suggested that al- 
though anxiety is aroused by drives which 
have specifiable content, the effects, facilitat- 
ing or inhibiting, of anxiety and defensiveness 
on performance may have a wider sphere of 
influence than the anxiety arousing content. 
There is no a priori reason to assume that 
anxiety will affect a given type of cognitive 
behavior differentially, depending on what 
drive gave rise to the anxiety. This too is 
borne out by the analysis of intellectual func- 
tions in patient groups (Rapaport et al., 

1945). 

Finally, it could be argued that the effect 
of the experimental intervention operated 
through a decrease in the cooperativeness or 
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incentive of the high defensive subjects. The 
best reply to this and similar explanations is a 
question: Why should the experimental treat- 
ment have different effects on the cooperative- 
ness of people who have high and low defen- 
siveness ratings? The advantage of combining 
predispositional and experimental controls is 
that the number of plausible explanations of 
the obtained effects can be greatly reduced. 


a 


SUMMARY 


The psychoanalytic theory of anxiety and 
defense was reviewed. Following this, weak- 
nesses were pointed out in experimental 
methods which apply the term “anxiety” to 
overt experiences of subjective distress accom- 
panied by physiological distress symptoms, or 
which seek to control the level of anxiety by 
means of external threats. A new procedure 
was proposed and tested, consisting in the 
experimental arousal of a motive known to be 
psychologically. unacceptable or dangerous for 
the subject. Anxiety is thought of as the auto- 
matic accompaniment of such arousals, re- 
gardless of its experiential conspicuousness to 
the subject himself. 

A thematic test was utilized to assess de- 
fensiveness about aggression. Subsequently, 
an experimental intervention, designed to 
arouse aggression, was administered to the 
subjects, and the effects of this intervention 
on the performance of a problem solving task 
were measured. Findings were as follows: 

1. Under drive arousal, high defensive sub- 
jects showed impairment and low defensive 
subjects improvement in performance. 

2. Subjects did not differ in level of per- 
formance before drive arousal. 

3. The content of the stimulus material 
used to measure problem solving ability did 
not seem to contribute to the vulnerability of 
performance to the effects of-anxiety and de- 
fensiveness. 

This reformulation of anxiety appears to 
hold promise for the conceptual and experi- 
mental separation of the effects of anxiety 
and defense. 
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ITHIN recent years considerable 
W attention has been directed to the 
search for a minimum number of 
dimensions in terms of which to describe a 
person’s overt behavior and informal status 
in small groups. Perhaps as a natural antidote 
to the growing number of performance meas- 
ures, factor analytic techniques have been 
employed in an effort to demonstrate that a 
large proportion of the variance of these per- 
formance measures may be accounted for by 
a small number of underlying dimensions. The 
major advantages of being able to specify such 
a set of factors include greater economy and 
efficiency of effort in measuring performance, 
greater consistency of research methods 
across studies, and the opportunity to consider 
the theoretical significance of the factors that 
are found to be of primary importance. 
Carter (1954), in his review of several fac- 
tor analytic studies of individual performance, 
concluded that three factors recur with suffi- 
cient regularity to constitute at least a pre- 
liminary set of dimensions for the field. He 
labeled these factors: Individual Prominence 
and Achievement, Aiding Attainment by the 
Group, and Sociability. Bales (1956), in his 
discussion of Carter’s synthesis, proposed that 
the factors be labeled: Activity, Task Ability, 
and Likeability. However, subsequent studies 
have resulted in less than perfect agreement 
with the three-factor scheme proposed by 
Carter and Bales. Borgatta, Cottrell, and 
Mann (1958), in their factor analysis of 40 
personality traits and behavioral descriptions, 
found five factors, three of which correspond 
to Carter’s proposed set. The remaining two 
factors they label Manifest Emotionality and 
Task Interest. On the other hand, Slater (1955) 
found only two significant factors of per- 
formance in small groups engaged in discuss- 


1 This article is based upon a dissertation submitted 
in partial fulfillment of the requirements for the degree 
of Doctor of Philosophy at the University of Michigan. 

2 The author wishes to express his appreciation to 
Roger W. Heyns for his guidance and encouragement. 
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ing human relations problems. His first factor, 
Task Prontinence, represents a fusion of Car- 
ter’s first two factors, those which Bales calls 
Activity and Task Ability; Slater’s second 
factor, Likeability, is in close agreement with 
Carter’s Sociability factor. Thus, the three 
factors proposed by Carter and Bales appear 
under some conditions to underestimate and 
under other conditions to overestimate the num- 
ber of significant dimensions of performance 
that are found. 

Unfortunately, the research methods used 
in the various studies have been so divergent 
that any attempt to make use of previous re- 
search to determine the critical conditions 
under which the nature and number of per- 
formance factors vary is bound to be a haz- 
ardous enterprise. Among other things, the 
groups studied by the various investigators 
have differed in the task assigned to them, the 
length of previous interaction, size, the use of 
observer scoring techniques vs. peer ratings, 
and the nature of the populations from which 
the samples were drawn. While Carter’s (1954) 
list may be a useful beginning, the question of 
how stable such a set may be under various 
conditions clearly deserves more attention. 

The purpose of the present research was to 
determine what effect varying the group prob- 
lem would have upon the dimensions of in- 
dividual performance. The group problems 
were chosen with direct reference to the di- 
chotomy drawn by Bales (1950) and Parsons 
(1951) between task groups in which adaptive- 
instrumental goals predominate and social- 
emotional groups in which integrative-expres- 
sive goals predominate. In Maier’s (1952) 
terms, task groups are more concerned with 
the quality of their solution, whereas social- 
emotional groups are more concerned with 
reaching maximum consensus. The main ques- 
tion for this study, then, was whether the 
underlying dimensions of performance remain 
constant across task and social-emotional 
conditions. 
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METHOD 


Subjects 


The subjects were 100 male undergraduates at the 
University of Michigan, drawn from the most recently 
initiated members of 16 undergraduate social fraterni- 
ties. With the exception of 14 sophomores and 1 senior, 
all the subjects were freshmen. The subjects were 
recruited through their pledge-class presidents and 
were offered $5.00 for roughly 5 hours of experimental 
time. 


Experimental Groups 


Each subject participated in two five-man group 
discussions, once under each experimental condition, 
no two subjects appearing together in both task and 
social-emotional groups. Before the groups were com- 
posed, each subject was asked to indicate which of 
the other 99 subjects he knew. Of the 400 pairs of 
subjects in the 40 experimental groups, in only 15 did 
one member say he had a passing acquaintance with 
the other member; in only three of these cases was 
the recognition reciprócated. The experimental sessions 
were conducted in the Group Dynamics Laboratory at 
the University of Michigan, which is equipped with an 
observation room with a two-way sound system and a 
one-way observation screen. Half of the subjects were 
placed in the Task condition first, the other half in the 
Social-Emotional condition first. 

Task condition. The 20 groups under the Task condi- 
tion worked on The Mined Road Problem, a situation 
test developed by the OSS Assessment Staff and modi- 
fied for group discussion by Maier. The group was told 
to devise the best possible solution to a situation con- 
fronting five soldiers who must cross a mined road, 
using only miscellaneous debris found at the side of 
the road. The groups were allowed 50 minutes to arrive 
at their solution. A bonus of $25.00 was offered to the 
group with the best solution as judged by scoring 
criteria previously developed by Hoffman (1957). 

Social-Emotional condition. The 20 groups under the 
Social-Emotional condition were asked to discuss how 
fraternities should handle the period of pledge train- 
ing.* Their assignment was to arrive at a group state- 
ment of “the ideal policy and general program for 
handling pledge training in the houses.” It was empha- 
sized that there are no right or wrong answers to the 
question, but only various opinions regarding the ideal 
policy. Since eacksubject had recently undergone a 
long period of pledgeship prior to his initiation, and 
since each fraternity had its own unique training prac- 
tices, a divergence of opinion among the subjects was 
expected. The groups were told that after the 50-minute 
discussion they would rate each fraternity represented 
in the group on how closely its program of pledgeship 
approximated the ideal program upon which the group 
had settled. Foreknowledge of this rating was intro- 
duced in an effort to increase the tendency on the part 


2 The period of “pledgeship” intervenes between the 
first and tentative acceptance of the potential frater- 
nity member and his final acceptance and initiation. 


675 


of each subject to defend the system his fraternity 
employed. 

Final session questionnaire. After their second ex- 
perience with group interaction the subjects were asked 
to contrast the two groups. The Final Session Question- 
naire contained nine questions that served as a check 
on the effectiveness of the experimental manipulation, 
and specific predictions were made about the answer 
to each question. Subjects under the Task condition 
were expected to feel that (a) there were clearer grounds 
for deciding who was right or wrong and (b) there was 
greater need for intelligent, impersonal, and objective 
thinking. Under the Social-Emotional condition sub- 
jects were expected to feel (a) more concerned that 
their opinions be accepted, (b) more nervous or tense, 
(c) more annoyed or angry, (d) greater need for every- 
one to be sensitive to the feelings of others, (e) more 
judged or criticized as a person, (f) aware of greater 
competition for leadership, and (g) more bored. 


Performance Measures 


The subject’s performance under the Task and 
Social-Emotional conditions was described by 35 
measures of his overt behavior and informal status. 
Ten of the measures were derived from the observer’s 
scoring of the act-by-act behavior of each subject; 16 
of them were based on the peer ratings made by each 
subject of each other subject in his group after the 50- 
minute interaction was over; and 9 of them were 
observer ratings of each subject in the group, made 
after the interaction was over. 

Observer scoring measures. The observer scoring sys- 
tem used Bales’ (1950) interaction process categories 
in a modified form; some of Bales’ 12 categories were 
collapsed to form a 7-category system. The 10 observer 
scoring measures were: the raw number of acts initi- 
ated in minutes 1-16, minutes 17-33, and minutes 
34-50; and the percentage of total acts in Category 1, 
Agreement and Solidarity; Category 2, Laughing and 
Joking; Category 3, Suggestions; Category 4, Opin- 
ions and Orientations; Category 5, Questions; Category 
6, Disagreement and ‘Antagonism; and Category 7, 
Tension. 

Peer rating measures. At the end of the 50-minute 
interaction each subject rated each other member of 
his group on eight informal status scales and eight be- 
havior description scales, using a 10-point scale with 
no ties permitted. The description of the subject used 
in these analyses is the sum of the ratings received from 
the four other group members. The 16 peer rating 
measures were: Talking, Valuable (“. .. how valuable 
was he in guiding the discussion and getting things 
done?”), Liking, Leadership, Concern (“... how much 
concern had he for the feelings of others in the group?”), 
Influence (“. . . how much his suggestions were accepted 
by the other members of the group?”), Pleasant KAST 
how pleasant it was to be in the same group with him?”’) $ 
Agreement (this and the seven behavior description 
ratings to follow differ only in the insertion of a different 
behavior in the question, “Regardless of how much he 
talked altogether, how would you rate each member of 
this group, excluding yourself, on how much he tended 
to agree with what others had said?”), Laughing and 
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Joking, Suggestions, Opinions and Orientations, 
Questions, Disagreement, Tension, Agressiveness, and 
Future Leader (‘... how much would you like to see 
him as leader of this group, if the group were to meet 
again with a formal leader appointed?”). 

Observer rating measures. After the interaction was 
over, and while the group was completing its peer 
ratings, the experimenter rated each group member on 
nine scales, using a 10-point scale with no ties allowed. 
Although the ratings were made without reference to 
the scoring sheets, they were made by the same person 
who had scored the act-by-act behavior of the subjects 
in the group. The nine observer ratings were: Talking, 
Leader, Liking, Agreement, Laughing and Joking, 
Suggestions, Questions, Disagreement, and Tension. 


RESULTS 
Effectiveness of the Experimental Manipulation 


Two comparisons between Task and Social- 
Emotional conditions may be made in an 
attempt to assess the effectiveness of the ex- 
perimental manipulations. First, the Final 
Session Questionnaire may be examined to see 
whether the subjects perceived any differences 
between conditions, and, second, a comparison 
between’ conditions may be made in terms of 
the distribution of acts scored by the observer 
among the seven categories. 

Table 1 presents the distribution of answers 
to the Final Session Questionnaire. Seven of 
the nine hypothesized differences between 
conditions were borne out by the data, and 
four of the seven differences in the predicted 
direction were statistically significant. The 
clearest difference between the two conditions 
is seen in the strong tendency to feel that the 
Task condition provided clearer grounds for 
deciding who was right or wrong as contrasted 
with the strong tendency to feel that there was 
greater need for sensitivity to other people’s 
feelings in the Social-Emotional condition. 
Contrary to expectation, there was a clear 
tendency for subjects to report more anger in 
the Task condition. 

Table 2 presents the distribution of overt 
behavior into the seven categories employed 
by the observer. Under the Task condition 
there was a stronger tendency to give sugges- 
tions and to disagree, whereas under the Social- 
Emotional condition there was a stronger 
tendency to laugh and tell jokes, to give opin- 
ions and orientations, and to show tension. 
With the exception of Category 5, Questions, 
the differences between conditions are sig- 
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TABLE 1 

RESPONSES TO THE FINAL SESSION QUESTIONNAIRE: 
Number of sub- 

4 jects selecting each 

condition 
Question 
Social- 
Task Emotional 


In which group did you feel more | 42 58 
concerned that your opinions be 
aceepted? 

In which group did you feel more | 47 53 
nervous or tense? 

In which group did you feel more an- | 59 41 
noyed or angry? 


In which group were there clearer | 75 25%% 
grounds for deciding who was 
right or wrong? 

In which group was there the greater | 15 85** 


need for everyone to be sensitive to 
other people’s feelings? 

In which group was there the greater | 49 51 
need for intelligent, impersonal, 
and objective thinking? 

In which group did you feel more | 23 
that you were being judged or 
criticized as a person? 

In which group did le com- | 49 51 
pete more for the leadership of the 
group? 

In which group did you feel more 
bored with what was going on? 


77** 


41 59* 


* Deviation from 50-50 significant at .05 level, using one-tailed 
test. 

** Deviation from 50-50 significant at .01 level, using one-tailed 
test. 


TABLE 2 
PERCENTAGE oF ToraL Acts in Each CATEGORY 
UNDER TASK AND SocIAL-EMOTIONAL 


CONDITIONS 
Condition 
Category RER cee 
Task | pmotiona 
1. Agreement and Solidarity 13.0 | 15.0 
2. Laughing and Joking 4.9 8.0 
3. Suggestions 20.1 | 6.2 
4. Opinions and Orientations 45.2 | 54.6 
5. Questions 5.1 5:5 
6. Disagreement and Antagonism {® 8.8 5.0 
7. Tension 2.9 5.6 
Total 100.0 | 99.9 
Base number 14,901 | 16,550 


nificant at beyond the .001 level, using the 4 
test for the difference between proportions. 
The experimental manipulation of the group 
problem appears to have succeeded in creating 
reasonable approximations of task and social- 
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emotional groups. Judging from the perceived 
clarity of evaluative standards, the greater 
frequency of attempted solutions, and the 
greater amount of disagreement, the group 
members under the Task condition were con- 
cerned primarily with arriving at a good solu- 
tion, hopefully the best one. On the other hand, 
the subjects’ perception that they are being 
judged, their concern that group members be 
sensitive to the feelings of others, the high 
level of overt tension, and, finally, the high 
level of laughing and joking within the group 
indicate that the social-emotional session was 
characterized by more persistent effort to solve 
the interpersonal problems generated by the 
inherent lack of consensus. It remains, then, to 
determine whether the underlying dimensions 
of performance remain the same under these 
two demonstrably different conditions. 


Performance Factors 


The 35 performance variables were inter- 
correlated separately for each condition, using 
product-moment correlations. Each 35 X 35 
matrix was then subjected to a centroid mul- 
tiple factor analysis.‘ Six factors were extracted 
from each matrix, the factors were rotated to 
an orthogonal solution by the normalized 
varimax criterion developed by Kaiser (1957). 

Table 3 presents the first four factors of the 
rotated factor matrices under the two con- 
ditions. Under the Task condition the six 
factors account for 66.6% of the variance of 
the performance measures; under the Social- 
Emotional condition the six factors extracted 
account for 64.9% of the variance. For the 
purposes of this discussion only the first four 
factors in each matrix are presented since 
Factors VI and VII under both conditions 
account for little of the total variance of the 
Measures and do not suggest any secure inter- 
pretation. £ 

The first factor in both factor matrices, 
which may be labeled Task Prominence, 0c- 
curs in much the same form across conditions. 
Table 4 shows the factor loadings of the vari- 
ables with the 10 highest loadings under each 
condition. The variables with the highest 


‘The correlation matrices and unrotated factor 
matrices may be found in Mann (1959). Copies of that 
work may be obtained from University Microfilm, Ann 
Arbor, Michigan. 
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loadings are the behavioral measures of total 
activity, giving opinions and orientations, and 
giving suggestions, togéther with the peer 
rating measures of total activity and task 
leadership (Leadership, Valuable, and In- 
fluence). One crude indication of the similarity 
between the Task Prominence factors under 
the two conditions is the fact that the rank 
order correlation between them is .90. 

A second point of similarity between the 
factor structures under the two conditions is 
found in the examination of Factor ITI under 
the Task condition and Factor II under the 
Social-Emotional condition. All three peer 
rating measures of popularity (Liking, Con- 
cern, and Pleasant) are highly loaded on these 
factors, and they may be labeled the Like- 
ability factor. However, the similarity between 
the Likeability factors under the two condi- 
tions lies only in the measures of popularity; 
several measures of the subject’s behavior do 
not have similar loadings on these two factors. 
Table 5 shows the variables with loadings of 
greater than .40 on the Likeability factors. 
Under the Task condition, the three measures 
of laughing and joking all have loadings over 
40 on the Likeability factor, whereas the 
highest loading for any of the three measures 
under the Social-Emotional condition is .16. 
The behavioral measures that do load the 
Likeability factor highly under the Social- 
Emotional condition are the various measures 
of agreement and disagreement. AlF but one of 
the six measures of the subject’s tendency to 
agree or disagree with others have loadings 
greater than .40 on the Likeability factor under 
the Social-Emotional condition; none of the 
six measures has this high a loading under the 
Task condition. Finally, there is a slight 
tendency for the peer ratings on how much 
influence the subject had and how he was 
rated as a potential future leader of the group 
to be related to the Likeability factor more 
positively under the Social-Emotional con- 
dition. 

The correspondence between Factor IV under 
the Task condition and Factor TIL under the 
Social-Emotional condition, although slight, 
deserves mention. As may be seen in Table 6 
both factors have loadings greater than .40 
on two of the three measures of tension, but 
only the high loading for the observer’s scoring 
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TABLE 3 
ROTATED FACTOR MATRICES UNDER TASK AND SOCIAL-EMOTIONAL CONDITIONS 
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Task Condition 


Social-Emotional Condition 


Factor Variable Factor 
h h 
I I mI IV I Ir m IV 
Observer Scoring 
ól —03 18 —30 55 Acts first third 4 —-04 j-04 -il 58 
15 01 08 —25 77 Acts second thifd 78 —05 —23 05 68 
64 04 08 —29 60 Acts third third 87 —14 —09 10 81 
il 31 24 15 47 % Agreement 01 46 03 11 44 
—40 —16 57 —12 54 % Laughing —26 08 + —06 57 aay 
19 14 -25 —50 37 % Suggestions 17 —15 03 —39 35 
45 09 —10 07 26 % Opinions 12 10 —14 —14 64 
—18 “4 -i 67 62 % Questions 25 -06 -11 -32 45 
07 —71 06 —01 52 % Disagreement 09 —64 26 —00 60 
—29 —31 =12 59 58 % Tension —46 —19 43 —18 51 
Peer Rating 
88 05 07 —37 93 Talking 93 05 —07 01 88 
90 12 24 =—21 92 Valuable 80 31 —13 —26 87 
49 25 66 —08 77 Liking 42 69 05 23 71 
92 05 15 —25 93 Leadership 90 21 —06 —12 89 
15 24 71 18 64 Concern 07 70 —08 —12 54 
18 08 20- — 22,72 Influence 65 $5 -10 -09 75 
47 26 76 —06 87 Pleasant 38 72 —03 23 73 
—46 23 39 43 73 Agreement =A EERI 19), SA 
23 01 72 —00 65 Laughing 45 07 00 49 48 
90 12 17 —09 86 Suggestions 73 32 —06 —22 76 
93 14 20 —14 94 Opinions 79 02 —05 07 71 
47 22 28 14 48 Questions 42 32 —07 —20 61 
58 —19 —07 ot 55 Disagreement 35 —45 19 04 78 
-17 —06 —04 59 (56 Tension —50 —12 06 01 42 
15 06 01 -15 78 Agressive T 96 12 j4 38i 
67 33 35 11 76 Future Leader 75 46 —10 —19 84 
Observer Rating 
73 A09 01 —16 90 Talking 77—07 -25 06 -75 
66 12 07 —16 82 Leader 78 08—33 «4-05 813 | 
34 64 24 —0 71 Liking 42 2 -6 09 68 l 
05 72 26 01 60 % Agreement 24 39 —62 19 63 
11 15 45 -13 46 h Laughing 35 16 —27 A3 49 
67 04 -O1 —15 83 4% Suggestions 70. —00 20-03, 74 
38 49 01 09 46 % Questions 49 06 —36 —05 64 
5 -7 -25 30 73 f Disagreement 25 —55 38 —06 62 
—10 —26 05 2 4 % Tension —21 —33 63 -08 56 


Note.—Decimal points have been omitted. The 4? column is based 


of tension is maintained across conditions. The 
two factors may be labeled Tension in defer- 
ence to their slight similarity, but the pattern 
of loadings is not consistent. 

Finally, two factors for which no parallels 
appear may be noted. Factor II in the Task 
condition is highly loaded by three of the four 
observer measures of agreement and disagree- 
ment, as shown in Table 7, but the peer ratings 
on these behaviors are conspicuously missing. 
Factor IV in the Social-Emotional condition 


upon the six unrotated factors. 


has three loadings over .40, a8 shown in Table 

8, all of which are measures of laughing and 

joking. These two factors which are unmatched 

with a corresponding factor in the other con- | 
dition represent the behaviors that fail to be 
related to the Likeability factor in that con- 
dition. Thus, in the task session, when Like- 
ability is defined by the measures of popularity 
and by laughing and joking, some of the meas- 
ures of agreement and disagreement define 4 
separate factor. In the Social-Emotional con- 
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TABLE 4 TABLE 6 
Tue TASK PROMINENCE FACTOR Tue TENSION FACTOR 
= 
Variable EN Condition 
‘al Social- Variable ae 
Task Emotional Task  pmotional 
Leadership (P) 92 -90 Tension (OS) 59 43 
Talking (P) 88 93 Tension (OR) 21 63 
Opinions (P; 93:3 79 Tension (P) 59 .06 
Valuable (P; 90 -80 Liking (OR) —.01 —.64 
Suggestions (P) -90 -73 Agreement (OR) 01 —.62 
Acts second third (OS) 75 -B Questions (OS) ap iu 
Ate tei thaed (OS) ‘aT per Si a a 
ri r . . t . en 
eee (oR) eu 13 Agreement (P) 43 ii 
ggressive . : Note,—OS indicates an observer scoring measure, P a pe 
nee ® ; 18 aps rating measure, and OR an observer rating measure. a 
Acts first third (OS) 61 74 TABLE 7 


Note,—OS indicates an observer scoring measure, P a peer rat- 


THE AGREEMENT-DISAGREEMENT FACTOR 


ing measure, and OR an observer rating measure. 
TABLE 5 Vian Condition, 
Tue LIKEABILITY FACTOR AN Task 
Condition Agreement (OR) Ap 
Variable Social- Disagreement (OS) —.11 
Task Emotional Disagreement (OR) c | 
Liking (OR) 164 
Pleasant (P) 76 72 
Concern (P) nAi -70 Note.—OS indicates an observer scoring measure, OR an ob- 
Liking (P) .66 -69 server rating measure. 
ooi and Jokie @. ae 07 
aughing and Joking (O: k . 
Agreement (ey E OY 39 -69 TABLE 8 
Disagreement (OS) .06 —.64 Tue LAUGHING AND JOKING FACTOR 
Disagreement (OR) —.25 —.55 
us) (P) .20 55 ind Condition 
greement (OS 24 46 ‘ariable See 
pome Leader P) (35 “46 Social-Emotional 
isagreement (P) -.07 —-45 Laughing and Joking 08) F s 
ughing : Behe 
Note.—OS indicates an observer scoring measure, P a peer rat- EER ai foring OR) 43 


ing measure, and OR an observer rating measure. 


dition, when the Likeability factor is defined 
by the same measures of popularity and the 
behavioral measures of agreement and dis- 
agreement, all three measures of laughing and 
joking define an independent factor. 


e DISCUSSION 


One of the more striking findings of the 
present study is the high consistency with 
which a Task Prominence factor, closely re- 
sembling the one found by Slater (1955), 
emerges as the main factor of performance 
under both conditions. Despite considerable 
evidence that the two experimental conditions 
differ not only in the instructions given by the 
experimenter, but in the overt behavior and 
the subjective feelings of the group members, 
the Task Prominence factor remains remark- 


Note.—OS indicates an observer scoring measure, P a peer 
rating measure, and OR an observer rating measure. 


ably constant across conditions. In each con- 
dition, behavioral measures of the total amount 
of participation combine with status measures 
of leadership and influence to define the factor. 

The Task Prominence factors isolated in 
the present study and in Slater’s (1955) re- 
search combine the Activity and Task Ability 
factors described by Carter (1954) and Bales 
(1956) as independent dimensions. This dis- 


crepancy deserves further attention. The ex- 
EN ions that differentiate this 


imental condit: lifer 4 
and Slater’s study from studies in which the 


Activity and Task Ability factors do not fuse 
offer some clues to understanding the condi- 
tions under which they tend to fuse. Slater’s 
study and the present one are similar in inves- 
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tigating individual performance in ad hoc 
groups whose members were, by and large, 
unacquainted with one another prior to the 
experimental situation. In contrast, most of 
the studies that report independent Activity 
and Task Ability factors employed natural 
groups in which there was considerably more 
history to the group before the measures were 
taken. Perhaps a gradual differentiation takes 
place over time between the two aspects of 
Task Prominence, namely, Activity and Task 
Ability. This possibility derives some support 
from Slater’s finding that a gradual differenti- 
ation does take place between the most active 
and the most valuable member of the group, 
since most active and most valuable corre- 
spond closely to the Activity and Task Ability 
factors isolated by Carter. 

While the sociometric measures of popularity 
define an independent factor under both Task 
and Social-Emotional conditions in the present 
study, the behavioral measures that share in 
the definition of the factors vary markedly 
from one condition to the other. Laughing and 
joking behavior is highly related to Likeability 
only under the Task condition, whereas agree- 
ment and the avoidance of disagreement are 
strongly related to Likeability only under the 
Social-Emotional condition. These findings 
both replicate and challenge the results of 
previous research. 

In light of the differences between the con- 
ditions thaf may be inferred from the responses 
to the Final Session Questionnaire, these find- 
ings suggest a number of conclusions about the 
dynamics of popularity. When the group is 
engaged in a formal, intellectual task, one in 
which there is both competition with other 

groups and an implied standard of excellence, 
the persons who will be liked appear to be 
those who can break through the tension by 
laughing and joking. Such behavior may also 
serve to reduce the external threat, represented 
by a difficult intellectual task, allowing the 
group members to maintain the spirit of a 
lighthearted game, rather than a test of their 
intelligence. In the Task condition most of the 
jokes tended to be in the form of ridiculous 
proposals for the solution to The Mined Road 
Problem, proposals offered in jest and received 
with laughter. Turning to the independence of 
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variables concerning agreement and 
ment from the Likeability factor, 
Task condition, perhaps the most 
approach to this finding would be to 
impact of agreement or disagreement 
light of the task contributions to whi 
are directed. When the content of the d 
sion is focused on an external task, lik 
Mined Road Problem, reactions of apy 
or disapproval apparently do not tend to 
significantly the affective or liking rel 
ships in the group. Although these data 
no direct test of the view, the measure: 
agreement and disagreement may be 
pendent of Likeability because such a larg 
proportion of the task acts to which they a 
addressed are merely statements of fact a 
The Mined Road Problem, or the entire 
lem may have remained sufficiently dist 
from the fundamental ego involvements of i 
subjects to evoke only superficial invol 
in their proposals for its solution. 

The clearest differences between the Ti 
and Social-Emotional conditions, the 
level of tension and the greater sense of 
sonal threat and criticism under the 
Emotional condition, offer some lever: 
understanding how individual behavior rela 
to popularity in the Social-Emotional cor - 


group members, all of whom were neophy 
of different fraternities, seem to have b 
highly involved in the content of the d 
sion. Under such conditions, the person 
showed agreement and avoided disagree 
with others was likely to be well liked. 
ever, as the discussion increased in perst 
relevance, the danger that laughing and joki 
would provoke resentment and tension 
creased correspondingly. Whereas in the 

condition the jokes were directed out of 
group toward the experimenter and his fi 
trating intellectual problem, in the 
Emotional condition the jokes were direc 
within the group toward the activities of 
fraternities represented around the table. Vi 
likely a sufficient number of these jokes, 
haps even ones that evoked laughter from | 
group, carried with them a note of scorn eit 
for some other fraternity and its ways 0 
the whole fraternity system. 
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Summing up these speculations about the 
Likeability factor, these data suggest that the 
way in which overt behavior relates to popu- 
larity varies with the extené to which the 
group is task- or social-emotionally oriented. 
As the tension and integrative needs of the 
group increase, agreement and the avoidance 
of disagreement become important to the 
group members, and liking tends to be given 
to those who show the most support for others’ 
positions; at the same time, laughing and 
joking do not produce liking in others, as they 
do in the task oriented groups, because the 
target of the jokes shifts from outside the 
group to inside, where feelings are considerably 
more sensitive. 

In conclusion, the systematic study of per- 
formance factors as they vary under Task and 
Social-Emotional conditions challenges the 
three-factor proposals of Carter (1954) and 
Bales (1956) in several ways. The factors of 
Activity and Task Ability that they propose 
fuse into one Task Prominence factor in both 
conditions, a result that suggests the need for 
more exact knowledge about how Activity 
and Task Ability come to be differentiated 
from each other. Their Likeability factor turns 
out to be stable over both experimental con- 
ditions only with regard to the postmeeting 
measures of informal status; the behavioral 
measures that share in the definition of this 
factor vary markedly across conditions. Fur- 
ther consideration of these behavioral variables 
may extend our conception of the Likeability 
factor beyond its present bounds. Finally, the 
three-factor proposals of Carter and Bales fail 
to include a Tension factor, which appeared 
in both conditions, and the behavioral factors 
of Agreement-Disagreement and Laughing and 
Joking, which emerge as independent factors 
when the behaviors involved are not related 
to the Likeability factor. It might be wiser to 
avoid premature closure on any set of per- 
formance factors until more is known about 
the effect that variation in the situation, the 
size and nature of the group, or other experi- 
mental procedures has upon the resultant fac- 
tor structure. The present study suggests some 
of the outlines of the more complex view of 
performance factors that may emerge from 
such needed research. 
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SUMMARY 


The proposal by Carter (1954) and Bales 
(1956) that three major factors account for 
much of the variance in measures of perform- 
ance in small groups was subjected to further 
test by obtaining factor structures from per- 
formance measures under two demonstrably 
different experimental conditions. In the Task 
condition the group was oriented to an external 
problem, an intellectual task, and in the Social- 
Emotional condition a topic of high personal 
relevance created problems of integration and 
tension reduction, Twenty five-man groups 
were run under each condition. The perform- 
ance measures taken on the 100 subjects were 
factor analyzed separately for each condition. 

The results of the two factor analyses re- 
vealed a Task Prominence factor that was 
highly stable across conditions; it was identi- 
fied as a fusion of the Activity and Task Ability 
factors proposed by Carter (1954) and Bales 
(1956). A factor, labeled Likeability, showed 
consistent loadings from postmeeting meas- 
ures of popularity, but highly inconsistent 
loadings from the behavioral measures that 
shared in its definition. Some implications of 
the results for a more complex interpretation 
of Likeability and the dynamics of popularity 
than the one suggested by Carter and Bales 
were presented. A factor of low stability, 
labeled Tension, was extracted under each 
condition. Finally, independent behavior fac- 
tors were formed by measures of either agree- 
ment and disagreement or laughing and joking, 
depending on which behavior was not related 
to Likeability under that condition. 


REFERENCES 


Bares, R. F. Interaction process analysis: A method for 
the study of small groups. Cambridge, Mass.: 
Addison-Wesley, 1950. a j 

Bares, R. F. Task status and likeability as a function 
of talking and listening in decision-making groups. 
In L. D. White (Ed.), The state of the social sciences. 
Chicago: Univer. Chicago Press, 1956. Pp. 148- 
161. 

Borcarra, E. F., COTTRELL, L. S., & Mann, J. H. The 
spectrum of individual interaction characteristics: 
An inter-dimensional analysis. Psychol. Rep., 1958, 
4, 279-319. A 

Carrer, L. F. Recording and evaluating the per- 
formance of individuals as members of small 
groups. Personnel Psychol., 1954, 1, 477-434. 

Horrman, L. R. Homogeneity of member personality 


682 Ricnarp D. Mann 


and its effect on group problem-solving. Unpub- 
lished doctoral dissertation, University of Michi- 
gan, 1957, 

Kaiser, H. V. The varimax method of factor analysis. 
Unpublished doctoral dissertation, University of 
California, 1957. 

Marr, N. R. F. Principles of human relations. New 
York: Wiley, 1952. 

Mann, R. D. The relation between personality charac- 


teristics and individual performance in small 
groups. Unpublished doctoral dissertation, Uni- 
versity of Michigan, 1959. 

Parsons, T. The social system. Glencoe, Ill.: Free Press, 
1951. 

SLATER, P. E. Psychological factors in role specializa. 
tion. Unpublished doctoral dissertation, Harvard 
University, 1955. 


» (Received June 20, 1960) 


Journal of Abnormal and Social Psychology 
1961, Vol. 62, No. 3, 683-685 


CRITIQUE AND NOTES 


SEX, ETHNICITY, AND PLAY PREFERENCES OF PRESCHOOL CHILDREN 


BOYD R. McCANDLESS axp JUNE M. HOYT" 
Towa Chilé Welfare Research Station 


N the literature concerned with ethnocentrism, 
I few studies have been concerned with preschool 
age children. Moreno (1934) and Criswell (1937), 
using sociometric techniques, found no evidence 
of ethnocentric cleavage until about 8-10 years of 
age. Lambert and Taguchi (1956) suggest that 
such cleavage exists in preschool children, but is 
not demonstrated in the kinds of choice required 
by the standard sociometric techniques. They 
devised choices that they assumed were more 
meaningful to young children, and found that their 
measure elicited ethnocentric choices in Oriental 
children in a Montreal preschool, but that the 
comparable trend among Occidental children was 
not statistically significant. Among possible ex- 
planations, the authors considered that the Ori- 
ental children, as a minority group, perceive more 
threat in a mixed racial situation than do the 
Occidentals, and hence cling more closely together. 

In an intensive study of race awareness in Negro 
and white children of preschool age, Goodman 
(1952) found that cross-racial interactions of 
children did not differ in frequency from that 
which would be expected by chance, although 
interactions of white with white were 8% below 
chance, while those of Negro with Negro were 9% 
above chance, Goodman concludes: 


The high rate of Negro-Negro interaction and the 
low rate of white-white interaction must therefore result 
from the operation of variables of a nonracial order. 
We conclude that these variables are personality traits 
which are unequally distributed between the Negro and 
the white children in this particular group (p. 259). 


_ In the present study it was hypothesized that, 
in a situation where social mixing of races is com- 
mon, preschool children show little tendency 
toward ethnic cleavage. Choice of play companions 
in a free play situation was assumed to be an inter- 
action, meaningful to the child, in which this 
hypothesis could be tested. 


1 The authors are particularly indebted to Hannah 
Lou Bennett and Carolyn B. Bilous for providing the 
opportunity to conduct this study; to Betty Crooker 
and Mary Rushit for opening their preschool groups 
to the senior author; and to Mark Beals for his help in 
establishing reliability of observations. The basic data 
were collected while the senior author was 4 Carnegie 
Visiting Professor at the University of Hawaii. 


METHOD 


Subjects were 33 children of Oriental (non-haole) 
ethnic origin and 26 Caucasian (haole) children at the 
University of Hawaii Preschools. The age range was 
from 3 years 6 months to 5 years 3 months, with a mean 
age of 4 years 5 months. All except one child had been 
in the preschools for at least 6 months, this exception 
being the senior author’s youngest child, who was a 
newcomer to her group. There were 17 boys and 16 
girls in the non-haole group and 11 boys and 15 girls in 
the kaole group. Three subjects were eliminated: one 
girl because she did not fall into either of the ethnic 
groups under study, one blind child who did not social- 
ize with anyone, and one mentally defective boy. 

The children were observed in free play in their 
regular preschool groups, the ethnic and sex composi- 
tion of the three groups being approximately the same. 
The time sampling method employed in making ob- 
servations consisted of 15 5-minute observations for 
each child, during which time the duration and type of 
his interactions with other children were recorded. 
‘Three observers studied the children over a period of 
approximately 3.5 months, Percentage of exact agree- 
ment for Observers A and B was 92; and for A and C, 
93. A tabulation was made of total time spent with 
every child with whom the child under observation 
interacted during the observation periods, and total 
time was computed for interactions with children in 
each of the two ethnic groups. 

The chance expectancy for time spent playing with 
haole and non-haole children was computed for each 
preschool group on the basis of the ethnic proportion 
in the group. For example, if a group is made up of 11 
haole children and 10 non-haole children, the chance 
expectancy for a given kaole child for each ethnic group 
is 50% of his time. Tf he spent a total of 60 minutes in 
free play, it would be expected that 30 of these were 
spent with other haole children, and 30 with non-haole 
children, From this base, discrepancy scores were 
computed. For our hypothetical haole child, if he spent 
45 minutes playing with other haoles, the discrepancy 
score would be +15. The same procedure was used to 
study free play as related to sex. 7 A 

Comparisons were made, using a chi square analysis, 
of non-haole and kaole children’s choices of play com- 
panions based on this discrepancy score. The mean 
discrepancy was determined for each ethnic group, and 
separate comparisons were made within each group by 
sex of child. Comparable analyses were made for choice 
of companion by sex without regard to ethnic group. 


RESULTS 


The chi square analysis, Yates’ correction, 
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TABLE 1 
MEAN Toe AND STANDARD DEVIATIONS IN MIN- 
UTES SPENT AND PREDICTED BY CHANCE BY 
DESIGNATED GROUPS IN PLAY WITH EACH OTHER 


Actual Time | Chance Tapec- 
uM SD “M SD 
Boys with boys 114.6 | 42.9 | 72.9 | 22.8 
Boys with girls 46.4 | 30.1 | 88.1 | 23.1 
Girls with boys 42.8 | 27.2 | 63.3 | 28.8 
Girls with girls 82.7 | 33.8 | 62.1 | 25.0 
Orientals (non-haole) 91.1 | 49.6 | 78.6 | 34.6 
with non-haole 
Non-haole with Cau-| 53.8 | 24.7 | 66.2 | 26.3 
casian (kaole) 
Haole with haole 63.4 | 26.6 | 56.8 | 15.6 
Haole with non-haole 75.7 | 39.2 | 82.3 | 27.7 
TABLE 2 


Mean Tme IN Minutes BEYOND CHANCE EXPEC- 
TANCY SPENT IN OBSERVED PLAY WITH 
COMPANIONS INDICATED 


Mean 
Play Companions pate SD t 
Chance 
Without regard to ethnic group: 
Boys playing with boys 41.7 29.3 
Girls playing with girls 20.5 15.9 
Total difference from expectancy (boys | 30.6 25.5 |10.10° 
plus girls) 
Difference between boys and girls 21.2 3.44* 
Without regard to sex: 
Non-haoles playing with non-haoles 12.4 24.5 
Haoles playing with haoles 6.6 25.3 
‘Total difference from expectancy (boys | 9,2 | 25,11 | 2.87* 
plus girls)  ” 
Difference between non-haoles and| 5,8 +88 
haoles . 
Ethnic group by sex: 
Non-haoles playing with non-haoles 
Boys 15.4 28.3 
Girls 9.3 18.3 
Difference 6.1 -10 
Haoles playing with haoles 
Boys 11,9 33.2 
Girls 2.7 16.3 
Difference 9.2 -88 


* p= <,01 for 57 df. 


demonstrates that, disregarding ethnic group, boys 
overchoose boys as play companions, and girls 
overchoose girls (x? = 32.9; for p = .01, 1 df, 
6.635 required). There was also an ethnic cleavage 
in choice of play companion (x? for 1 df = 6.9). 
Tn other words, boys played more with boys, girls 
with girls, disregarding ethnic group; and, dis- 
regarding sex, haoles played more with haoles, 
non-haoles with non-haoles, 
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Table 1 summarizes the distribution of play 
times by subgroups, and Table 2 shows the mean 
time beyond chance expectancies associated with 
these choices. The sex cleavage was significantly 
more extreme for boys than girls, although both 
spent more than the expected proportion of their 
time playing with children of their own sex. The 
ethnic cleavages did not differ significantly for the 
haole and non-haole children, contrary to the find- 
ing of Lambert and Taguchi (1956), nor did boys 
differ*from girls in this respect. 

DISCUSSION 

Contrary to hypothesis, it was found that 
children in this study differentially selected play 
companions of their own ethnic group, even in an 
atmosphere that, subjectively, would be presumed 
to be more accepting of interracial companion- 
ships than might be expected in the Montreal 
setting of the Lambert and Taguchi (1956) study. 

The statistical analysis revealed a trend that 
had not been suggested to the senior author as he 
gathered the rather intensive observational mate- 
rial for the children. At no time did he hear an 
instance of verbal rejection that included ethnic 
reference, and his subjective judgment was that 
friendships and play contacts were the usual func- 
tions, found in ethnically more homogeneous pre- 
school groups, of personality attributes, play in- 
terests, sex, and accident. In other words, he would 
have predicted the sex difference, but not the 
ethnic difference. 

The authors consider that prejudice is based on 
hostility, and, for this reason, tentatively suggest 
that the ethnic “discrimination” found among 
these children is not prejudice as usually thought 
of, but some sort of “comfort” differential by 
ethnic group. Ethnically different background, in 
the sense of such factors as greater commonality 
of food habits within the ethnic group, colloqui- 
alisms drawn from the original parent language 
(such as the terms used for toileting), perhaps 
subtly different play experiences and interests, 
may operate to make the kaole youngster feel more 
comfortable with other kaoles (he has “more in 
common with them”). Similar factors may operate 
for the non-/aoles. In other words, thildren in one 
ethnic group do not actively avoid members of the 
other group, but differentially approach the mem- 
bers of their own group. 

Informal checking over the families and their 
patterns of out-of-school social interaction indi- 
cated that neither the non-haole nor kaole parents 
in this Honolulu metropolitan area associated 
selectively with each other (i.e., the kaole parents 
knew the non-haole parents as well as they did the 
haole, and did not appear to have selective out-of- 
school association with them by ethnic group; and 


a). It is probable, however, even in cos- 
n Honolulu, that, over-all, the out-of- 
lay and visiting contacts of non-haole 
sters are more frequently with other non- 
"and likewise for the kaole children. This 
bly contributes to the “comfort” or “com- 
” factor previously mentioned. 


SUMMARY 


D 
eschool children of Oriental (non-haole) and 
a (haole) background were observed, 
time-sampling technique; and their choices 
companions were analyzed. Children in 
ethnic groups were found to prefer play 

ions of their own sex, this tendency being 
ronounced in boys than in girls. Haole 

spent more of their time with kaole chil- 
d non-haole children with non-haole children 
ould be predicted on the basis of propor- 
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tional representation of each of the two groups. 
The subjective impressions of the senior author 
were that this cleavage was not due to prejudice 
in any conventional sense, but possibly to differ- 
ential “comfort” due to more common backgrounds 
among the members of a given ethnic group. 
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A COMPARISON OF MENTAL RETARDATES AND NORMALS 
ON KINESTHETIC FIGURAL AFTEREFFECTS 


HERMAN H. SPITZ anp RONALD S. IIPMAN 
Edward R. Johnstone Training and Research Center, Bordentown, New Jersey 


previous experiment (Spitz & Blackman, 
1959) indicated that mentally retarded 
males exhibit a poorer capacity for visual satia- 
tion, as measured by a visual aftereffect 
(VAE) test, than do normal males of approxi- 
mately equal CA. Since there is no correlation of 
individual scores on VAE and kinesthetic after- 
effect (KAE) tests, at least with college students 
(Spitz & Lipman, 1960), the question is posed of 
whether the group differences found in visual 
satiation also hold in kinesthetic satiation (Kéhler 
& Dinnerstein, 1947), Furthermore, use of a KAE 
test provides a control for the possibility that the 
results of the previous study were due in part to 
the retardates’ inability to fixate steadily. Female 
subjects, not included in the previous experiment, 
were included in the present study. 


METHOD 


As shown in Table 1, the average CA of the retarded 
group, all residents at a state training school, approxi- 
mated the average CA of a comparison group of college 
sophomores, 

The KAE test apparatus was similar to the one 
described by Klein and Krech (1952). On the subject’s 
right was the variable test (T) block on which the posi- 
tion of a rider could be read to the nearest 34 inch. 
The comparison T block on the subject’s left was 1.5 
inches wide and could be quickly replaced by the 
Inspection (I) block, which was 2.5 inches wide. 

The task was explained to the subject, who stood 
between the T blocks, gripping them with the forefinger 
and thumb of his left and right hands. Starting from 
random positions alternately below and above the 
point of objective equality (POE), the subject found 
the place on the right hand variable T block which 
felt as wide as the left hand comparison T block. He 
was given four “feedback” practice trials. That is, 
after each judgment, his hand was placed at POE. 
This procedure was introduced in an attempt to reduce 
variability, particularly in the retarded group. Next, 
he was asked to make four more equality judgments, 
this time without feedback. These judgments were 
averaged to yield a Control score, Then, with the same 
two fingers of his left hand, he rubbed the larger I block 
for 1.5 minutes, in approximate time to a metronome 
set at 108 beats per minute, after which the original 
T block was quickly returned for the subject to make 
four post-inspection equality judgments. The average 


1 The authors are most grateful to Marshall P. Smith, 
Chairmgn of the Psychology Department of Trenton 
State College, for his cooperation during this study. 


of these four post-inspection judgments subtracted 
from the Control score yielded a Satiation score for 
each subject. 

During the experiment, subjects wore goggles which 
blocked the lower half of the visual field. Whereas the 
normal subjects put on the goggles before the practice 
feedback trials, the retardates were not asked to wear 
them until after the feedback trials. Care was taken, 
however, to keep them from looking down at their 
judgments. In any case, all subjects wore the goggles 
for the Control Measurements and until the experiment 
ended. 

The reliability of individual scores on the KAE test 
used here has been shown to be significant (Lipman & 
Spitz, 1959; Spitz & Lipman, 1960), although the 
reliability coefficient is much lower for retardates than 
for normals (.34 as compared to .74). 


RESULTS AND DISCUSSION 


Since there were no significant sex differences, 
within groups, in mean Satiation scores, the com- 
bined group of normals was compared with the 
combined group of retardates, As shown in Table 
2, the mean Satiation score of the normals was 
significantly larger than that of the retardates, 
confirming the previous finding with VAE. How- 
ever, whereas on the VAE test, retardates were 
significantly more variable than normals, no such 
differences in variability were found in the present 
study. 

Although, as noted, sex differences within the 
groups were not significant, the females of both 
groups had lower Satiation scores than the respec- 
tive males in line with the trend reported by 
Wertheimer (1954a, 1954b) and by Spitz and Lip- 
man (1960). On the other hand, the present re- 
sults may be considered inconsistent with a pre- 
vious finding that female retardates perceive spiral 
aftereffects more readily than retarded males or, 
in fact, than normal females (Spitz & Lipman, 
1959). The spiral, however, is mdre open to sub- 
jective distortion in reporting results and, in any 
case, may not be the most adequate measure of 
satiation. 

It should be noted, also, that on tests of figural 
aftereffects retardates perform more like the brain 
damaged subjects reported by Spivack and Levine 
(1959) than those of the Klein and Krech study 
(1952), 

In summary, the present results with KAE, 
when added to previous results with VAE, indicate 
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CRITIQUE AND NOTES 


TABLE 1 
CHRONOLOGICAL ACE DISTRIBUTION OF RETARDED 


AND NORMAL GROUPS, WITH IQ DISTRIBUTION 
oF RETARDED GROUPS 
— 
Mean 
N | Age Range | Me Range 
(yr) 10 
Retarded Males 35 | 19.30 | 17.08-22.42| 70.34 | 56-86 
Normal Males 32 | 20.28 | 18.83-22.67] © 
Retarded Females | 20 | 18.83 | 15.92-21.83] 70.90 | 57-80 
Normal Females 50 | 19.58 | 18.58-20.75 P 
TABLE 2 


DIFFERENCES BETWEEN MEAN SATIATION SCORES OF 
NORMALS AND RETARDATES 


N M | SD t ? 
Normals 82 | 1.59 | 1.95 
Retardates oF | tas | 1.91 | 2-28 | $208 


that mentally retarded subjects are characterized 
by weaker satiation processes than are normals of 
comparable CA. 
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OPERANT CONDITIONING OF VERBAL BEHAVIOR OF 
DYSTHYMICS AND HYSTERICS 


DONNA M. GELFAND anp C. L. WINDER! 
Stanford University 


YSENCK (1955) has postulated that neurotics 
E of the dysthymic type should condition more 
readily than neurotics of the hysteric type. 
This hypothesis was given support by Franks 
(1956) in a study of GSR and eyeblink condition- 
ing. Franks concluded, in addition, that condition- 
ability is related to extraversion-introversion. The 
present experiment investigated the possible re- 
lationship between conditionability of verbal be- 
havior and type of psychopathology. The hypothe- 
ses are that dysthymics condition more readily and 
stably and therefore make more conditioned re- 
sponses in an acquisition and a countercondition- 
ing series than do hysterics. 


METHOD 


The subjects consisted of 26 female inpatients at a 
state mental hospital. Ten subjects fell into the dys- 
thymic category, while 16 were classified as hysterics. 
Hysteric subjects were described clinically as having 
one or more of the following characteristics: conversion 
symptoms, hysteria, psychopathic personality, or some 
similar type of character disorder. Hysterics tended to 
exhibit histories of poor impulse control and antisocial 
behavior. Dysthymic subjects displayed one or more 
of the following primary characteristics: manifest 
anxiety, reactive depression, or obsessive-compulsive 
features. Two judges, independently categorizing the 
subjects as dysthymic or hysteric, agreed on 22 of 25 
cases (exact chi square, p = .003). 

Guilford’s R scale (1940) was used in this study asa 
measure of extraversion. The Taylor Manifest Anxiety 
(MA) scale, MMPI scales L, F, K, and Edwards 
(1957) Social Desirability scate were also administered. 
The personality inventories were given prior to the 
conditioning procedure. 

One hundred sixty commonly used verbs were 
randomly selected, and each was typed on a separate 
3X 5 white index card. All verbs were in the past 
tense. Six pronouns—I, We, He, They, She, and You— 
were placed below each verb with their respective orders 
randomized. The subject was shown a sample card and 
instructed to make up a sentence containing the verb 
indicated on the card and beginning with any one of 
the pronouns. The subjects was asked to construct one 
such sentence for each of the 160 cards. 

Operant level of responding was determined in the 
first 20 trials. In the following 60 acquisition trials, the 
experimenter said, “Good,” in a flat, unemotional tone 


1This study, done at Agnews State Hospital, 
Agnews, California, was possible only because of the 
warm cooperation of the hospital staff. Jerry S. Wiggins 
= very helpful with suggestions regarding the investi- 
gation. 


at the end of any sentence which the subject began with 
I or We. The last series of 80 card presentations con- 
situted the extinction or counterconditioning phase, 
during which reinforcement was shifted to He and They 
responses. No subject reported awareness of the purpose 
of the experiment. 


REsULTS 


The two groups were not significantly different 
in operant level of responding as measured by the 
number of first person pronouns given in the first 
20 nonreinforced trials. In the 60-trial acquisition | 
series, dysthymics produced significantly more 
first person pronouns than hysterics (¢ = 1.98, p = 
.05, one-tailed test). During the extinction series, 
the dysthymic group continued to give more first 
person responses than hysterics (¢ = 1.71, p = 
-05, one-tailed test). Hysterics obtained signifi- 
cantly higher scores on the Edwards Social Desira- 
bility scale than dysthymics (¢ = 2.20, p = .05, 
two-tailed test). The samples did not differ sig- 
nificantly on the Guilford R scale, the MA, or 
MMPI scales L, F, and K. 


Discussion 


Results of this study indicate it is possible, on 
the basis of symptomatology, to choose groups 
which differ in conditionability of verbal behavior. 
However, it is not yet possible to define, in terms 
of scores on psychometric tests, the relevant 
Psychological variables upon which such groups 
differ, The findings do not support the relationship 
between extraversion and resistance to condition- 
ing postulated by Franks (1956) and Eysenck 
(1955). The only variable, other than condition- 
ability, on which the groups differ is social desira- 
bility. There is, however, no apparent relationship 
between ease of conditioning and social desira- 
bility (r = —.054). : 

Other investigators have found relationships 
between conditionability of verbal behavior and 
such variables as compliance (Sarason, 1957) and 
dependency-anxiety (Cairns, 1959). Investigation 
of possible relationships between these variables 
and symptomatology characteristic of dysthymics 
and hysterics seems important for future research. 
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PERCEPTUAL SELECTIVITY, MEMORY, AND ANTI-SEMITISM 


LEE PULOS 
Larue Carter Memorial Hospital, Indianapolis, Indiana 


‘HAT persons who score as anti-Semitic on 

objective scales are able to discriminate 
fairly accurately between photographs of Jews 
and non-Jews has been demonstrated by 
Allport and Kramer (1946), and Lindzey and 
Rogolsky (1950). They also noted that persons 
scoring as anti-Semitic tended to identify more 
photographs as Jewish than those who scored low 
in anti-Semitism. Their findings, however, have 
been extensively qualified and questioned on the 
basis of group composition of the photographs and 
experimental controls (Carter, 1948; Elliot & 
Wittenberg, 1955; Scodel & Austrin, 1957). 

Seeleman (1940) noted that those who indicated 
unfavorable attitudes toward Negroes were sig- 
nificantly less able to discriminate among a group 
of photographs of Negroes than those with Positive 
attitudes, 

Consideration of these studies suggests a need 
for further evaluation of the hypothesis relating 
perceptual accuracy to prejudice, plus examination 
of the additional hypothesis that once the in- 
Sroup-out-group distinction is made, there is no 
further need to distinguish among members of the 
out-group, 

The present investigation was thus conducted 
to verify first, the relationship between anti- 
Semitism and perceptual accuracy, and second, the 
possibility of differential memory for persons scor- 
ing high and low in anti-Semitism on photographs 
initially identified as Jewish or non-Jewish. In the 
latter case, Seeleman’s (1940) study suggests that 
a group scoring high in anti-Semitism, if they are 
able to identify accurately photographs of Jews 
and non-Jews, should be less able than persons 
scoring low in anti-Semitism to remember those 
photographs defined initially as Jewish. 


METHOD 
Subjects 


Two hundred and twenty male undergraduate 
students at the University of Denver served as subjects. 
Following administration of the anti-Semitism scale of 
Levinson and Sanford (1944), the 40 students who 
scored highest and the 40 who scored lowest on this 
scale were selected for further study. The range of 
scores for the entire 220 subjects was from 52-286. The 
mean of the scores for the low anti-Semitic group was 
68 and for the high group, 208. Both groups were 
equivalent in age, the low group having a mean of 21.2 
years and the high group a mean of 21.1 years. 
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subject, who was then requested to identify the photo- 
graphs that had been included in Set I. 


high and low groups on both identification an 


BERNARD SPILKA 


2 University of Denver 


Materials 

The photographs to be identified were selected 
yearbooks of the University of Denver. The g 
affiliation “and ethnic identity of each person 
established by checking personal history and r 
preference records in the Registrar’s office. All pic 
were of male seniors in cap and gown. This proce 
was a departure from earlier studies and was emp! 
to restrict possible cues employed in identificat 
of Jews and non-Jews, and also to provide an el 
of uniformity for all the photographs presented. 
judges rated each photograph on a five-point scali 
represented the degrees of Jewish and non-Je 
appearance of the photographs. Scale values were 
puted for each photograph, and all photographs 
were consistently rated as extreme or were oth 
easily distinguished and remembered were remo 
Of the original 82 photographs employed, 20 ph 
graphs of Jewish and 20 of non-Jewish persons ¥ 
selected. These were then broken into three sets 
follows: 

Set I: Mixed Jewish and non-Jewish (10 of é 

matched as to scale value). 

Set II: New set of Jewish faces (10) plus the Je 
ish faces employed in Set I. (Photog 
in both sets were matched individually 
scale values.) 

Set III: New set of non-Jewish faces (10) plus 
non-Jewish faces employed in Set I. 
were also individually matched in scale yall 

as in Set II, 
Procedure 


Following the selection of the 40 high and 40 
scoring subjects, each subject was individually ci 
tacted and tested on the three sets of photographs. 
order of presentation of all pictures in Set I was carefu 
arranged so that sharp contrasts between faces a 
scale values could not 


Resutts AND DISCUSSION : 
Table 1 presents the comparative data for the 
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TABLE 1 


Summary OF ACCURACY IN IDENTIFICATION AND 
Memory For Low AND HIGH ANTI-SEMITIC GROUPS 


Low anti- ©| High anti- 
Semitic Semitic 
Graa w= | (= 40), 


M SD “M SD 


et identifications of | 4.1 | 1.76 | 6*1 | 1.49 
jews 
Non-Jewish pictures iden- | 1.4 1.09 | 2.1 | 1.46 
tified as Jewish 
Pictures identified as Jewish| 5.5 1.47 | 8.2 | 1.48 
Jewish pictures that were 3.2 | 1.46 | 4.9 | 1.53 
accurately identified in 
Set I and were again ac- 
curately identified in 
Set II 
Non-Jewish pictures that | 6.2 | 1.45 5.4 | 1.73 


were accurately identi- 
fied in Set I and were 
again accurately identi- 
fied in Set IIT 


memory tasks, An analysis of the total number of 
pictures identified as Jewish revealed that the 
High A-S group was significantly more accurate 
than the Low A-S group in identification of Jewish 
photographs (p < .01). The difference between 
the two groups in accuracy of identification of the 
non-Jewish photographs fell short of significance. 
The High A-S group identified significantly more 
photographs as Jewish than did the Low A-S 
group (p < .02), but an analysis of covariance 
revealed that the accuracy in identification of 
Jewish photographs holds when the total number 
of Jewish identifications of non-Jews is controlled 
for (p < .01). The correlation between the accuracy 
scores and frequency of identifying the photo- 
graphs as Jewish was .40 (p < .01, 78 df) thus 
necessitating the use of the above procedure. 

In order to compare the high and low groups in 
regard to accuracy of memory for the Jewish and 
non-Jewish faces, an analysis of variance was 
computed for those pictures which were correctly 
identified in Set I and then again correctly identi- 
fied in Sets II and II. 

Table 2 indicates a significant interaction be- 
tween the groups and pictures, which necessitated 
a further comparison of the subgroups means 
through the use of the Tukey process (Walker & 
Ley, 1953), When the accuracy means (Table 1) 
were corrected for the group and photograph 
effects, the High A-S group obtained a mean ac- 
curacy score of 5.55 on the Jewish photographs 
and 4.30 on the non-Jewish photographs. The Low 
A-S group received an accuracy score of 4.29 
Jewish identifications and 5,54 for the non-Jewish 
identifications. 
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In order for these corrected means to be signi- 
cant at the .01 level, a difference of .38 between 
any two means would have to be demonstrated. 
It may be seen that a difference of 1.25 obtained 
between the photographs within each of the groups. 
Apparently the High A-S group remembered 
significantly more of the Jewish faces than did the 
Low A-S group, however the latter group demon- 
strated more accuracy in memory for the non- 
Jewish faces. 

The results of this investigation relating to 
identification of in-out-group members are clearly 
supportive of the findings of Allport and Kramer 
(1946) and of Lindzey and Rogolsky (1950). The 
second hypothesis relating to memory is contra- 
dicted by the data and suggests the operation of 
different processes in memory for Jews than were 
suggested by Seeleman (1940) in her study of 
memory for Negroes. The findings would initially 
suggest that the High A-S group remembered the 
Jewish photographs significantly more accurately 
than the non-Jewish photographs, in addition to 
demonstrating a significantly higher degree of ac- 
curacy than the Low A-S group in memory of the 
Jewish photographs. The situation, however, is re- 
versed for the non-Jewish photographs and this 
was not expected. Both the High and Low A-S 
groups appear to show differential sensitivity to 
in-out-group distinctions. 


SUMMARY AND CONCLUSIONS 


The present study was conducted to verify first 
the apparent tendency of anti-Semitic persons to 
identify Jews more accurately than do non-anti- 
Semites. Second, it was hypothesized that once an 
anti-Semitic person has distinguished between 
members of the in-group and the Jewish out-group, 
he should have no further need to distinguish 
among the out-group members, and his memory 
for those identified as Jewish should therefore be 


TABLE 2 


ANALYSIS OF VARIANCE FOR Memory OF ACCURATELY 
PICTURES FROM Set I THAT WERE AC- 


S Jpentiriep AGAIN IN SET II AND Ser III 
Source ss |af| MS | F 

PNP real a | anas 
Between subjects in same 179.27| 78 | 2.29 
motel between subjects 187.38 79 | here | sare 
Bpen (ee ange 
Ree within subjects 30-00 Rt 

Total 567.38159 

*p< 0. 
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less accurate than for those identified as non- 
Jewish. 

From a sample of 220 male subjects, the 40 
high and 40 low scorers on the Levinson-Sanford 
anti-Semitism scale were selected, and presented 
with matched sets of photographs of Jewish and 
non-Jewish males in cap and gown from a college 
yearbook, Three sets of photographs were em- 
ployed in a counterbalanced manner. 

The findings were: 

1. The high anti-Semitic subjects were both 
more accurate in identifying Jewish photographs, 
and labeled more of the photographs as Jewish 
than did the low anti-Semitic subjects. 

2. The accuracy of the anti-Semitic subjects 
was not a function of the increased number of 
photographs identified as Jewish. 

3. The anti-Semitic subjects remembered ac- 
curately significantly more Jewish photographs 
than non-Jewish photographs, and also were more 
accurate in identification of the Jewish photo- 
graphs than were subjects in the low anti-Semitic 
group. 

4, The low anti-Semitic group remembered 
significantly more non-Jewish photographs than 
Jewish, and were also more accurate in their 


CRITIQUE AND NOTES 


identifications of the non-Jewish photographs than 
the high anti-Semitic groups. 
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PERCEIVED CHARACTERISTICS OF ELECTED REPRESENTATIVES! 


ROBERT R. BLAKE ann JANE SRYGLEY MOUTON 
ë University of Texas 


OMMUNICATION and problem solving by elected 
C representatives are keys to decision making 
among segments of an institution where direct 
participation by each person is not possible. This 
paper deals with the question, “Are there dis- 
tinguishing characteristics in the direction of 
greater problem solving skill as seen by their 
peers, among those elected to represent their 
group in contrast with those who are not chosen?” 

The hypothesis is that the election of a member 
as a representative is not a random choice but is 
positively related to his degree of perceived social 
tactfulness, intellectual competence, and pro- 
cedural skill. The prediction is that the higher a 
member’s perceived social tactfulness, intellectual 
competence, and procedural skill, the greater the 
likelihood of his election to serve his group as its 
representative. 


METHOD 
Subjects 


The 88 subjects were composed into 11 eight-man 
units, matched for educational and occupational levels, 
for the purpose of studying decision making in groups 
under autonomous conditions (Blake & Mouton, 1960). 
The groups engaged in a sequence of activities centering 
on the examination and study of personal behavior, of 
group action, and of intergroup problem solving. Sub- 
jects were participants in three separate human rela- 
tions training laboratory programs, with three or four 
groups comprising a given laboratory. All subjects 
were from the same organization. 


Sequence of Activities Leading to the Election and 
Interaction of Representatives 


After each group had devoted approximately 12 
hours to ingroup activity over a period of 3 days, it 
was allowed 3 hours to develop its own approach to the 
solution of an assigned problem that involved producing 
a statement of ways to improve labor-management 
relations. Members recognized that their solutions 
would be a measure of their effectiveness as a problem 
solving group. It was understood that solutions would 
be evaluated for quality through interactions of elected 
representatives. As is true in many representation 
situations, here too, group members looked forward to 
the selection of their solution as “best” in a win-lose 
sense. 

Representatives were elected by rank ordering, 


without the opportunity of discussion or electioneering; 


1This study was partially supported by Grant 
M-2447-cl, Behavior of Group Representatives under 
Pressure, National Institute of Mental Health. 


all members from each group were evaluated according 
to the criterion, “Rank order the members of your 
group, including yourself, for serving your group as its 
representative.” 

Solutions were then dittoed and exchanged between 
groups. Approximately 2 additional hours were devoted 
to ingroup discussion of the merits of each. Members 
were urged to increase their understanding of the posi- 
tions of the other groups by noting points of similarity 
and of difference and points needing clarification or 
elaboration. 

At a meeting of the representatives, each representa- 
tive then explained and clarified his group’s solution in 
response to questions raised by representatives from 
other groups for an additional 1.5-2 hours. When 
members reported that additional interaction would 
not contribute to their understanding of the other 
group’s statements, the representatives ranked the 
solutions from “best to worst.” 


Measurement of Personal Behavior of Members 
during Ingroup Problem Solving Activities 


via judgments regarding how each person’s behavior 
had been experienced by ‘all the others according to the 
24 different items shown in Table 1. The ratings were 


in private, and anonymity of raters was insured, There 
was no relationship between these ratings of personal 


a rating for I 
groups, personal behavidr ratings were made prior to 
the training sequence concerned with intergroup rela- 
tions and for the other half following it. Since the timing 
of the personal evaluations reveals no significant differ- 
ences in connection with assessing the perceived char- 
acteristics of representatives, the combined data are 


reported here. 
RESULTS 


The members of each group were divided into 


quartile divisions, according to rank received in 
Data from all 


the election of representatives. a 
groups were combined so that each quartile 
division consisted of 22 subjects. Table 1 presents, 
for each quartile division, the mean rating received 
on the personal characteristic items. The three-fold 
categorization of items is on @ prior! grounds. 
Results in Table 1 can be evaluated from two 
points of view. One is the regularity of changes 
' from the highest to the lowest quartile division. 
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TABLE 1 


PERCEIVED CHARACTERISTICS OF INDIVIDUALS IN 
Four Quartz Divisions FROM Most (ist) TO 
Least (4th) PREFERRED AS REPRESENTATIVES 


uartie Dirken 
Type and Characteristic 
4th | 3rd /2nd | Ist | ¢* 
Procedural Skill 
‘Takes the lead in selecting topics 3.8] 5.6] 6.2] 6.8/6.66 
Offers constructive evaluation as | 4.8] 5.7] 6.8) 7.1/5.34 
needed 
Helps get to the meat of issues 5.1) 6.2} 6.8) 7.1/5.26 
Encourages group to high-level pro- | 4.5| 5.6] 6.4] 6.8/4.69 
ductivity 
Provides helpful objective feedback to | 5.0] 6.0| 6.6) 6.7|4.05 
members 
Provides good summaries when needed | 4.0] 5.2) 6.2| 6.7/3.86 
Helps group stay on target 5.2] 5.9] 6.5) 6.8/3.41 
He likes to wander 3.3) 3.5} 3.2) 2.9)1.42 
Blocks the group 2.4) 2.5) 2.4) 2.2) .80 
Intellectual Competence 
Offers original ideas 4.7) 5.9) 6.7) 7,116.85 
Helps members express their ideas 4.4] 5.4] 6.2) 6.6)5.41 
Expresses himself clearly and concisely | 5.9] 6.4] 7.0] 7.2|3.82 
Sticks to his point arbitrarily 3,1) 3,5) 3.5] 3.3] .46 
Social Tact 
Dominates and imposes his will on the | 1.9] 2.6] 2.9] 3.5/4.10 
group 


Runs away when faced with problems | 3,2] 2.8] 2.3) 2.2)3.44 
Yields to group pressures—conforms | 5.4] 4.7] 4.4] 4.0/3.41 
Annoys others . .5|2. 
Levels with other members 6.7| 7.1) 7.4| 7.41.32 
Effectively senses when to talk and | 6.3| 6.3| 6.7| 6.81.25 

listen 


Makes others feel at ease 6.7| 6.8| 7.1| 7.0) .81 

He fights rather than works 2.3) 2.6| 2.5| 2.6| .53 

Listens with understanding to what | 7.3| 7.2| 7.4| 7.4| .39 
others say 


‘He shows that he likes us 
Contributes without cutting others off 
ee 
® Between first and fourth quartile divisions, 
o 


Progressive increases or decreases from the first to 
the fourth quartile division provide an indication 
of the stability of the relationships obtained, In 
addition, #’s have been computed between the 
first and fourth quartile divisions to indicate the 
significance of differences between extremes. Note 
that all items for which the ?’s are significant also 
are characterized by consistent upward or down- 
ward trends from the first to the fourth quartile 
divisions. 

The results confirm the hypothesis, clearly 
showing that members who are most preferred by 
their peers as representatives are distinguished 
from those who are not chosen in terms of being 
Seen as possessing greater intellectual competence 
and procedural skill. They do not support the view 
that highly preferred representatives are perceievd 
to possess greater social tactfulness than members 
who are unchosen, 
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Procedural Skill 


Seven of the nine items evaluating procedu 
skills show significant differences between m 
and least preférred representatives. In each 
changes from the first quartile division to 
fourth represent a steady upward trend. Five 
are concerned with skill in aiding a group to fu 
tion effectively at the task level, such as taking 
lead in selecting topics, helping to get to the a 
of igsues, helping the group to stay on target, 
marizing when needed, and encouraging high- 
productivity. The remaining two significant i 
are concerned with giving constructive evaluati 
and providing objective feedback. Both deal 
aspects of group development that help mem 
steer themselves more effectively; that is, 
involve skill in giving reactions regarding hoy 
person feels or how others are affecting him. 

The two items that fail to distinguish the 
treme groups are concerned with blocking — 
group and with wandering off the topic. Both 
actions that interfere with the group effectiven 


Intellectual Competence 


Three of the four items measuring perceive 
intellectual aspects show progressive increase, wil 
differences in all cases significant far beyond 
01 level. For “offering original ideas” the diff 
ence between extremes is the most significant 
the study. The other two deal with clarity in 
munication. The item that does not differentiate 
significantly is related to sticking to one’s 0 
point in an arbitrary manner, also a behavior tl 
blocks group action. 


Social Tactfulness 


Only 3 of the 11 tactfulness items disting 
degrees of preference for representative beyo 
the .01 level, and the findings for even those I 
are counter to what would be expected if electi 
as a group representative were thought of as 
“popularity” contest. Though the absolute le 
is low for all quartile divisions, highly chos 
Tepresentatives are seen as dominating and ii 
posing their personal will on the gtoup more í 
the less frequently chosen. The other two signifi- 
cant items can be understood in the same manner, 
Highly chosen persons are seen to resist group 
pressures toward conformity and to avoid running 
away when faced with problems. 

Items failing to discriminate also are counter t0 
expectation, for they are kinds of behavior th 
frequently are thought of as increasing an in 
vidual’s personal attractiveness and therefore . 
Prospects of being chosen as a representa' 
Interestingly enough, highly chosen members ti 
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almost significantly, to be seen as more annoying 
than are those who are not preferred as representa- 
tives! 


e 
DISCUSSION 


° 
Much decision making within democratic in- 
stitutions is through actions of persons elected by 
others to speak for them. Yet little is known sys- 
tematically regarding the psychological processes 
involved in the election of a representative. We 
have failed to locate a single experimental study 
concerned directly with evaluating the personal 
characteristics associated with group preference of 
one individual over another as a representative. 

The underlying theory of representational de- 
mocracy presumes that group members elect to 
office those most capable of performing functions 
of representation. The present study confirms one 
aspect of this assumption by showing that persons 
preferred as representatives are seen as intellectu- 
ally more capable and as being more skillful from 
a procedural point of view. The hypothesis is not 
confirmed in the area of social tactfulness as or- 
dinarily understood. If anything opposite findings 
are obtained. Representatives are seen as being 
stronger in the sense of dominating group actions, 
of resisting conformity pressures, and of facing up 
to issues. Except for the items concerning tactful- 
ness, the results establish the validity of the hy- 
pothesis at the perceptual level. 

A logical second step now needs to be taken. 
The question is, “Are persons who are preferred 
as representatives actually more able under inter- 
group problem solving conditions to get their 
points of view effectively understood in interaction 
with other representatives?” 
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SUMMARY 


The present study deals with the perceived 
personal characteristics of members who are pre- 
ferred by their peers to serve as spokesmen in 
decision making interaction with representatives 
elected by other groups. Eighty-eight subjects 
composed into eight-man groups spent approxi- 
mately 12 hours in autonomous ingroup activities 
prior to intergroup problem solving through repre- 
sentatives. An assigned problem served as the basis 
for creating differences between groups. Each 
group rank ordered all its members with respect to 
preference for a representative. Members also 
evaluated each other on 24 items of personal be- 
havior categorized as measuring social tactfulness, 
intellectual competence, and procedural skill. 

Results show clear-cut differences and are in- 
terpreted as supporting the hypothesis that mem- 
bers do elect as representatives those whom they 
see as possessing superior procedural skill and in- 
tellectual competence. They do not bear out the 
part of the hypothesis concerned with social tact- 
fulness. Persons elected as representatives are nol 
more outstanding than members least preferred in 
terms of perceived positive social—emotional 
characteristics such as helping, harmonizing, or 
making others feel liked or at ease. Rather they 
are seen as personally “strong” in the sense of 
dominating the group, resisting conformity pres- 
sures, and facing up to problems rather than 
running away from them. 
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THE INTERACTION OF PERSONALITY AND MUSCLE TENSION IN PRODUCING 
AGREEMENT ON COMMONALITY OF VERBAL ASSOCIATIONS 


JAMES L. KUETHE 3 
Johns Hopkins University 


recent study by Kuethe and Eriksen (1957) 
A related verbal stereotypy under stress to 
subjects’ scores on the Psychasthenia (Pt) scale 
of the MMPI. Subjects with low Pt scores showed 
more stereotypy under experimentally induced 
anxiety or muscle tension than did similar subjects 
in a control group. Subjects with high Pt scores, 
however, showed less stereotypy under either 
stress condition than did similar subjects in a con- 
trol group. 

It was suggested that stereotypy represents a 
facilitation of response biases based on past learn- 
ing. This proposition was difficult to evaluate be- 
cause the subjects gave numerical responses. 

The present study had two major purposes. 
One was to replicate the investigation of Pt scores 
and the presence or absence of tension in producing 
verbal stereotypy. The second was to examine the 
role of prior learning in verbal stereotypy. 


METHOD 


The Pt scale of the MMPI was administered to male 
students in an introductory psychology class. For the 
actual experiment, 40 subjects were selected at random 
and divided into two groups. There were 20 subjects in 
a “tension” group and 20 subjects in a “nontension” 
group. All subjects were run in individual sessions. 

Each subject in the nontension group was told that 
the experimenter was interested in collecting data on 
verbal associztions so that he would have a basis for 
comparisons in future studies that he wished to do. 
The subject was then read 50 stimulus words from the 
Kent-Rosanoff list, and the experimenter recorded the 
subject’s association to each word. 

The subjects in the tension group received the same 
treatment as the nontension group with two exceptions, 
These subjects were told that the experimenter was in- 
terested in the effect of muscle tension upon verbal 
associations, Each subject in this group supported a 
weight by pulling down on a lever while giving his 
associations. The weight was heavy enough to require 
tension in arm and shoulder muscles, Subjects indicated 
that the task did not require excessive effort and that 
they could have supported the weight for far longer. 


RESULTS AND Discussion 


A count was made for each of the 50 stimulus 
words to determine the most frequent response 
given by the 40 subjects. The score of an individual 
subject was the total number of these high fre- 
quency associations that he produced. 

Within the tension and nontension groups, sub- 
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jects were classified as having high Pt scores (Pt-+) 
or low Pt scores (P!—). The division was made at_ 
the median Pt score of the subjects within each 
group, resulting in subgroups of 10 subjects each, 

Table 1 presents the mean agreement on com- 
monality scores for the Pi+ and the P!— sub- 
jects in the tension and nontension groups. 

An analysis of variance revealed that the inter- 
action of Pt scores with the presence or absence of 
tension was the only significant source of variance 
(F = 5,45, p < .05 with 1 and 36 df). Under ten- 
sion, Pt— subjects show more verbal between- 
subject agreement than do Pt— subjects in a non- 
tension situation. Tension produces slightly less 
verbal agreement in Pi+ subjects than is shown 
by Pt+ subjects in a nontension situation. 

The results of the present study support the 
previous findings of Kuethe and Eriksen (1957). 
In that earlier study, two types of stereotypy were 
investigated. An individual could show stereotypy 
by not using all of the response categories avail- 
able or through sequential dependencies in the 
generated responses. 

In the present study, the mean agreement on 
commonality score for a group may be regarded 
asa measure of group response stereotypy. To the 
extent that a group has a high mean agreement on 
commonality score, the subjects in the group are 
giving the same responses, These responses have 
probably received a great deal of emphasis in the 
experience of individuals from similar cultural” 
backgrounds. In response to the word “girl,” @ 
subject may say “Jane,” a response that is rela- 
tively idiosyncratic, or he may say “boy,” an 
association learned by most individuals. It is this 
latter type of response that seems to be facilitated 
by tension in subjects with low Pt scores. . 

The earlier study was concerned with variability 
within subjects, whereas the present study focu 
on variability between subjects. In both situations, 
however, the interaction of personality with in- 
duced tension determined the predictability of 
behavior in that it predicted variability. For ex- 
ample, Pi— subjects become more predictable 
(more stereotyped) under tension in that they do 
not respond randomly (Kuethe & Eriksen, 1957) 
and in that their responses increase in agreement 
on commonality. 

The facilitation of strong response biases of 
subjects under tension would be predicted by drive 
theory. But this prediction does not hold for sub- 
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TABLE 1 
MEAN AGREEMENT ON COMMONALITY FOR THE Pit 
AND Pi— SUBJECTS IN TENSION AND 
NONTENSION GROUPS 


Pi- Pit 
Nontension Group 19.9 21.6 
Tension Group 23.8 19.7 


jects with high Pt scores. Thig result is interesting 
because the Pt scale correlates .92 with the Mani- 
fest Anxiety (MA) scale (Eriksen & Davids, 1955), 
and the MA scale is a scale for which there is a 
claim that it measures drive level (Taylor, 1956). 
Since the correlation between the two scales is of 
about the same magnitude as their reliability 
coefficient, it is probable that the instruments are 
for the most part measuring the same thing. 

The present evidence supports the view that 
while the Pt and related “anxiety” scales may 
indicate initial differences in drive level between 
subjects, they also predict behavior in situations 
producing a strong drive. It is interesting that 
subjects with high “test anxiety” behave as though 
they were under relatively high drive when they 
are in a neutral situation but behave as though 
there waa reduction in drive level when they are 
subjected to stress, This latter phenomenon may 
be determined, however, by factors which override 
any simple relation between drive and behavior. 

It may well be a mistake to estimate drive level 
from performance in this instance. It is possible 
that individuals with high scores on anxiety scales 
tend to use defense mechanisms when they are 
subjected to experimental stress that are different 
in kind from the mechanisms activated in low 
scoring subjects. For example, when subjects with 
high Pi scores are under stress, they may become 
vigilant. In the present study this tendency could 
lead Pt-+ subjects to edit their behavior by not 
always giving the first response that occurred to 
them. Such behavior would result in lowered agree- 
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ment on commonality. This would mean that the 
defense mechanisms of the Pt+ subjects would 
prevent the operation of any simple effect of drive 
upon response characteristics. 

Eriksen, Kuethe, and Sullivan (1958) showed 
that subjects with low Pt scores show less aware- 
ness of their behavior than is true of high scorers. 
The absence of conscious interference with be- 
havior in the present situation could permit be- 
havior to be modified by a strong drive in the 
manner that drive theory would suggest. 

The evidence that anxiety scales measure more 
than “amount of anxiety” has been discussed by 
De Soto and Kuethe (1959, 1960). Part of their 
formulation involved the possibility that a low 
score on an anxiety scale may often reflect a trait 
independent of the one indicated by a high score. 
Scale scores are therefore regarded by De Soto and 
Kuethe as a disjunctive function of amount of 
anxiety and the nature of the defense mechanisms 
activated by the testing situation. 
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A FACTOR ANALYTIC INVESTIGATION OF PERSONALITY VARIABLES 
AND REMINISCENCE IN MOTOR LEARNING 
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° 


ONSIDERABLE interest has been aroused by the 

pioneer work of Eysenck (1956b) in both 
experimentally and theoretically relating person- 
ality variables to the phenomenon of reminiscence 
in motor learning. Several additional studies 
(Bendig & Vaughan, 1959; Ray, 1959; Recht- 
schaffen, 1958) have tested deductions drawn from 
this formulation, Briefly, Eysenck has suggested 
that the trait of extraversion, defined by Guilford’s 
Rhathymia and Eysenck’s (1956a) Extraversion 
scales, is positively correlated with the amount of 
reactive inhibition generated in a motor task and, 
consequently, with both the amount of original 
learning and reminiscence shown. Less surely, 
Eysenck also suggests that drive, as measured by 
Guilford’s Cycloid Disposition, Taylor’s Manifest 
Anxiety, and Eysenck’s Neuroticism scales, may 
be positively correlated with motor learning and 
reminiscence. 

Subsequent research has reported inconsistent 
results, Using measures from a reversed alphabet 
printing task, Rechtschaffen (1958) found no rela- 
tionships between scores on the Rhathymia scale 
and either learning or reminiscence, and Bendig 
and Vaughan (1959) reported zero relationships 
between Extraversion and Neuroticism scale scores 
and motor learning. Taylor and Rechtschaffen 
(1959), using learning data from Rechtschaffen’s 
original study (1958), found significant negative 
correlations Between Manifest Anxiety scale scores 
and motor learning. Both Eysenck (1956b) and 
Ray (1959) used a pursuit-rotor task and found 
Extraversion scores to be negatively related to 
learning, but Eysenck reported the scores to be 
correlated with reminiscence while Ray did not. 
Ray also showed that Neuroticism scores were not 
related to either learning or reminiscence, although 
Eysenck reported low but statistically significant 
correlations among the same variables, 

In considering these studies, it appears that 
studies using the pursuit-rotor task have demon- 
strated larger relationships between personality 
factors and learning phenomena than those using 
reversed alphabet printing. These studies have also 
restricted the investigation to one or, at the most, 
two personality traits, typically rhathymia or 
extraversion and neuroticism or manifest anxiety. 
Finally, previous studies have invariably di- 
chotomized subjects on the basis of personality 
scale scores and tested for significant differences 
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in mean learning or reminiscence perform 
rather than investigating the covariation amo 
the measures. The present study was designed 
to ytilize learning and reminiscence measures ft 
the pursuit-rotor task as described by Ray (1959), 
(b) to include a larger number of personality si 
scores by using a multiscale inventory such as 
Guilford-Zimmerman (1949) Temperament 
vey, and (c) to use factor analytic procedi 
investigating the relationships among these v 
ables. 


PROCEDURE 


Subjects were 160 male undergraduate student 

rolled in sections of a one-semester course in i 
ductory psychology. All subjects were individ 
contacted in class and requested to participate, Duri 
the experimental session, the subject was first given 
Guilford-Zimmerman (1949) Temperament Survey. | 
was then run on the pursuit-rotor with the equipmet 
instructions, and procedure identical with those 
by Ray (1959), requiring 5 minutes of contim 
practice, 15 minutes of rest, and 2 minutes of postré 
practice, Each subject read or studied during the 
period. The rest period of 15 minutes was chosen 
maximize reminiscence (Ray, 1959, p. 202.) 
The GZTS answer sheets were scored for all 
scales. Raw scores were used in the subsequent analys 
The variable of Motor Learning (ML) was defined 
the total time on target during the 5-minute ori 
practice period. The measure of Reminiscence 
used was the first 20-seconds of postrest perform 
minus the mean of the last two 20-second prerest t 
This measure of reminiscence was chosen since it g 
slightly more reliable measures of reminiscence than 
the one used by Ray (first 20-second postrest 
minus the last 20-second prerest trial). A ¢ test 
mean difference between the last two 20-second pi 
trials did not approach significance. 

The 12 variables (10 GZTS scores, ML, and R) were 
intercorrelated by the usual product-moment formula 
and six factors extracted from the matrix, using 
complete centroid method. The significant factors 
McNemar’s criterion were rotated to oblique sim 
structure, using the analytic criterion developed 
Pinzka and Saunders (1954).1 


RESULTS AND DISCUSSION ` 


The first four centroid factors were signific 
by McNemar’s criterion. The results of the obliq! 


1 The statistical analysis was accomplished using 
facilities of the University of Pittsburgh Comput: 
and Data Processing Center. 


y 


CRITIQUE AND Notes 699 
TABLE 1 
CORRELATIONS OF GZTS SCALES WITH REMINISCENCE AND MOTOR LEARNING MEAS) 
OBLIQUE REFERENCE VEcTOR MATRIX FOR TOTAL GROUP E A 
a (WN = 160) 
Variable Correlations with Oblique Vector Loadings 
R ML Em(—) SA EI Fr W 
General Activity -03o —04 11 14 —03 - 
Restraint -12 24** 02 —08 59 18 39 
Ascendance e —02 =g 12 51 13 =19 33 
Sociability —03 -1 —08 72 -12 19 58 
Emotional Stability 06 06 50 09 -01 09 27 
Objectivity 12 08 54 02 —02 16 32 
Friendliness —02 14 06 —01 11 56 33 
Thoughtfulness —07 00 —10 03 44 —22 25 
Personal Relations 07 13 08 29 02 50 34 
Masculinity 06 11 49 —22 02 -15 31 
Reminiscence (R) =12 16 —13 -25 -13 12 
Motor Learning (ML) 11 -13 33 11 15 
Direction Cosines —58 05 —58 
Among 05 53 
05 


Reference Vectors 


* b <0, 


rotation process are given in Table 1 along with 
the correlations of the GZTS scales with ML and 
R. The Restraint scale correlated significantly with 
the ML measure (r = .24), but none of the other 
correlations were significant. The four oblique 
factors are the Emotionality (EM), Social Ac- 
tivity (SA), Extraversion-Introversion (EI), and 
Friendliness (Fr) factors identified in previous 
factor analytic investigations (Bendig, 1960a, 
1960b; Bendig & Meyer, unpublished). The ex- 
perimental R and ML measures load primarily on 
the EI factor along with the GZTS Restraint and 
Thoughtfulness scales. The EI factor is inde- 
pendent of the other factors (direction cosines of 
.05, .05, and .05), while the correlations among 
the EM, SA, and Fr factors are appreciable (co- 
sines of —.58, —.58, and .53). 

The negative loading of the R measure and the 
Positive loading of ML on the EI factor support 
Eysenck’s predictions regarding extraversion and 
reactive inhibition, assuming that the GZTS 
Restraint and ‘Bhoughtfulness scales are suitable 
indices of Eysenck’s extraversion dimension. Since 
Eysenck considers the Rhathymia scale of Guil- 
ford’s STDCR battery as the best measure of 
extraversion, and since the Restraint scale was 
derived, with reversed scoring, from the original 
Rhathymia scale, this assumption appears. war- 
ranted. Introverts (subjects high on the Restraint 
scale) appear to show less reminiscence and greater 
motor learning than do extraverts (subjects low 
on the Restraint scale). If one further assumes 
that the GZTS Emotional Stability and Objec- 


tivity scales, like Taylor’s Manifest Anxiety scale 
and Eysenck’s MPI Neuroticism scale (Bendig, 
1960b), are reversed measures of a broad “Emo- 
tionality” factor that Taylor equates with level of 
irrelevant drive, then the suggestion that emotional 
drive level is related to learning and reminiscence 
is unsupported. The loadings of R and ML on the 
EM factor are quite low, and the Emotional Sta- 
bility and Objectivity scales show only low corre- 
lations with R and ML measures. Comparisons of 
the mean R and ML measures of subjects in the 
top and bottom fifth of the distributions of these 
two GZTS scores give no evidence of any signifi- 
cant differences. R 

Personality scale measures of extraversion 
appear to be related to motor learning and remi- 
niscence, but the relationship is not very large. A 
major factor in the relatively low correlations is 
the lack of reliability of the variables, particularly 
the measures of reminiscence. This is evident in 
the correlation of the Restraint scale with R as 
lower than with ML, and in the smaller loading 
on the EI factor of the R measure. This also may 
be why Ray (1959) failed to find extraversion 
scores significantly correlated with R although his 
study also shows a relationship between extra- 
version and ML. Future research should use pur- 
suit-rotor rather than reversed alphabet printing 
as the experimental task, and attention may better 
be concentrated on more reliable measures of 
motor learning rather than unreliable reminiscence 
indices that may confound both reactive and con- 


ditioned inhibition. 
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NEW ZEALAND CHILDREN’S ATTITUDES TOWARDS MAORIS 


G. M. VAUGHAN! anp R. H. T. THOMPSON 
University of Canterbury, New Zealand 


ut Maoris of New Zealand are a promiaent 
ethnic minority numbering approximately 


7% of a total population of nearly 2.5 million. 
However, as they have not yet achieved full inte- 
gration into European culture, instances of social 


discrimination still occur. In‘the larger cities df the 
North Island particularly, difficulties in Maori- 
white relationships have been aggravated by the 
large postwar increase in the Maori population. 

Investigations within this country of whites’ 
attitudes towards Maoris have been few. Fitt 
(1955), using a Thurstone technique, found that 
the ethnic attitudes of students in Auckland were 
unfavorable during the years 1943-1947, but were 
relatively indifferent by 1950, a change that he 
related to the shift of many Maoris to the cities 
during the war years, An analysis carried out by 
Thompson (1953, 1954a, 1954b, 1955) of news 
items concerned with Maori affairs in New Zealand 
newspapers indicated that a high percentage of 
themes implied unfavorable attributes among 
Maoris. For example, Maoris are “lazy,” “irre- 
sponsible,” “ignorant,” and “superstitious.” The 
first use of a picture story technique to measure 
attitudes towards Maoris was made by Collins 
(1954). Thompson (1959) used a similar but more 
refined set of pictures to investigate the attitudes 
of Christchurch University students. He found 
som: evidence of unfavorable attitudes towards 
Maoris, but the results were qualified by the ap- 
parent failure of some subjects to recognize cer- 
tain depicted characters as Maoris. 

The research reported in this paper is an ex- 
ploratory investigation, using a modified Thomp- 
son picture story technique, of the development of 
attitudes towards Maoris among white New Zea- 
land school children between the ages of 8 and 16 
years, 

Since more than 96% of Maoris live in the North 
Island of New Zealand, attention was also given to 
a possible relationship between the amount of con- 
tact with Maoris and the development of unfavor- 
able attitudes towards them. The census figures for 
1956 report that Maoris provide 4.24% of the total 
population (8-16 years of age) in Auckland (North 
Tsland) and 0.56% in Christchurch (South Island). 
Seventy-one percent of all Maori school children 
attend integrated schools throughout the country. 
From these figures, it was reckoned that of the inte- 
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grated schools total population (8-16 years of age) 
in each city, 3.1% in Auckland (high contact, HC) 
and 0.4% in Christchurch (low contact, LC) were 
Maoris. Maori and white children attending the 
same school mix freely to the extent that they work 
together in the same classrooms, play together in 
the same sports teams, and mingle together with- 
out apparent discrimination during play periods. 

Schools were chosen from predominantly work- 
ing class areas in both cities. This restriction not 
only assured comparability with respect to eco- 
nomic status, but also ensured that contact with 
Maoris, who are mainly working class in New Zea- 
land, was derived from mixing in roughly equal 
status groups. This control was considered essential 
in view of the evidence (Allport, 1954) that the 
degree of ethnic prejudice is related to the kind of 
interethnic contact involved. Allport also suggests 
that prejudice may be reduced by equal status 
contact between majority and minority groups in 
the pursuit of common goals. 

The aims of the present research were to clarify 
the relationships between ethnic attitudes in New 
Zealand and the variables of age and contact. The 
hypotheses formulated were: 

1. Unfavorable attitudes towards Maoris in- 
crease with age. 

2. Unfavorable attitudes towards Maoris in- 
crease as the amount of contact with Maoris de- 


creases. 
METHOD 0 


Materials and Subjectse 


‘The experimental material consisted of 10 matched 
pairs of TAT-type cards, depicting themes of play, 
school, neighbors, a friend, sport, a formal and an 
informal social occasion, yocation, law and order, and 
family. One card of each pair made up the experimental 
set, the other the control. Both sets of cards were photo- 
graphed from originals, and they differed only in that 
the crucial figure in the experimental set was Maori, in 
the control set white. Three sample pairs of cards are 
shown in Figure 1. 

The subjects were 120 white children, 60 boys and 
60 girls, in the three age groups, 8, 12, and 16 years, 
selected from Auckland (HC) and Christchureh (LC) 
schools. These two cities were chosen as areas which 
differed markedly in the amount of interethnic contact 
available to the subjects. The HC and LC groups were 
matched on a residential criterion (S-years residence in 
a working class suburb), and a combined 1Q-scholastic 
attainment criterion (“A” stream school classes). The 
20 subjects in each city-age group were allocated to 
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CRITIQUE AND Notes 


A E Z 
control and experimental groups on the basis of a table 


“of random numbers. 


Procedure 

Since there was considerable age range in the total 
sample, and therefore differences in levels of under- 
standing test instructions, it was considered desirable 
to use two different sets of directions. The following 
instructions were given to the 8-year-olds: 

This is not a test. It is a story-telling game, I am 
going to show you some pictures, one at a time, and I 
want you to make up a story about each one. These 
pictures are about life in New Zealand and the people 
in the pictures are New Zealanders. Tell me how the 
people in the pictures feel and think. There are no 
right or wrong answers. You have all the time you 
want. 

The 12- and 16-year-olds were instructed as follows: 

This is a test of creative imagination. I am going 
to show you some pictures, one at a time, and I 
want you to make up a story about each one. These 
pictures are about life in New Zealand and the 
people are New Zealanders. Tell me what has been 
happening in each picture, what is happening now, 
and then how it will turn out, Tell me how the people 
in the pictures feel and think. Use your imagination 
to the limit. There are no right or wrong answers. 
You have all the time you want. 

Great care had to be exercised with the 8-year-old 
‘subjects to ensure adequate rapport, and sometimes it 
was necessary to ask them to provide a start or finish 
to their stories. ` 

The scoring formula was similar to that used by 
praon (1950), who reports reliabilities ranging from 

.94 to 0.99 for this method. The stories were rated on 
the basis of four criteria: 

1. The subject’s reaction at the outset to the general 
Setting in the picture. 

2. The subject’s estimate of the test figure’s reaction 
to this situation. 

3. Judgment by the subject of the test figure—e.g-, 
whether he is esteemed or despised. 

4, Outcome of the story for the test figure. 
Responses were rated on a three-point scale as generally 
favorable, neutral, or unfavorable for each of the four 
criteria, 


RESULTS AND DISCUSSION 


The frequencies of favorable, neutral, and un- 
favorable respofises made by control and experi- 
mental subjects at three age levels, in both HC and 
LC groups are shown in Table 1. Unfavorable re- 
Sponses indicate the degree of conflict, favorable 
Tesponses the degree of cooperation recognized in 
the pictures by the subjects. Control subjects made 
their judgment when a white was the test figure, 
experimental subjects when a Maori was the test 

re. 


___ The results in Table 1 indicate that both HC and 


LC experimental groups made fewer favorable re- 


| Sponses, more unfavorable responses, and more 


703 


TABLE 1 
FREQUENCY OF UNFAVORABLE, FAVORABLE, AND 
NEUTRAL Responses To CONTROL AND EXPERI- 
MENTAL CARDS, IN RELATION TO AGE AND Op- 
PORTUNITY FOR INTERETHNIC CONTACT 


(Data based on 10 subjects in each cell) 


High Contact Subjects | TOY Contact 
Responses 
‘Age Level |Controt Experi: |Control [Expertis 

Unfavorable* 8 yrs. 29 59 42 59 
12 yrs. 97 | 138 | 112 | 123 

16 yrs. | 107 | 134 | 126 | 213 

Favorable 8 yrs. | 183 | 140 | 158 | 129 
12 yrs. | 182 | 135 | 194 | 170 

16 yrs. | 177 | 147 | 175 86 

Neutral 8yrs. | 188 | 201 | 200 | 212 
12yrs. | 121 | 127 | 94 | 107 

16 yrs. | 116 | 119 99 | 101 


a The percentages of unfavorable responses to control and 
experimental cards were compared by Zubin’s £ test as follows: 


t b 

HC Subjects 

8 yrs. 3.53 <,001 

12 yrs. 2.83 <. 

16 yrs. 2.12 <.05 
LC Subjects ` 

8 yrs. 2.12 <.05 

12 yrs. 1.41 ns 

16 yrs. 7.78 <,001 


neutral responses than the respective control 
groups. Howeyer, because of considerable depend- 
ence between favorable and unfavorable measures, 
the statistical treatment of the datatin Table 1 is 
limited to the percentages of unfavorable responses 
only made by all groups. 

An analysis of these percentages using Zubin’s 
t test for ratio intervals (Baggaley, 1957) is ap- 
pended to Table 1. This analysis shows that, in five 
cases out-of six, both HC and LC experimental 
subjects gave a significantly higher percentage of 
unfavorable responses than did the respective con- 
trol groups. In the case of the 12-year-old LC sub- 
jects, the trend was consistent, This significant in- 
crease in the degree of conflict associated with the 
substitution of a Maori test figure provides evi- 
dence of unfavorable attitudes towards Maoris 
among the subjects tested. 


Age Hypothesis 

HC subjects. Significant differences in the num- 
ber of unfavorable responses were found between 
8- and 12-year-old subjects in both control (t = 
7.07, p < .001) and experimental groups (t = 6.36, 
p < .001). The control figures indicate a tendency 
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to make responses, distinctly favorable or unfavor- 
able, to situations suggesting social conflict of any 
kind, while the experimental figures reflect an addi- 
tional tendency to make these responses to situa- 
tions specifically involving Maori-white relations. 
No significant differences were found between 12- 
and 16-year-old subjects, in either control or ex- 
perimental groups, suggesting a leveling-off process 
in the development of social and ethnic attitudes. 
These results partially support, i.e., between the 
years of 8 and 12, the hypothesis that unfavorable 
attitudes towards Maoris develop as age increases. 

LC subjects. Again, significant differences in the 
number of unfavorable responses were found be- 
tween 8- and 12-year-old subjects in both control 
(t = 6.36, p < .001) and experimental groups (¢ = 
5.66, p < .001), paralleling the results obtained 
for HC subjects. In comparing 12- and 16-year-old 
subjects, the difference between control groups was 
not significant, whereas that between experimental 
groups was (¢ = 7.78, p < .001). As with HC sub- 
jects, the control figures in this instance indicate a 
leveling-off process in the development of social 
attitudes. This process is not paralleled in the ex- 
perimental figures, an aspect of the results that is 
amplified below. 


Contact Hypothesis 


No significant differences in the number of un- 
favorable responses were found between HC and 
LC control subjects at any one age level, indicating 
that there was little intercity difference with re- 
spect to social attitudes in general. Further, there 
were no significant differences between HC and LC 
experimental subjects at the 8- or at the 12-year-old 
levels. At the 16-year-old level, however, LC ex- 
perimental subjects made significantly more un- 
favorable responses than sHC experimental sub- 
jects (¢ = 5.66, p < .001). It is clear that, between 
the ages of 12 and 16 years, low interethnic contact 
is associated with a marked increase in unfavorable 
attitudes, 

These results partially support, i.e., at the 16- 
year-old level only, the hypothesis that attitudes 
towards Maoris are more unfavorable among sub- 
jects experiencing minimal interethnic contact. 
This finding is consistent with Allport’s (1954) sug- 
gestion that prejudice may be reduced by equal 
status contact between two groups in pursuit of 
common goals, 


CRITIQUE AND NoTES 


SUMMARY 


A picture story technique designed to measure 
the attitudes of whites in New Zealand towards 
Maoris was employed to investigate the develop- 
ment of ethnic attitudes in relation to age, and 
contact between whites and Maoris of approxi- 
mately equal status. Control and experimental sets 
of pictures differing only in whether the key figure 
was depicted as Maori or white were presented to 
40 white subjects at Gach of the age levels 8, 12, and 
16 years. Responses were scored as favorable, neu- 
tral, or unfavorable. It was concluded that: 

1. Unfavorable attitudes towards Maoris in- 
creased significantly among both HC and LC sub- 
jects between the ages of 8 and 12 years. 

2. Unfavorable attitudes towards Maoris in- 
creased significantly among LC subjects between 
the ages of 12 and 16 years, 

The second conclusion supports the suggestion 
that prejudice may be reduced by equal status 
contact. 
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number of variables have been experimentally 

demonstrated to effect the amount and 
direction of social influence in dyads. Over 20 years 
ago Sherif (1937) presented ata suggesting that 
the prestige of a partner is an important factor in 
the amount of convergence of judgments shown 
by the other member of a dyad. This finding has 
been further documented by several more recent 
experiments (Bray, 1950; Kelman, 1950; Mausner, 
1953; Mausner & Block, 1957; Zeaman, 1956). The 
importance of prior success or failure on the same 
task has been shown by Kelman (1950) and by 
Mausner (1954a). At least one previous study has 
investigated the role of prior success or failure of 
a partner on a different, but related, task (Mausner, 
1954b). Subjects who observed their partner’s 
success on the prior related task showed greater 
convergence than did those who observed their 
partner’s failure. “Partners” were actually con- 
federates of the experimenter, however, and it was 
thus not possible to make direct comparisons. 
within dyads of the amounts of influence in each 
direction. 

The present investigation was designed to allow 
such direct comparisons within dyads. It was 
desired to compare the amount of influence each 
subject exerted on the other in a judgmental situa- 
tion, following the induction of perceptions of 
differential competence on a prior related task. 
The primary hypothesis may be stated formally as 
follows: Subjects who have been successful, on a 
prior related task, have more influence on their 
unsuccessful partners than these partners have on 
them. 

In order to conduct a direct test of this hy- 
Pothesis, an experimental situation is required in 
which the influence of each member of a dyad on 
the other can be measured separately. Customarily 
used tasks, such*as judging the amount of apparent 
movement in the autokinetic situation or estimat- 
ing the length of lines, are not well suited for this 


1A paper based on the research reported was read 
at the 1960 American Psychological Association con- 
vention in Chicago. 

2 The authors would like to acknowledge the many 
helpful comments and suggestions made by Bernard 
Mausner of the Graduate School of Public Health, 
University of Pittsburgh. 
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purpose. For example, if one subject shifts from a 
mean judgment of 3 inches to one of 6, while the 
other shifts from a mean of 10 inches to one of 6, 
there is some question as to which subject has been 
the more influenced. By comparing absolute 
amounts of shift, the conclusion is reached that the 
second subject has been influenced more, but if 
ratios are taken, the opposite conclusion is indi- 
cated. 

A second disadvantage of such tasks is that they 
require a prior session, in which each subject 
makes judgments by himself, in order to establish 
a baseline from which any shift is computed. 

With these considerations in mind, the second 
author devised a technique, to be described, that 
permits the simultaneous measurement of both 
components of dyadic influence without requiring 
the prior determination of individual baselines. 

Two experiments are reported. The first lent no 
support to the primary hypothesis. A second 
experiment was then designed and conducted in 
order to test a secondary hypothesis concerning the 
negative results of the first experiment, The second 
experiment clearly supported the primary hypothe- 
sis, and, in doing so, also supported the secondary, 
post hoc hypothesis. 


EXPERIMENT I 


Method . 


Subjects and stimuli. Subjects were 20 male and 14 
female undergraduates ‘tecruited from introductory 
psychology classes at the Carnegie Institute of Tech- 
nology. Two stimulus series were employed in the 
subsequent task. The first consisted of 50 jars of vari- 
ous sizes and shapes, each containing between 250 and 
1,500 beans. The second consisted of 50 lines drawn on 
cardboard, ranging from 3 to 15 inches. 

Procedure. Subjects were run in like-sexed pairs. 
Prior to arrival they knew only that the experiment 
concerned accuracy of judgment. Upon arrival at the 
laboratory the experimenter informed the subjects 
that their first task was to take the Cardall Test of 
Practical Judgment. It was indicated that the Cardall 
test is widely used in industry and elsewhere and has 
been shown to be an excellent predictor of general 
judgmental ability. Subjects were allowed 20 minutes, 
after which the experimenter collected the answer 
sheets and pretended to grade them. 

One of the subjects was randomly chosen to be 
accorded low perceived competence on the prior task, 
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while the other was accorded high perceived compe- 
tence (referred to hereafter as Low and High, respec- 
tively). After the sham scoring was completed, the 
experimenter approached High and addressed him as 
follows: 

Mr. ? Well, this is terrific! Your score was 
the second highest I’ve had yet and is one of the 
best possible on this test. Very good! 

The experimenter addressed Low as follows: 

Mr. , are you sure you understood the 
directions I gave you for taking this test? [Low al- 
ways responded that he did.) Well then, are you 
feeling all right today? Perhaps you're sick? [Low 
always answered that this was not the case.) Well, 
Mr. , you score is the worst I’ve had yet, and 
is almost as low as one can score. This experiment is 
not a joke, and if you tried to do poorly, or didn’t 
try at all, say so now and you may leave. [Low always 
indicated that he had tried to do his best.) Mr. 
—~ if this is the best you can do, I would strongly 
suggest that you make an appointment at the 
Bureau of Measurement and Guidance to talk with 
one of the counselors. At any rate, let’s get on with 
the rest of the experiment and see how you do there. 

The experimenter made his remarks to each subject 
loud enough to be clearly overheard by the other. 

After the perceived differences in related task 
competence were established, subjects proceeded to the 
subsequent task. Each stimulus was clearly visible to 
both subjects, and the experimenter announced a 
standard against which the stimulus was to be com- 
pared. Subjects were required to judge aloud whether 
the stimulus was longer or shorter, or more or less 
numerous, than the standard. The experimenter indi- 
cated that stimuli would exceed the standard half the 
time and would be exceeded by the standard half the 
time. In actuality, the standard was in every case 
exactly equal to the stimulus magnitude, High went 
first on the odd numbered trials, while Low went first 
on the even numbered trials. 

Computation of scores. Data from odd numbered 
trials and even numbered trials were treated separately, 
The expected proportion of trials that both subjects 
would respond “plus” (stimulus larger than standard) 
was determined by multiplying the proportion of times 
High responded “plus” by the Proportion of times Low 
responded “plus.” A corresponding product was 
formed for the minus responses. The sum of these two 
products was subtracted from the actual proportion of 
agreements. If this difference was positive, the subject 
going second tended to imitate his partner, while a 
negative difference indicated an opposition tendency. 
An imitation score was computed for each subject by 
algebraically summing his scores on the two stimulus 
series (beans and lines), and an imitation score was 
computed for each dyad by algebraically summing the 
imitation scores of the two subjects. Difference scores 
were computed for each dyad by subtracting the imita- 
tion score of High from that of Low. 


Resulis 


The results of Experiment I are presented in 
Table 1. Because sex differences were negligible, 
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TABLE 1 
Amount AND DIRECTION OF ĪMITATION: EXPERIMENT I 
M SE t dj $ 
Imitation Scores 
fia DMT 
Lines 2786 | .0408 | 6.83 | 16 | <.0001 
Beans 3180 | .0404 | 7.87 | 16 | <.0001 
High sub- 3155 | .0334 | 9.45 | 16 | <.0001 
jects 
Low sub- +2811*| .0276 | 10.18 | 16 | <.0001 
jects 
Total -5966 | .0519 | 11.50 | 16 | <.0001 
1 
Asymmetry of Imitation Scores 
Lines —¿0004 | .0202 | 0.02 | 16 50 
Beans —.0340 | .0318 1.07 | 16 | >.15 
Total —.0344 | .0493 | 0.70 | 16 | >.25 


only means for all dyads are given. The amount of 
imitation behavior exhibited by subjects was 
highly significant, with all five values of £ exceeding 
the .0001 level. On the other hand, the degree to 
which High imitated Low was not significantly 
different from the degree to which Low imitated 
High as indicated by the ¢ test for correlated 
means. We conclude that the hypothesis under 
investigation received no support from the data. 


EXPERIMENT II 


One possible explanation for the lack of signifi- 
cant asymmetry in Experiment I is that the experi- 
mental manipulation was not successful. However, 
the highly visible reactions of subjects made this 
seem quite improbable, A more likely explanation 
was that subjects failed to perceive the abilities 
tapped by the Cardall test as having any apprecia- 
ble similarity to the bean and line judging. Conse- 
quently, a decision was made to run a second 
experiment, employing a prior task that would be 
perceived by subjects as having greater similarity 
to the subsequent task. 


Method 


Subjects and stimuli. Subjects were 20 male under- 
graduates recruited from introductory psychology 
classes. The same stimuli were used for the subsequent 
task as had been used in Experiment I. 

Procedure. Exactly the same procedure was followed 
as in Experiment I, except that the Cardall test was | 
replaced by a series of 36 irregularly drawn areas of 
various sizes, Prior to judging these areas subjects 
listened to a rather elaborate explanation by the ex- 
perimenter which was designed to bring out the simi- 
larity between the area judging on the one hand and 
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TABLE 2 
AMOUNT AND DIRECTION OF IMITATION: 
EXPERMENT II 
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TABLE 3 

PERCENTAGE OF TOTAL INFLUENCE ATTRIBUTABLE TO 
IMITATION 


“u SE t | af ? 
y . 
Imitation Sco; 
Lines 2791 0503 | 5.55 9 +0003 
Beans 3273 | .0537 | 6.10 9 | <.0d01 
High sub- +1587 | .0376 | 4.220 9 0012 > p > .001 
jects 
Low sub- -4477 0564 | 7.94 9 <.0001 
jects 
Total -6064 | .0672 | 9.02 9 <.0001 
eee 
Asymmetry of Imitation Scores 
——ŘŘŮō—__——— r M 
Lines .1780 | .0348 | 5.12 9 0005 
Beans +1110 | .0453 | 2.45 9 <.02 
Total 2390 | .0686 | 4.21) 9 0012 > p > .001 


ee ee mee 


the bean and line judging on the other. As before, the 
experimenter announced a correct standard, but sub- 
jects were required to respond “plus” or “minus.” 
Judgments of the areas were recorded on answer sheets, 
so that neither subject knew the responses given by his 
partner. In “grading” the answer sheets and in inducing 
differential perceptions, the experimenter followed the 
procedure used in Experiment I. 


Results 


The results of Experiment II are presented in 
Table 2. As with Experiment I, the extent to which 
subjects imitated their partners was highly signifi- 
cant. In contrast to Experiment I, however, Low 
subjects imitated High subjects to a significantly 
greater degree than conversely, as indicated by the 
test for correlated means. The overall difference 
on all stimuli closely approached the .001 level. 
Furthermore, the mean imitation score for Low 
subjects was almost three times as large as the 
mean for High subjects. We may therefore accept 
the hypothesis under investigation with confidence. 

Table 3 gives the percentage of total influence 
that consisted of imitation, for both experiments. 
Total amount of influence was computed by sum- 
ming the absolute magnitudes of the scores earned 
by each subject on each stimulus series, whereas 
amount of imitation was determined, as previously 
stated, by computing the corresponding algebraic 
sum, The difference between the two is due to the 
presence of negative scores. A negative score indi- 
cates that the subject agreed with his partner less 
often than would be expected by chance. 

The percentages of imitation for Experiment I 
are so large as to make differences between stimulus 


Experiment I Experiment II 


Lines 97.7 87.2 
Beans 97.4 95.7 
High subjects, 99.3 74.1 
Low subjects 95.5 100.0 

Total 97.5 91.6 


series and partners negligible. The percentages for 
Experiment II are also large, although not so large 
as in the first experiment. The rather appreciable 
difference between the percentage for High and 
that for Low is another manifestation of the asym- 
metry between partners. Not only do High sub- 
jects imitate their partners less in an absolute sense, 
but they also exhibit a smaller degree of imitation 
relative to the total amount they deviated from the 
chance level of agreement. In other words, High 
subjects showed more pronounced opposition 
tendencies than did Low subjects. 


Discussion 


The almost perfect symmetry found in Experi- 
ment I, coupled with the highly significant degree 
of asymmetry found in Experiment II, indicates 
that the gradient of generalization for influence 
from one task to another is relatively steep. In the 
absence of a set of tasks scaled on perceived simi- 
larity, it is not possible to determine the exact 
shape of this generalization curve, but there seems 
little doubt that the outcome of a prior task will 
have little or no effect during the susequent task 
unless the two tasks are perceived as being highly 
similar. d 

Some supplementary support for this conclusion 
comes from the extremely high degree of signifi- 
cance reached by the tests for the amount of imita- 
tion in iment I. There is every reason to 
believe that a technique so highly sensitive to the 
presence of influence would also be sensitive to any 
directional difference that might be present. 

The conclusion of a relatively steep gradient of 
generalization does not rule out the possibility ofa 
highly general evaluation of one individual by 
another. Subjects in the present experiment were 
previously unacquainted, and as a result each 
subject had only the announced outcomes on the 
prior task to guide him in making relative evalua- 
tions. If several prior tasks had been employed, and 
if one subject were always seen to do much better 
than the other, it is quite possible that highly 
generalized relative evaluations would result. 
Certainly in cases of extended interpersonal con- 
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tact outside the laboratory such generalized evalua- 
tions occur at times, and the question of the neces- 
sary conditions warrants systematic investigation. 

The relative amount of opposition behavior was 
greatest among High subjects of Experiment II, 
Although the difference between High subjects in 
the two experiments only approached significance 
(.10 > p > .05), it raises some interesting possi- 
bilities for future investigations. One might hy- 
Ppothesize that, other things being equal, status 
differentials in a group lead to a lower overall level 
of group conformity. In a stratified overall group, 
the high status groups become positive reference 
groups for the members of the low status groups. 
The low status groups, however, take on the 
properties of negative reference groups for high 
status members, If and when the low status groups 
learn to imitate successfully the high status groups, 
these high status groups should become motivated 
to change their patterns of behavior. In a more 
egalitarian group, pressures towards imitation and 
conformity are given free rein due to the absence 
of well developed negative reference groups, Con- 
sider the hypothetical case of complete absence of 
status differences. No member or group could 
utilize superior status as a basis for effecting an 
innovation. If such an attempt were made, the 
remaining members would have no motivation to 
follow. One would predict that a state of equilib- 
rium would be approached at which a common 
pattern of behavior would be adopted by all 
members of the group. 


SUMMARY 


The problem investigated was that of the effect 
of perceived task similarity and perceived differ- 
ences in task competence on the asymmetry of 
influence between members, of dyads. 

Two experiments were conducted, In both 
experiments a prior task, in which perceptions of 
differential task competence were induced, was 
followed by the subsequent task of judging the 
magnitudes of 100 stimuli. In Experiment I, the 
induction of perceptions of differential prior task 
competence failed to produce a significant degree of 
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asymmetry of influence during the subsequent 
task. It was hypothesized that the lack of signifi- 
cant asymmetry was due to the failure of subjects 
to perceive the prior task as requiring skills similar 
to those required by the subsequent task. 

With the above post koc hypothesis in mind, a 
second experiment was designed and conducted, 
The prior task was replaced by one of greater 


similarity to the subsequent task, and instructions — 


were rewritten in order to emphasize this similarity, 
Experiment II yieldéd a highly significant degree 
of asymmetry of influence. It is concluded that 
perceived task competence is an important deter- 
minant of the amount and direction of social 
influence, but that its effectiveness depends in a 
crucial way on perceived task similarity. 
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SUCCESSIVE RESPONSES TO SIMULTANEOUS STIMULATION 
IN ELDERLY PATIENTS WITH MEMORY DISORDER" 


JAMES INGLIS aw R. E. SANDERSON 
Queen’s University, Kingston, Ontario 
. 


Or in learning ability with age can be 
shown to affect the performance of otherwise 
normal people (Welford, 1958), and severe dis- 
turbances of the learning pracesses constitute an 
important part of that kind of disorder of the aged 
commonly called “senile dementia” (Inglis, 1958). 

Welford (1956) has argued that the key to the 
understanding of the crucial nature of these and 
other performance decrements with age may lie in 
a better knowledge of the processes underlying 
short-term retention. It has been shown, however, 
that on the orthodox digit span test, often used 
clinically as a measure of short-term retention, 
performance falls off very little with age (Bromley, 
1958) and does mot appear to discriminate well 
between elderly psychiatric patients with and 
without memory disorder (Inglis, 1957). 

A number of experiments carried out by Broad- 
bent (1958) and his associates with young adult 
subjects have shown that certain modifications in 
the way of presenting digit span stimuli may serve 
to elucidate some of the problems raised by per- 
formance on this apparently simple task. Broad- 
bent has suggested that whereas the orthodox way 
of presenting the digit span stim i may involve 
mainly a simple perceptual process, the simultane- 
ous presentation of different parts of a span 
through different sensory channels (e.g., both ears 
at once) may require, in addition, the participation 
of a simple storage system, which itself may pro- 
vide the basis for short-term memory. 

Thus, in one experiment Broadbent (1957) 
showed that when digit span stimuli were relayed 
at a speed of 2 per second through headphones to 
each subject, one half of the span to one ear and, 
simultaneously, the other half of the span to the 
other ear, then the subjects tested under these 
conditions could reproduce the digits sequentially. 
Furthermore, in such reproduction, elements from 
one half of the span were never alternated with 
elements from the other half. The first half-set 
recalled, however, commonly contained fewer 
errors than the second half; this constituted a kind 
of serial-order effect. Broadbent (1956), in an 


1 This study is part of a project carried out with the 
assistance of a Canadian National Health Grants Ad- 
ministration Mental Health Grant (Project No. 605-5- 
285). The authors are also indebted to the Superin- 
tendent and staff of the Ontario Hospital, Kingston, 
for their help and cooperation. 


experiment in which the half-spans were presented 
to eye and ear simultaneously, obtained similar 
results, demonstrating that these phenomena are 
not peculiar to hearing but may be generalized to 
different sense modalities. 

Broadbent suggested two kinds of mechanism 
which may underlie such performances on the 
digit span test. First, there is a “p-system,” which 
can only pass information successively. Second, 
there is an “s-system,” which can store excess 
information, arriving, for example, when the p- 
mechanism is already fully occupied in trans- 
mitting information from another channel. Broad- 
bent (1957) tried to find out experimentally if the 
s-system could be regarded as operative even in the 
orthodox digit span test or if it could best be 
regarded as a kind of emergency device, only called 
upon when excess information must be handled, 
In this case, with dichotic stimulation, the order of 
recall of the two half-spans was prescribed by the 
experimenter. The recall order was prescribed to 
some of the subjects before the stimuli were pre- 
sented and to some of the subjects after presenta- 
tion of the stimuli. 

Tf it were the case that all the stimuli passed 
through the s-system before entering the p-system, 
then the serial-order effect would be expected to 
appear, regardless of whether the recall order had 
been prescribed before or after the stimuli were 
presented. If, on the other hand, only part of the 
information passed through the s-system, the 
other part going straight through the p-system, 
then the serial-order effect would not be expected 
to appear in the situation in which the recall order 
had been prescribed after the presentation of the 
stimuli. 

This experiment did not produce completely 
unequivocal results. In the situation in which 
recall order was imposed before stimulation, the 
serial-order effect did appear. In the situation in 
which recall order was specified after stimulation, 
the serial-order effect only appeared in the early 
stages of practice. Tn the later stages, this effect 
almost disappeared, the two half-sets of digits 
being recalled with about equal efficiency. This 
suggests that while initially all the data may go 
through the s-system before being passed through 
the p-system, in the later stages of practice only 
part of the data passes through the s-system first, 
Broadbent admits that this outcome is not fully 


conclusive. Nevertheless, he maintains that it is 
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most likely that, in the ordinary immediate mem- 
ory test, the information probably enters the p- 
system at once without passing first through the 
s-system. 

This notion is supported by other evidence 
(Inglis, 1957) which shows that elderly psychiatric 
patients with memory disorder (and hence showing 
presumptive evidence of some breakdown of their 
storage processes) may do about as well on ordinary 
digit span tests as patients of the same age but 
without memory disorder. Zangwill (1943) had, of 
course, previously pointed out that restriction of 
immediate memory as measured by the digit span 
test may be regarded as the exception, even in 
cases showing a severe retention defect. 

If memory disorder in elderly psychiatric pa- 
tients is, in part at least, based upon some break- 
down of the kind of storage mechanism postulated 
by Broadbent, then we may expect such patients 
to show specifiable disturbances of recall in the 
dichotic stimulation situation relative to the 
performance of a matched group of patients with- 
out memory disorder. The experiment reported 
here was designed to investigate this notion. 

METHOD 
Subjects 

The experimental group comprised 15 elderly pa- 
tients clinically estimated to be suffering from un- 
doubted memory disorder. This estimate was made on 
the basis of a complaint about the patient’s memory 
made by the doctor in charge of the case, by the ward 
staff, or both. 

A control group of 15 patients without memory dis- 
order was also tested. The age range of the experimental 
group was 68-83; that of the control group was 66-84. 
The Wechsler Yerbal scale IQ range in the experimental 
group was 77-103; in the control group it was 75-112, 
There were 4 men and 11 women in each group, all 
psychiatric patients of the Ontario Hospital, Kingston, 
Ontario. They were almost all of the rural working 
class. The women were, for example, almost all farmer’s 
and farm worker’s wives. All the patients (both experi- 
mental and control) were Cooperative during testing, 
and none of them suffered any sensory or motor deficit 
So severe as to interfere with test performance, The 
experimental group was mainly composed of cases 
diagnosed as senile dementia, usually having been 
admitted to hospital for difficulties associated with 
failing memory. The control group was principally com- 
posed of functionally disordered patients diagnosed as 
cases of depression and paranoid schizophrenia, 


Tests 


The WAIS (Wechsler, 1955) was used for 
the two groups on verbal ability (Verbal IQ) and on 
average digit span forward. This test was also used to 
check on a previously observed discrepancy between 
the Verbal and Performance scales (Inglis, 1959) in a 
memory-disordered group. 


matching 
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TABLE 1 
Dicits Usep FOR DOUBLE STIMULATION TEST 
Channel 1 Channel 2 
Practice Series 
a— 
8 Blank 
Blank 5 
58 5 
Test Series 
w J 
6 2 
9 7 
7 4 
5 8 
32 94 
18 65 
37 49 
57 86 
658 417 
432 869 
571 243 
769 354 
6235 1943 
4863 2547 
2719 8475 
9327 6351 


The apparatus used to administer the binaural stimuli 
consisted of a Webcor stereophonic tape recorder 
(Model EP2821-5B) played into a pair of Edison (Model 
30) earphones, Different sets of digits were recorded on 
each channel as shown in Table 1. The digits were 
presented at the rate of 1 per second. 

Each subject was first told, “You are going to hear 
a number. Tell me what you hear.” The spoken digit 
8 was then played on Channel 1. If the subject re- 
sponded correctly, then the same procedure was re- 
peated for the digit 5 on Channel 2. If the subject failed 
to respond or gave the wrong number, the volume was 
increased until the correct response was made. 

Second, the subject was told, “Now you are going to 
hear two numbers together, one in each.ear. Tell me 
what you hear.” The two channels then played the 
spoken digits 5 and 8 simultaneously. If the subject 
responded with the correct digits (i.e., 5-8 or 8-5), then 
the experimenter went on to the test series. If not, this 
item of the practice series was repeated once more. s 

The foregoing constituted a practice series which 
served to ensure that any difference found between 
the groups could not merely be due, say, to sensory 
differences. It also got the subjects used to the situa- 
tion. 

Third, each subject was told, “Now you are going 
to hear n numbers, n/2 in each ear” (where n was 2 of 
4 or 6 or 8). “Tell me what numbers you hear.” These 
instructions were repeated at the beginning of each 
series of a new length. Between each of the items within 
a series, the experimenter said, “Now what numbers 
do you hear?” 

This series of tests were scored as follows: The first 
correct digit repeated in each series determined which 
channel was taken as the “dominant” one for that item 
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TABLE 2 


MEANS, STANDARD DEVIATIONS, AND THE SIGNIFICANCE OF THE DIFFERENCES BETWEEN THE MEANS OF THE 
CRITERION GROUPS ON THE VARIABLES EXAMINED 


Variable Experimental Group Control Group Between Group Differences 
M z M r $ t b 
ge 15.73 4.42 75.00 4.46 
WAIS IQs Verbal scale 89.00 9.27 95.40 10.38 age? 
Performance scale 72.07 o 6.43 88.53 14.02 4.99 <.01 
Verbal minus Performance 16.93 8.29 6.87 8.36 3.27 <.01 
Full Scale 80.67 7,57 92.07 10.39 3.44 <.01 
Digit Span Forward 5.93 1°44 6.53 0.83 1.36 ns 
Dichotic stimulation 
b within within 
group difference group difference 
1 Digit 
ominant 0.98 0.14 1.00 0.00 0.55 
Nondominant 0.47 < We 0.95 0 as 310 <01 
2 p 
ominant 1.83 0.20 1.95 0.20 1.64 
Nondominant 0.09 0-23) < -001 1.80 gass 8.30 1 
3 Dige 
ominant 2.42 0.48 2.90 0.17 3.64 <.01 
Nondominant 005 0. tf} <.001 2.15 eles OL 4186 <.001 
4 Digits 
ominant 2.82 1.06 3.63 0.45 272 <.02 
Nondominant 0.09 besh< 0010.95 oae «001 3.55 <01 


by that subject; the second half of the total digit span 
was called “nondominant.” The score obtained was the 
average number of correct responses for each half digit 
set taking, of course, digit position in the series into 
account, 


RESULTS AND DISCUSSION 

The means and standard deviations and the 
significance of the difference between the means 
of the experimental and control groups are shown 
in Table 2. The significance of the differences was 
aaen by the uncorrelated ¢ test (McNemar, 

49). 

Previous findings are confirmed by the signifi- 
cance of the Verbal Performance difference on the 
WAIS. 

The differences between the groups in the 
dichotic stimulation experiment are significant in 
the “nondominant” channel for all the lengths of 
digit series used. This suggests that the clinical 
picture of memory disorder in elderly psychiatric 
patients is baséd, at least in part, upon a break- 
down of the kind of storage mechanism (s-system) 
Proposed by Broadbent to account for the capacity 
of young normal adults to respond successively to 
information delivered to them simultaneously. 

Tt can also be seen that significant differences of 
lesser magnitude appear between the groups in 
their responses to information in the dominant 
channel for the two longest half-sets (i.e., 3 and 4 
digits), No explanation can at present be offered 
for this finding. 

Some of the differences between the accuracy of 


recall in the dominant and the nondominant 
channels within the experimental and control 
groups are also statistically significant. This finding 
is in line with the notion of a kind of serial-order 
effect in the recall of the second half of each digit 
series. It is also congruent with the present hy- 
pothesis that this effect should appear even in the 
shortest series in the case of the experimental 
group, although it does not appear in the control 
group until a total digit series length of six figures 
is reached. M 

It may even be possible to apply the results 
obtained in this investigation to secure a clinical 
test of memory disorder which would be just as 
convenient to use as the orthodox digit span but 
much more likely to be valid. 

SUMMARY 

This study was designed to test the notion that 
memory disorder, clinically ascertained in elderly 
persons, may þe related to a breakdown in the 
kind of storage system which has been adduced to 
explain successive responses to simultaneous 
stimulation in normal young adults. " 

Two groups of 15 elderly psychiatric patients, 
one group of patients with memory disorder and 
one group without, were tested. The results ob- 
tained support the hypothesis put forward. 
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INTERPERSONAL ATTRACTION AND ATTITUDE SIMILARITY 


DONN BYRNE? 
e University of Texas 


N investigating the direction and the strength of 
the! affect engendered between the two partici- 
‘pants in a dyad, we may arrange the expressed 
feelings of each individual along a continuum 
ranging from strongly positive to strongly negative. 
The accurate prediction of interpersonal attraction 
and repulsion in such relationships will undoubt- 
edly require that we secure knowledge about 
several classes of independent variables. 

Probably the most obvious and also best docu- 
mented variable is that of propinquity. Studies in 
a wide variety of settings have shown that physical 
and functional distance influence interaction and 
interpersonal attraction (Byrne, in press). Once the 
environmental situation permits or encourages 
interaction, affiliation need should be helpful in 
predicting individual differences in interpersonal 
behavior (Atkinson, Heyns, & Veroff, 1954; 
Schachter, 1959). A third class of variables con- 
sists of the overt stimulus properties of each indi- 
vidual to which other individuals would be ex- 
pected to respond on the basis of generalization 
from previous interpersonal interactions. 

Once interaction has begun, reciprocal reward 
and punishment is proposed as the crucial deter- 
mining factor. It has been suggested (Newcomb, 
1956) that attraction between persons is a function 
of the extent to which reciprocal rewards are 
present in their interaction; perhaps dislike is a 
function of reciprocal punishments. A special sub- 
class of this variable would be perceived similarity 
and dissimilarity of the attitudes of two individ- 
uals, It can be assumed that persons in our culture 
have well established learned drives to be logical 
and to make a correct report of the environment. 
Those who seem deficient in this respect are 
generally categorized as being uninformed, of low 
intelligence, immoral, and/or as being out of 
contact with reality. It is primarily through con- 
sensual validation that we determine whether we 
or anyone else is logical or correct in interpreting 
environmental events. Hence, any time that 
another person offers us validation by indicating 
that his percepts and concepts are congruent with 
ours, it constitutes a rewarding interaction and, 


1A portion of this paper was read at the meetings 
of the Southwestern Psychological Association, Galves- 
ton, 1960. 

This investigation was supported in part by a re- 
search grant (EF-140) from the University of Texas 
Research Institute. 

2 From the Laboratory for Personality Research. 


hence, one element in forming a positive relation- 
ship. Any time that another person indicates dis- 
similarity between, our two notions, it constitutes 
a punishing interaction and thus one element in 
forming a negative relationship. Disagreement 
raises the unpleasant possibility that we are to 
some degree stupid, uninformed, immoral, or 
insane, An alternative possibility is that it is the 
other person who is deficient in one or more of these 
characteristics. Probably other variables, such as 
the importance of the issue to each individual, 
contribute to the effect. 

A number of studies have found greater simi- 
larity among friends than among nonfriends with 
respect to a variety of issues (Bonney, 1946; 
Loomis, 1946; Newcomb, 1956; Precker, 1952; 
Richardson, 1940; Winslow, 1937). A few studies 
of a more experimental nature also support the 
notion of a relationship between attitude similarity 
and interpersonal attraction (Jones & Daugherty, 
1959; Smith, 1957). 

In order to test the proposition that the effect 
of attitude similarity is a causative one and to test 
some implications arising from the preceding 
speculations about the reason for the effect, it 
was hypothesized that (a) a stranger who is known 
to have attitudes similar to those of the subject is 
better liked than a stranger with attitudes dis- 
similar to those of the subject, (b) a stranger who 
is known to have attitudes similar to those of the 
subject is judged to be more intelligent, better 
informed, more moral, and better adjusted than a 
stranger with attitudes dissimilar to those of the 
subject, and (c) a stranger who is known to have 
similar attitudes on issues important to the subject 
and dissimilar attitudes on unimportant issues is 
better liked and is evaluated more positively on 
the other four variables than a stranger for whom 
the reverse is true. 


METHOD 


Attitude measure? On the basis of a pilot study, 26 
issues were selected for inclusion in an attitude and 


3 The attitude measure, response frequencies, and 
the rating scales for interpersonal attraction and evalu- 
ation have been deposited with the American Documen- 
tation Institute. Order Document No. 6770 from ADI 
Auxiliary Publications Project, Photoduplication Serv- 
ice, Library of Congress; Washington 95; D. C té 
mitting in advance $1.75 for microfilm or $2.50 for 
photocopies. Make checks payable to: Chief, Photo- 
duplication Service, Library of Congress. 
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opinion scale. Each issue was presented in a seven- 
point scale. The issues ranged from those thought to be 
extremely important by the pilot subjects (e.g., inte- 
gration, God, premarital sex relations) to those con- 
sidered to be of minor importance (e.g., western movies 
and television programs, classical music, politics). 

Procedure. The attittde scale was administered to 
64 students (36 male, 28 female) enrolled in an intro- 
ductory psychology course at the University of Texas. 
Response heterogeneity differed frm item to item, but 
there was moderately wide diversity of opinion among 
the 64 subjects. After filling out the attitude scale, the 
subjects were asked to indicate which they believed to 
be the 13 most important and 13 least important issues. 

Two weeks later they were falsely informed that 
the attitude scale had been given as part of a study in 
interpersonal prediction. They were told that individ- 
uals in another class had been given the same scale 
that they took, students in the two classes were matched 
on the basis of sex, and they were to be given each 
other’s tests (name removed) in order to determine how 
much they could learn about one another from this 
information alone. 

Actually the questionnaire they received at this 
time was a fake one made up by the experimenter. The 
subjects had been randomly divided into four groups; 
one group received attitude scales filled out exactly the 
same as theirs had been, one received scales with ex- 
actly opposite views expressed, one received scales 
with similar opinions on the most important issues and 
dissimilar on the least important, and the fourth Te- 
ceived scaleswith similar opinions on the least important 
issues and dissimilar opinions on the most important. 
The four groups of subjects did not differ significantly 
in their initial responses to any of the 26 issues. 


AND NOTES 


Interpersonal atiraciion and evaluation. A rating 
scale was used as the measure of interpersonal attrac- 
tion and evaluation with each dependent variable rep- 
resented in a seven-point scale. As a measure of inter- 
personal attractibn, subjects were asked to indicate 
how well they felt they would like this person and 
whether they believed they would enjoy working with 
him (or her) as a partner in an experiment. Four scales 
dealt with evaluation; the subjects were asked for their 
judgments asto the other student's intelligence, knowl- 
edge of current events, morality, and adjustment, 

> o 


RESULTS 


First Hypothesis. Table 1 shows the comparisons 
of the two groups on each of the dependent varia- 
bles. The first hypothesis was overwhelmingly 
confirmed for each of the two attraction scales, The 
group with attitude scales filled out the same as 
their own (SA) indicated significantly more posi- 
tive feelings toward the “stranger” than did the 
group which received scales indicating dissimilar 
attitudes (DA). Each difference was significant at 
less than the .001 level. 

Second Hypothesis. As is indicated in Table 1, 
the second hypothesis was also confirmed. The SA 
group rated the “stranger” significantly higher 
than did the DA group on intelligence, knowledge 
of current events, morality, and adjustment. 
Again, each difference reached a level of signifi- 
cance beyond the .001 level. 

Third Hypothesis. The third hypothesis, con- 
cerning the influence of important vs. unimportant 


TABLE 1 


COMPARISON OF THE SIMILAR ATTITUDE 


(SA) AND DISSIMILAR ATTITUDE (DA) Groves 


ON INTERPERSONAL ATTRACTION AND EVALUATION 


= 
SA DA 
; (N = 17) (N = 17) D t df t 
M SD M SD 
Personal Feelings 6.53.50 1.76 6784 1.46 32 <.001 
Desirability as Work Partner 6.47 150.268 E 38h a 32  <.001 
Intelligence 5.65 68 3.06 187 2:50 9.37 32 <-001 
Knowledge of Current Events 4.65 1.14 2.65 912.00 5.51 32 <.001 
Morality 5:76 73 3AT 02.09 2.29 44 32 <.001 
Adjustment 6.00 84 2.71 1.13 3.29 9.36 32 <.001 
TABLE 2 


COMPARISON OF THE SIMILAR ON IMPORTANT 
Artirupes (SUA) G: 


Attrrupes (SIA) AND SIMILAR on UNIMPORTANT 
ROUPS ON INTERPERSONAL ATTRACTION AND EVALUATION 


SIA UA 
(W = 15) we 15) D t df b 
“u SD M SD 

Personal Feelings 4.20 1.51 2.60 i 
Pe as Work Partner 4.27 1.44 3.33 La rb ie 28 a 

telligence 443° 62 3.73 1134 “a0 28 ns 
Faon edge of Current Events 3.60 aI 3.53 .96 .07 19 28 ns 
Morality 5-33 1.25 3.33 1.66 2.00 3.60 23 <.01 

ljustment 4.07 1.57 2:98 418 «1114 2:17 28 <.05 
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issues, was only partially confirmed. As is shown 
in Table 2, the Similar on Important Attitudes 
Group (SIA) rated the “stranger” significantly 
more positively than did the Similar on Unim- 
portant Attitudes Group (SUA) with respect to 
their personal feelings about him, his morality, and 
his adjustment. On the other three variables, the 
two groups did not differ. 


° 
Discussion 


The experimental confirniation of the first two 
hypotheses is very encouraging for further research 
designed to investigate other aspects of the rela- 
tionship between interpersonal attraction and 
attitude similarity. It should be possible now to 
study the effect of attitude differences less extreme 
than those in the present study and to combine 
this variable with the others that influence inter- 
personal attraction in order to determine interac- 
tion effects. 

Because of the fact that this group of subjects 
showed a degree of homogeneity of opinion on 
some of the attitude items, a possible alternative 
interpretation is that they were responding nega- 
tively to unusual and deviant beliefs rather than 
to disagreement per se. On 19 of the 26 issues it was 
possible for a subject to fall in the deviant one 
fourth of the group by expressing a positive or a 
negative opinion. The range among the subjects 
was from no deviant attitudes to nine; they were 
divided into high and low subgroups on the basis 
of this score. Since these “conforming” and “devi- 
ant” subgroups did not differ from one another in 
responding to strangers with similar vs. different 
attitudes, there is no evidence to support this 
other interpretation of the results. 

The partial failure of the third hypothesis led 
to a comparison of all four groups on each of the 
dependent variables. The results suggest that the 
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Personal Feelings scale is the most sensitive meas- 
ure of interpersonal attraction. With the other five 
interpersonal judgment scales, additional factors 
apparently contribute to the variance. 
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ANXIETY, BIRTH ORDER, AND SUSCEPTIBILITY TO SOCIAL INFLUENCE! 


FREDERICK R. STAPLES anp RICHARD H. WALTERS 
University of Toronto e 


CHACHTER (1959) daims that first-born children 
are subjected to more inconsistent nurturance 

than are later-born children andsconsequently show 
more dependency behavior in the form of affiliative 
responses. Schachter finds support for his hy- 
pothesis in studies reported by Rosonow and 
Whyte (1931), Sletto (1934), Sears (1950), Sears, 
Maccoby, and Levin (1957), and Haeberle (1958). 
In addition he reports experiments carried out at 
Minnesota which show that first-born subjects 
more readily change their opinions when fictitious 
group norms are provided (Ehrlich, 1958) and 
experience greater anxiety reduction through 
affiliation with others (Wrightsman, 1959) than 
do later-born subjects. The experiment reported 
in this paper is an attempt to test Schachter’s 
hypothesis, using suggestibility in the autokinetic 
situation as an index of susceptibility to social 
influence. 

The following hypotheses were put forward: 

1. In the autokinetic situation, even when an 
anxiety inducing situation is not created, first- 
born subjects show greater suggestibility than do 
later-born subjects, 

2. The introduction of anxiety increases suscep- 
tibility to social influence. This effect is more 
marked for first-born than for later-born subjects, 


MetHop 
Subjects > 


Sixty-four female undergraduates served as subjects. 
Thirty-two subjects were first-born children; the re- 
mainder were later-born. In order to permit finer anal- 
ysis, 16 oldest and 16 
among the first-born subjects, and 16 middle and 16 
youngest children were included among the later-born 


Because the age difference between a child and his 
next oldest and/or next youngest sibling may influence 
the dependency training he receives, some attempt was 
made to control this variable. The mean difference be- 
tween oldest subjects and their next oldest sibling, 
between middle-born subjects and their next olde 
and next youngest siblings, and between youngest sub- 
jects and their next youngest sibling was controlled at 
3.0 years, Subjects were assigned to control and ex- 
perimental groups in such a way that (for oldest, mid- 
dle, and youngest subjects), in addition to control of 
age differences, a mean family size of 3.5 children was 
insured within each group. 


1 This study was supported in part by a grant from 
the National Research Council of Canad 
to R. H. Walters, ee 


Apparatus 


The autokinetic apparatus has been described by 
Walters and Quinn (1960), With the addition of a set 
of relays and an electric timer, this apparatus provided 
a controlled exposure of the stimulus (a point of light 
7 feet from the subject) and an automatic recording of 
the interval between the presentation of the stimulus 
and the subject’s signal that the light had commenced 
to move (the subject’s latency), (During testing, the 
only illumination in the room was provided by two 
small night-lights which allowed the experimenter to 
read the timer and record the subject’s responses.) 


Procedure 

In order to be sure that the subject could identify 
the stimulus and follow through the instructions cor- 
rectly, a few preliminary trials were given. The subject 
was then instructed: 

Tam going to turn on the light several times. Each 
time, after a short pause, the light will begin to move. 
As soon as you see the light start to move, press this 
switch (the experimenter indicated a lever to the right 
of the subject) and hold it down until the light goes 
out. The light will go out shortly after you press the 
switch. Then I shall ask you how far you think the 
light moved that time. Please try and give an answer 
to the nearest half inch. The light may not move the 
same distance each time and it may not always move 
in the same direction, O.K.? The light will come on, 
you press the switch as soon as you see the light start 
to move, and then I shall ask you how far you think 
it moved. 0.K.? 

Five trials, constituting the presuggestion series, 
were then given. On each trial the subject’s latency and 
distance estimate were recorded. 

Control subjects were now told: 

For the next five trials I'll give you some feedback 
as to how well you are estimating the distance the 
light moves. The procedure will be the same as for 
the first five trials, but after you give each estimate 
I shall tell you how far the light actually moved that 
time. O.K.? 

For the experimental subjects, instructions were as 
follows: 

There will be five trials in which an electric shock 
will be given for any estimate which differs from the 
Correct distance by more than one and a half inches. 
However, to give you some idea of how well you are 
doing, I am first going to give you some information 
as to how close your estimates are. There will be a 
series of five trials in which I shall tell you how far 
the light actually moved each time. The procedure 
will be the same as for the first five trials, but after 
you give each estimate I shall tell you how far the 
light moved that time. O.K.? 

Five more trials were now given. To each of the 
subject’s estimates the experimenter added a variable 
amount, averaging 2.5 inches over the five trials. Ex- 
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perimental and control subjects were treated exactly 
alike. “Feedback” was given on each trial. 

Control subjects were now told: 

Now there will be five more trials that will be just 
like the first five. That is, after fou give your esti- 
mate I shall give you no information as to how far 
the light actually moved. 0.K.? e 
In the case of experimental subjects, the experi- 

menter now took a thimble, from which a wire extended 
to the rear end of the apparatus, and placed it on a 
finger of the subject’s left hand. The subject was now 
told: e 2 
There will be five more trials following the same 
procedure as the first five. That is, after you give 
your estimate I’ll give you no information as to how 
far the light actually moved. However, if your esti- 
mate differs from the actual distance by more than 
one and a half inches, you will receive an electric 
shock through the thimble. The shock is rather se- 
vere, but if your judgments are accurate you may 

not receive any. O.K.? 

Five postsuggestion trials were given all subjects. 
No subject in the experimental group was, in fact, 
shocked. 

Before leaving the experimental room, each subject 
was handed a pencil and paper and was asked to draw 
lines 1 inch, 3 inches, and 5 inches in length. 

About one month after the experiment all subjects 
were asked to indicate whether they were definitely 
willing, probably willing, or definitely not willing to 
participate in another experiment similar to the one 
already carried out. 


RESULTS 


Inspection of data indicated that, at all stages 
of the experiment, differences between oldest and 
only subjects and between middle and youngest 
subjects were negligible. Consequently, results 
will be given only for the larger groups of first-born 
and later-born subjects. Statistical analyses are 
based on the median presuggestion latency and 
judgment and the median postsuggestion latency 
and judgment of individual subjects. 

Table 1 gives the average of the median latencies 
of first-born and later-born subjects on the pre- 
suggestion trials. The results were in the predicted 
direction but were significant only at the .06 level. 

Before subjects were compared for their re- 
sponses to suggestion, analyses of their judgments 
of length, based on their drawings of lines, were 
carried out. An analysis of variance (Lindquist’s, 
1953, Type I design), showed that there were no 
systematic differences among the eight possible 
groups of subjects (four birth-orders, two condi- 
tions). The groups were then combined, and at 
test for correlated data was carried out to ascertain 
Whether the judged distance between 1 and 3 
inches was the same as the judged distance between 
3 and 5 inches. Mean judgments of these two 
intervals did not differ. On the basis of these 
analyses, it was decided to treat the judged dis- 
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TABLE 1 
Means AND SDs (in Seconds) of PRESUGGESTION 
LATENCIES oF First-BorN AND LATER-BORN 
SUBJECTS 
(N = 32 in each group) 


First-Born o Later-Born 
M SD M SD 


9.33 10.61 14.50 15.08 
CR = 1.58;  <.06 (one-tailed test) 


Note.—Since heterogeneity of variance was present, a critical 
ratio was obtained instead of a # ratio. 


tances in the autokinetic situation as if these fell 
on an interval scale that remained relatively con- 
stant from group to group. This decision permitted 
a test of the interaction effect predicted by Hy- 
pothesis 2. 

Since group regression lines were parallel (F = 
1.73; p > .05), an analysis of covariance of sub- 
jects’ median estimates in the postsuggestion trials 
was carried out, using median estimates in, the 
presuggestion trials as the covariant. The adjusted 
group means are given in Table 2; the results of 
the analysis of covariance are reported in Table 3. 
It is clear from these tables that first-born subjects 
and later-born subjects were differentially in- 
fluenced by the threat of shock. Subsequent ¢ tests 
showed that the first-born experimental subjects 
were significantly more suggestible than any of 
the other three groups, among which there were 
no significant differences. 

Results from the questionnaire are summarized 
in Table 4. Results for first-born and later-born 
subjects were first combined to form a 3 X 2 con- 
tingency table. A chi square test showed that 
subjects who had been threatened with shock 
showed less willingness to participate in a further 
experiment than did subjects who had not been so 
threatened (p < .01). The “definitely willing” and 


«“probably willing” categories were then combined 


for separate tests within the two birth-order 
groups. First-born subjects who had been threat- 
ened with shock were less willing to return than 
first-born subjects who had not been threatened 
(p < .01), but there was no difference between 
later-born subjects who had been threatened and 
those who had not been threatened (p = .218 by 
Fisher’s exact probability test [Siegel, 1956]). 


Discussion 


In the autokinetic situation, first-born subjects 
responded more quickly than later-born subjects 
to the suggestion that the light would move. In 
addition, first-born subjects under the anxiety 
arousing condition were more suggestible than 
first-born subjects under the nonanxiety arousing 
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TABLE 2 


Apjustep Group MEANS (in Incues) or DISTANCE 
TRIALS 


ESTIMATES ON POSTSUGGESTION 
(N = 16 in each Group) 


5) 


Experimental Control 
First-Born —Later-Born’ First-Bom _Later-Born 
5.36 3.57 3.48 4.10 


TABLE 3 


ANALYsIs OF COVARIANCE oF POSTSUGGESTION 
Distance ESTIMATES 


Source SS | df US F ’ 
Birth Orders 5.269 | 1| 5.269 | 1.71 | >.05 
Threat of 7.333 | 1| 7.333 | 2.38 | >.05 

Shock 
Interaction 22.119 | 1 | 22.119 | 7.17 | <.01 
Within 182.010 | 59 | 3.085 
TABLE 4 
DISTRIBUTION OF RESPONSES TO QUESTIONNAIRE 
First-Born Later-Born 
R TR 
OM BB Cinto ES Conta 
Definitely willing 2 5 3 7 
Probably willi 3 9 7 6 
Definitely unwilling 11 2 6 3 


condition, whereas later-born subjects seemed to 


be unaffected by the 
findings give consideral 


anxiety condition. These 
ble support to Schachter’s 


(1959) theory concerning the effects of birth-order 


on affiliative behavior, 
However, acvording 


subjects under nonanxiety conditions should be 


similar conditions; as Table 2 indicates, there was 
virtually no difference between the two groups in 


hypothesis, one would not expect significant 
differences between first-born and PS 
subjects in the presuggestion trials. 


Moreover, Staples? 


administered four TAT 


2 Unpublished study carried out in partial fulfilment 


of the requirements for a 
versity of Toronto. 


graduate course at the Uni- 


cards (6GF, 7GF, 8GM, and 10) to 26 firs 
and 26 later-born female subjects. Slides 
Presented in the second half of a regular 
period, with no previous indication that 
would be carried out. The situation thus roi 
correcponded to the “neutral” affiliation aro 
conditions of previous studies of affiliative fan 
(Atkinson, Heyns, & Veroff, 1954; Shiple 
Veroff, 1952)» Standard instructions for the 

TAT were used. Stories were scored for affiliati 
imagery according to’ the manual provided 
Heyns, Veroff, and Atkinson (1958). Scoring 
done independently by two scorers, both of wl 
were ignorant of the birth-order of subjects; 
crepancies were subsequently settled by mutua 


that first-born subjects produced more affiliati 
imagery than later-born subjects (z = 1.64; p , 
-05, one-tailed test). i 

According to Schachter (1959), “dependen 
behavior will be most strongly manifested in condi 
tions of disturbance and anxiety but... th 
should be other non-anxiety related indications 
(a relationship between ordinal position 
dependence)” (p. 89). Differences between fi 
born and later-born subjects in the presuggesti o] 
trials and on the fantasy measure are indication 
of this kind. 

Walters and Ray (1960) found no differen 
between first-born and later-born grade-si 
boys in rate of conditioning to a social reinforer 
(approval). This finding contrasts with the bi rt 
order effects found in the present study; it 
trasts also with the majority of findings reported 
by Schachter (1959). However, Schachter’s posi 
tive findings have been secured mainly when 


SUMMARY J a 

Sixty-four female undergraduates, 32 first-born” 
and 32 later-born, were given suggestions in the 
autokinetic situation. Before the Postsuggestio! 
trials, half the first-born and half the later-born | 
subjects were told that they would receive shock — 
when their judgments were in error (anxiety con- i 
dition). bike: 
In presuggestion trials first-born subjects re- 
sponded more rapidly than later-born subjects to 
the suggestion that the light would move. On the 
Postsuggestion trials first-born subjects under the 
anxiety condition were more suggestible than first 
born subjects under the nonanxiety condition. — 


\ CRITIQUE AND NOTES 


There was no significant difference between later- 
born subjects under the anxiety condition and 
later-born subjects under the nonanxiety condition. 

The findings were discussed with reference to 
hypotheses advanced by Schaclfter (1959) and 
Walters and Karal (1960) concerning antecedents 
of susceptibility to social influence. k 
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THE INFLUENCE OF VALENCE ON DISTANCE JUDGMENTS OF HUMAN FACES 
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K D 

ARLIER theories of perception, taking their 

cue from empiricist philosophy, made a 
sharp distinction between the descriptive properties 
of an object, which included colors and the metric 
properties of size, shape, and distance, and what 
one might call the ascriptive properties, which 
included expressive, physiognomic, or affective 
qualities. The latter, it was held, being invested 
in the object by dint of association, could be 
ignored from the strictly perceptual standpoint as 
belonging rather to the psychology of the emotions, 
Modern theories of perception no longer allow us 
to make such a clear cut dichotomy, if only be- 
cause it is now realized that there is interaction 
between both types of property. This is true even 
of inanimate objects but it is especially true of 
Person perception, where the problem is often to 
ascertain how, to quote Ittelson and Slack (1958), 
“the feelings and experiences of the observer 
influence the way in which he perceives the physical 
characteristics of the person.” 

These authors have discussed the theoretical 
issues in this field and reviewed the empirical 
findings from studies using aniseikonic lenses, the 
stereoscope, and the Ames Thereness-Thatness 
apparatus, (The latter is a means by which dis- 
tance judgments must be made under conditions 
of minimal cues.) 

‘The present study was generally inspired by this 
review and specifically by the studies of Smith 
(1953). Smith» verbally reinforcing the evident 
expressions on photographed faces, showed that 
“liked” people with pleasant expressions were 
Placed nearer the observer than “disliked” 
people with unpleasant expressions, in the There- 
ness-Thatness apparatus, He used selected photo» 
graphs from the Frois-Wittmann series, These 
have the advantage that the impression they make 
is known, but on the other hand Frois-Wittmann 
himself posed for all of them, and this must have 
been embarrassingly obvious to the subjects! 

The variable manipulated by Smith, called 
“emotional loading” by Ittelson and Slack (1958), 
can perhaps be more incisively thought of as 
valence. Then it becomes clear that the paradigm 

. ofa testing situation is indeed one involving dis- 
tance judgment, in that a stimulus object with 
Positive valence, which “attracts,” must be Placed 
too near, and one with negative, “repelling,” val- 
ence be placed too far, Parenthetically it is in- 
teresting to note that in his analysis of adjectives 
by the Semantic Differential, Osgood (1957) 
showed “near-far” to have an appreciable loading 


on he factor of Evaluation, and negligible load- “ 
ings on the other two factors. 

In exploring further in this field we too used the 
Thereness-Thatness apparatus to test our general ` 
proposition that faces hypothesized to have posi- 
tive*valence for our subjects, are set nearer the 
subjects than faces that have negative valence, _ 
Then, aiming to present a situation with stimuli 
both more natural and more realistic than Smith’s. 
(1953), we used principally photographs of people 
in public life, or personally known to the subjects, 
Valences were thus not manufactured in the labora- 
tory but were already part of the subjects’ general 
attitude structure. L 

The following are the several specific examples | 
of the general hypothesis investigated: 

1. As the primary test of the hypothesis we 
tested the influence of valence pure, unconfounded 
by physiognomic cues, in terms of our “Nonsuch” 
person, This was a photograph of a nondescript 
middle-aged man, to whom were ascribed admired 
and despised roles for alternate subjects, That is, 
half of them were told that he was a World Health 
Organization worker in Asia and the other half 
that he was a United States labor racketeer. If 
the identical stimulus when made positively valent 
would be set nearer than when negatively valent, 
this would be a crucial confirmation. 

2. Positive and negative attitudes already 
anchored to particular faces were investigated by 
Presenting people deemed widely popular in 
Britain (Prince Philip and President Eisenhower), 
contrasted with two definitely unpopular (a leader 
of the Communist Party in Hungary and a member 
of the Japanese General Staff from the Second 
World War). 

3. Authority figures have been shown by Wit- 
treich and Radcliffe (1956) to be perceived differ- 
entially. In our turn we predicted that an authority 
figure should have negative valence, in comparison 
with a group of peers, who would bè predicted to 
have approximately neutral valence. Hence a 
photograph of the Head of the Department was 

1 After some preliminary inquiry we found that 
among our students at least there were no present day 
public figures who would be both instantly recognized 
and abhorred. Hence the negative faces had to be iden- 
tified by the experimenters, i.e. as Erno Gerd, First 
Secretary of the Hungarian Communist Party and thus 
Tesponsible for the causes of the revolution and its 
brutal Suppression, and as a Japanese general executed 
for atrocities to Allied prisoners of war. We feel confi- 
dent, however, that this identification did call up exist- 
ing attitudes. 
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presented to our subjects (his students) to be 
compared with a group of six fellow students 
(those nearest each subject alphabetically by 
surname). 

4. That the self is responded to nore favorably 
in a hidden situation than comparable stimuli, has 
been shown by the present authors (Beloft & 
Beloff, 1959). And Ittelson and Slack (1958) 
report an unpublished study in which if was found 
that self-photographs were set “differently” from 
those of a stranger in thee Thereness-Thatness 
apparatus. We predicted more specifically that the 
self is positively valent, and should be set nearer 
the subjects than the average of the peer settings. 
Following Jourard and Secord (1955) we further 
predicted that this effect would be more striking 
in the women subjects. 


s METHOD 


Subjects 


‘Twenty-four men and 30 women attending elemen- 
tary psychology courses at Queen’s University served 
as subjects. They may be considered psychologically 
naive. 


Apparatus 


The Thereness-Thatness apparatus is described in 
detail by Ittelson (1952). We used the simple, mirror 
model version, the only noteworthy modification being 
that our table or field was shorter than the one de- 
scribed, being only 6’ in length. 

Essentially it consists of two fields of vision per- 
ceived as side by side by the subject. The comparison 
field, containing five comparison posts is viewed binocu- 
larly and presents relatively full distance cues. The 
experimental field consists of only the stimulus figure 
seemingly suspended in blackness, which is viewed 
monocularly. Under this condition of reduced cues, the 
stimulus must be adjusted to be level with the middle 


comparison post. 


Material 
black and white 
photographs, in which the variables of size of the face 


perimenters. 


Procedure 

The subject was brought into the completely dark- 
seated at the apparatus. At no 
point did he see the arrangement of the.apparatus. The 
viewing aperture having been open 
as follows: 

You see in front of you @ photograph of a face 
beside a series of five rods. We want you to turn the 
knob which moves the photograph back and forth 
until you are satisfied that it is level with the middle 


en 
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121 
TABLE 1 
MEAN ERROR IN DISTANCE JUDGMENT 
(inches) 
Stimulus Combined Men Women 
= 
Nonsuch +ve 9.08 
Nonsuch —ve 6.86 
Philip 8.62 9.20 8.45 
Eisenhower 8.52 7.72 9.44 
Gerö $ 6.86 8.00 6.18 
Japanese 6.69 7.40 6.34 
Professor 6.93 7.24 6.92 
Self 9.14 8.12 10.26 
Student mean 8.30 8.36 8,52 


Note. —All errors are in the direction of setting the stimulus 
nearer than the point of equality. 


TABLE 2 
MEAN DIFFERENCE IN DISTANCE JUDGMENTS 
Stimuli Combined Men ‘Women 
Sa a ler 
Nonsuch +-ve:Nonsuch 2.22* 
—ve 

Philip: Geré sa fb 1.20* | 2.27** 
Philip: Japanese 1,93** 1.80**| 2.11% 
Eisenhower: Gerd 1.66% | —0.28 | 3.26%" 
Eisenhower: Japanese 1.83** 0.32. | 3.10** 
Student mean: Professor 1.37** .12* | 1.60" 
Self:Student mean 0.84 —0.24 | 1.74* 


Note.—First stimulus predicted to be nearer than the second. 
One-tailed tests. 
*p < 05. 
op <0. 


rod. [Two practice stimuli were adjusted in this way.) 
We are going to show you a number of such faces. 
Some will be people you will recognize from public 
life, others will be people you actually know. This 
has been done just to add some variety to an other- 
wise monotonous task. Because your job each time 
is simply to adjust thé photograph until you judge it 
to be level with the middle post. 
Between each stimulus the viewing aperture was closed 
ànd the error in distance setting recorded in inches. 
All stimuli were presented in a random order, and 
further randomized with respect 
the point of equality (5, 10, or 15” away), and with 
respect to nearer than or further than, equality. 


RESULTS AND DISCUSSION 


From Table 1 it is seen that the subjects demon- 
strated consistent and strong constant errors in the 
direction of placing the stimuli too near themselves. 
This is of course, irrelevant to our hypothesis. In 
the apparatus the most important cue to distance 
is provided by some estimate of the absolute size 


of the stimulus card. We suspect then that the 


size, while in fact they were considerably 
‘Also from Table 1 we may construct the order 
of nearness to the subjects of the whole series of 
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photographs. Considering both men and women 
the order is: Nonsuch +ve, Philip, Eisenhower, 
Peers, Professor, Nonsuch —ve, Gerd, Japanese 
General. a ee 

The spegific hypotheses were tested by non- 
a for ihe oe reasons. The Wil- 
coxon matched-pairs’ signed-ranks tests was used 
for all comparisons, except that between the two 
Nonsuch responses where the Medians test, was 

‘applied. Table 2 gives the difference in mean’ 
distances between the appropriate stimuli and 
their levels of significance. Let us consider each of 
our hypotheses in turn. 

1,fn the case of the”nonsuch photograph where 
the variable of attitude acted without reinforce- 
‘ment of any possible physiognomic factors, the 
subjects who had been given the favorable label 
did put the photograph nearer themselves than 
those who had had the unfavorable one. The 
difference of 2.22” is significant at the .05 level. 
The basic premise has been confirmed. 

2. Of the two faces from public life which we 
predicted would evoke a positive valence only 
Prince Philip was placed significantly nearer than 
both the negative figures by both men and women. 
Eisenhower was distinguished from the negative 
figures by the women but not by the men. (Is it 
that they were impervious to his charm and con- 
sidered only his political record?)? 

3. Both men and women kept the authority 
figure “at a distance,” that is they set the Pro- 
fessor significantly further back than the average 
of the fellow students. 

4. We predicted that the self-photograph would 
be set nearer the subjects than the ayerage setting 
of the peers, and that this phenomenon would be 
more pronounted among the women. It is seen 
now that for men the difference in the two settings 
is indeed negligible, while for women it is significant 
in the predicted direction at the .05 level. 

In general we may conclude that women are 
influenced to a very significant extent by thè 
valence of the stimuli, even when they were mak- 
ing what seemed to them objective distance 
comparisons. This would appear to be the case to a 
lesser extent with men, 

Therefore the present study has once again 
shown that what we perceive is influenced in 
some degree by our attitude towards the object of 
perception. What is behind this effect? Because in 


? From Table 1 it is patent that there are no signifi- 
cant differences between Philip and Eisenhower, or 
between Gerd and the Japanese general. $ 
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our experimental arrangement the cues for distance 
judgment are in fact minimal and perception is 
thus ambiguous and fluid, we suggest that the 
variable of assumed size of the stimulus card, 
which we meéntioned before, was being uncon- 
sciously overlaid by the process shown most 
clearly in Smith’s (1953) “self-reference” setting, 
Here the subject is shown only the experimental 
field of the Thereness-Thatness apparatus and 
instructed to set the face simply where he would 
like, to see it in reletion to himself. This may be 
thought ofras making concrete the metaphor of 
social distance. In other words, the perceptual 
Tesponses we obtained were a compromise between 
an objective size-distance, and a self-reference 
setting. 


Sunaary 


An experiment was conducted to test the hy- 
pothesis that photographs of positively valent 
faces would be set nearer the subject than negative 
faces, in ostensibly objective distance judgments. 
The Ames Thereness-Thatness apparatus was used. 
Several examples of valent faces were compared, 
€g., popular vs. unpopular figures’ from public 
life, an authority figure vs. peers, and- self vs. 
peers. The responses of women subjects bore out 
the predictions, those of the men did not da so 
uniformly, 
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